ALLELES - Word Related Documents

#	Rank	Similarity	Title + Abs.	Year	PMID
0	1	2	3	4	5
8451	0	0.9956	Genome-wide analysis of NBS-encoding disease resistance genes in Cucumis sativus and phylogenetic study of NBS-encoding genes in Cucurbitaceae crops. BACKGROUND: Plant nucleotide-binding site (NBS)-leucine-rich repeat (LRR) proteins encoded by resistance genes play an important role in the responses of plants to various pathogens, including viruses, bacteria, fungi, and nematodes. In this study, a comprehensive analysis of NBS-encoding genes within the whole cucumber genome was performed, and the phylogenetic relationships of NBS-encoding resistance gene homologues (RGHs) belonging to six species in five genera of Cucurbitaceae crops were compared. RESULTS: Cucumber has relatively few NBS-encoding genes. Nevertheless, cucumber maintains genes belonging to both Toll/interleukine-1 receptor (TIR) and CC (coiled-coil) families. Eight commonly conserved motifs have been established in these two families which support the grouping into TIR and CC families. Moreover, three additional conserved motifs, namely, CNBS-1, CNBS-2 and TNBS-1, have been identified in sequences from CC and TIR families. Analyses of exon/intron configurations revealed that some intron loss or gain events occurred during the structural evolution between the two families. Phylogenetic analyses revealed that gene duplication, sequence divergence, and gene loss were proposed as the major modes of evolution of NBS-encoding genes in Cucurbitaceae species. Compared with NBS-encoding sequences from the Arabidopsis thaliana genome, the remaining seven TIR familes of NBS proteins and RGHs from Cucurbitaceae species have been shown to be phylogenetically distinct from the TIR family of NBS-encoding genes in Arabidopsis, except for two subfamilies (TIR4 and TIR9). On the other hand, in the CC-NBS family, they grouped closely with the CC family of NBS-encoding genes in Arabidopsis. Thus, the NBS-encoding genes in Cucurbitaceae crops are shown to be ancient, and NBS-encoding gene expansions (especially the TIR family) may have occurred before the divergence of Cucurbitaceae and Arabidopsis. CONCLUSION: The results of this paper will provide a genomic framework for the further isolation of candidate disease resistance NBS-encoding genes in cucumber, and contribute to the understanding of the evolutionary mode of NBS-encoding genes in Cucurbitaceae crops.	2013	23418910
8450	1	0.9956	Genome-wide mapping of NBS-LRR genes and their association with disease resistance in soybean. BACKGROUND: R genes are a key component of genetic interactions between plants and biotrophic bacteria and are known to regulate resistance against bacterial invasion. The most common R proteins contain a nucleotide-binding site and a leucine-rich repeat (NBS-LRR) domain. Some NBS-LRR genes in the soybean genome have also been reported to function in disease resistance. In this study, the number of NBS-LRR genes was found to correlate with the number of disease resistance quantitative trait loci (QTL) that flank these genes in each chromosome. NBS-LRR genes co-localized with disease resistance QTL. The study also addressed the functional redundancy of disease resistance on recently duplicated regions that harbor NBS-LRR genes and NBS-LRR gene expression in the bacterial leaf pustule (BLP)-induced soybean transcriptome. RESULTS: A total of 319 genes were determined to be putative NBS-LRR genes in the soybean genome. The number of NBS-LRR genes on each chromosome was highly correlated with the number of disease resistance QTL in the 2-Mb flanking regions of NBS-LRR genes. In addition, the recently duplicated regions contained duplicated NBS-LRR genes and duplicated disease resistance QTL, and possessed either an uneven or even number of NBS-LRR genes on each side. The significant difference in NBS-LRR gene expression between a resistant near-isogenic line (NIL) and a susceptible NIL after inoculation of Xanthomonas axonopodis pv. glycines supports the conjecture that NBS-LRR genes have disease resistance functions in the soybean genome. CONCLUSIONS: The number of NBS-LRR genes and disease resistance QTL in the 2-Mb flanking regions of each chromosome was significantly correlated, and several recently duplicated regions that contain NBS-LRR genes harbored disease resistance QTL for both sides. In addition, NBS-LRR gene expression was significantly different between the BLP-resistant NIL and the BLP-susceptible NIL in response to bacterial infection. From these observations, NBS-LRR genes are suggested to contribute to disease resistance in soybean. Moreover, we propose models for how NBS-LRR genes were duplicated, and apply Ks values for each NBS-LRR gene cluster.	2012	22877146
9073	2	0.9955	EpitoCore: Mining Conserved Epitope Vaccine Candidates in the Core Proteome of Multiple Bacteria Strains. In reverse vaccinology approaches, complete proteomes of bacteria are submitted to multiple computational prediction steps in order to filter proteins that are possible vaccine candidates. Most available tools perform such analysis only in a single strain, or a very limited number of strains. But the vast amount of genomic data had shown that most bacteria contain pangenomes, i.e., their genomic information contains core, conserved genes, and random accessory genes specific to each strain. Therefore, in reverse vaccinology methods it is of the utmost importance to define core proteins and core epitopes. EpitoCore is a decision-tree pipeline developed to fulfill that need. It provides surfaceome prediction of proteins from related strains, defines core proteins within those, calculate their immunogenicity, predicts epitopes for a given set of MHC alleles defined by the user, and then reports if epitopes are located extracellularly and if they are conserved among the core homologs. Pipeline performance is illustrated by mining peptide vaccine candidates in Mycobacterium avium hominissuis strains. From a total proteome of ~4,800 proteins per strain, EpitoCore predicted 103 highly immunogenic core homologs located at cell surface, many of those related to virulence and drug resistance. Conserved epitopes identified among these homologs allows the users to define sets of peptides with potential to immunize the largest coverage of tested HLA alleles using peptide-based vaccines. Therefore, EpitoCore is able to provide automated identification of conserved epitopes in bacterial pangenomic datasets.	2020	32431712
8446	3	0.9953	Genome-wide association study for resistance to Pseudomonas syringae pv. garcae in Coffea arabica. Bacteria halo blight (BHB), a coffee plant disease caused by Pseudomonas syringae pv. garcae, has been gaining importance in producing mountain regions and mild temperatures areas as well as in coffee nurseries. Most Coffea arabica cultivars are susceptible to this disease. In contrast, a great source of genetic diversity and resistance to BHB are found in C. arabica Ethiopian accessions. Aiming to identify quantitative trait nucleotides (QTNs) associated with resistance to BHB and the influence of these genomic regions during the domestication of C. arabica, we conducted an analysis of population structure and a Genome-Wide Association Study (GWAS). For this, we used genotyping by sequencing (GBS) and phenotyping for resistance to BHB of a panel with 120 C. arabica Ethiopian accessions from a historical FAO collection, 11 C. arabica cultivars, and the BA-10 genotype. Population structure analysis based on single-nucleotide polymorphisms (SNPs) markers showed that the 132 accessions are divided into 3 clusters: most wild Ethiopian accessions, domesticated Ethiopian accessions, and cultivars. GWAS, using the single-locus model MLM and the multi-locus models mrMLM, FASTmrMLM, FASTmrEMMA, and ISIS EM-BLASSO, identified 11 QTNs associated with resistance to BHB. Among these QTNs, the four with the highest values of association for resistance to BHB are linked to g000 (Chr_0_434_435) and g010741 genes, which are predicted to encode a serine/threonine-kinase protein and a nucleotide binding site leucine-rich repeat (NBS-LRR), respectively. These genes displayed a similar transcriptional downregulation profile in a C. arabica susceptible cultivar and in a C. arabica cultivar with quantitative resistance, when infected with P. syringae pv. garcae. However, peaks of upregulation were observed in a C. arabica cultivar with qualitative resistance, for both genes. Our results provide SNPs that have potential for application in Marker Assisted Selection (MAS) and expand our understanding about the complex genetic control of the resistance to BHB in C. arabica. In addition, the findings contribute to increasing understanding of the C. arabica domestication history.	2022	36330243
4345	4	0.9950	Comprehensive identification of single nucleotide polymorphisms associated with beta-lactam resistance within pneumococcal mosaic genes. Traditional genetic association studies are very difficult in bacteria, as the generally limited recombination leads to large linked haplotype blocks, confounding the identification of causative variants. Beta-lactam antibiotic resistance in Streptococcus pneumoniae arises readily as the bacteria can quickly incorporate DNA fragments encompassing variants that make the transformed strains resistant. However, the causative mutations themselves are embedded within larger recombined blocks, and previous studies have only analysed a limited number of isolates, leading to the description of "mosaic genes" as being responsible for resistance. By comparing a large number of genomes of beta-lactam susceptible and non-susceptible strains, the high frequency of recombination should break up these haplotype blocks and allow the use of genetic association approaches to identify individual causative variants. Here, we performed a genome-wide association study to identify single nucleotide polymorphisms (SNPs) and indels that could confer beta-lactam non-susceptibility using 3,085 Thai and 616 USA pneumococcal isolates as independent datasets for the variant discovery. The large sample sizes allowed us to narrow the source of beta-lactam non-susceptibility from long recombinant fragments down to much smaller loci comprised of discrete or linked SNPs. While some loci appear to be universal resistance determinants, contributing equally to non-susceptibility for at least two classes of beta-lactam antibiotics, some play a larger role in resistance to particular antibiotics. All of the identified loci have a highly non-uniform distribution in the populations. They are enriched not only in vaccine-targeted, but also non-vaccine-targeted lineages, which may raise clinical concerns. Identification of single nucleotide polymorphisms underlying resistance will be essential for future use of genome sequencing to predict antibiotic sensitivity in clinical microbiology.	2014	25101644
9070	5	0.9949	Automated annotation of mobile antibiotic resistance in Gram-negative bacteria: the Multiple Antibiotic Resistance Annotator (MARA) and database. BACKGROUND: Multiresistance in Gram-negative bacteria is often due to acquisition of several different antibiotic resistance genes, each associated with a different mobile genetic element, that tend to cluster together in complex conglomerations. Accurate, consistent annotation of resistance genes, the boundaries and fragments of mobile elements, and signatures of insertion, such as DR, facilitates comparative analysis of complex multiresistance regions and plasmids to better understand their evolution and how resistance genes spread. OBJECTIVES: To extend the Repository of Antibiotic resistance Cassettes (RAC) web site, which includes a database of 'features', and the Attacca automatic DNA annotation system, to encompass additional resistance genes and all types of associated mobile elements. METHODS: Antibiotic resistance genes and mobile elements were added to RAC, from existing registries where possible. Attacca grammars were extended to accommodate the expanded database, to allow overlapping features to be annotated and to identify and annotate features such as composite transposons and DR. RESULTS: The Multiple Antibiotic Resistance Annotator (MARA) database includes antibiotic resistance genes and selected mobile elements from Gram-negative bacteria, distinguishing important variants. Sequences can be submitted to the MARA web site for annotation. A list of positions and orientations of annotated features, indicating those that are truncated, DR and potential composite transposons is provided for each sequence, as well as a diagram showing annotated features approximately to scale. CONCLUSIONS: The MARA web site (http://mara.spokade.com) provides a comprehensive database for mobile antibiotic resistance in Gram-negative bacteria and accurately annotates resistance genes and associated mobile elements in submitted sequences to facilitate comparative analysis.	2018	29373760
9072	6	0.9948	PanGeT: Pan-genomics tool. A decade after the concept of Pan-genome was first introduced; research in this field has spread its tentacles to areas such as pathogenesis of diseases, bacterial evolutionary studies and drug resistance. Gene content-based differentiation of virulent and a virulent strains of bacteria and identification of pathogen specific genes is imperative to understand their physiology and gain insights into the mechanism of genome evolution. Subsequently, this will aid in identifying diagnostic targets and in developing and selecting vaccines. The root of pan-genomic studies, however, is to identify the core genes, dispensable genes and strain specific genes across the genomes belonging to a clade. To this end, we have developed a tool, "PanGeT - Pan-genomics Tool" to compute the 'pan-genome' based on comparisons at the genome as well as the proteome levels. This automated tool is implemented using LaTeX libraries for effective visualization of overall pan-genome through graphical plots. Links to retrieve sequence information and functional annotations have also been provided. PanGeT can be downloaded from http://pranag.physics.iisc.ernet.in/PanGeT/ or https://github.com/PanGeTv1/PanGeT.	2017	27851981
8393	7	0.9948	The draft genome of whitefly Bemisia tabaci MEAM1, a global crop pest, provides novel insights into virus transmission, host adaptation, and insecticide resistance. BACKGROUND: The whitefly Bemisia tabaci (Hemiptera: Aleyrodidae) is among the 100 worst invasive species in the world. As one of the most important crop pests and virus vectors, B. tabaci causes substantial crop losses and poses a serious threat to global food security. RESULTS: We report the 615-Mb high-quality genome sequence of B. tabaci Middle East-Asia Minor 1 (MEAM1), the first genome sequence in the Aleyrodidae family, which contains 15,664 protein-coding genes. The B. tabaci genome is highly divergent from other sequenced hemipteran genomes, sharing no detectable synteny. A number of known detoxification gene families, including cytochrome P450s and UDP-glucuronosyltransferases, are significantly expanded in B. tabaci. Other expanded gene families, including cathepsins, large clusters of tandemly duplicated B. tabaci-specific genes, and phosphatidylethanolamine-binding proteins (PEBPs), were found to be associated with virus acquisition and transmission and/or insecticide resistance, likely contributing to the global invasiveness and efficient virus transmission capacity of B. tabaci. The presence of 142 horizontally transferred genes from bacteria or fungi in the B. tabaci genome, including genes encoding hopanoid/sterol synthesis and xenobiotic detoxification enzymes that are not present in other insects, offers novel insights into the unique biological adaptations of this insect such as polyphagy and insecticide resistance. Interestingly, two adjacent bacterial pantothenate biosynthesis genes, panB and panC, have been co-transferred into B. tabaci and fused into a single gene that has acquired introns during its evolution. CONCLUSIONS: The B. tabaci genome contains numerous genetic novelties, including expansions in gene families associated with insecticide resistance, detoxification and virus transmission, as well as numerous horizontally transferred genes from bacteria and fungi. We believe these novelties likely have shaped B. tabaci as a highly invasive polyphagous crop pest and efficient vector of plant viruses. The genome serves as a reference for resolving the B. tabaci cryptic species complex, understanding fundamental biological novelties, and providing valuable genetic information to assist the development of novel strategies for controlling whiteflies and the viruses they transmit.	2016	27974049
3767	8	0.9948	Transposon insertion sequencing reveals novel hypermutator genes in Acinetobacter baumannii. Mutation rates in bacteria are an important determinant of adaptation to new environments and success in different niches. In some bacterial pathogens, "hypermutator" variants-most often associated with mutations in components of the DNA mismatch repair system-are associated with increased antibiotic resistance and poorer patient outcomes. We report the serendipitous finding of novel hypermutator genes in Acinetobacter baumannii through genome-scale mutant fitness screening. Exposure of a transposon insertion mutant library of A. baumannii to extended weak antibiotic selection resulted in selection for mutations that directly increased fitness as expected, but also revealed genes where transposon insertion indirectly increased fitness due to elevated general mutation rates. Three novel hypermutator genes were confirmed in A. baumannii: nusB, encoding a transcription antiterminator; ABUW_0208, encoding a hypothetical protein; and ABUW_2121, which encodes a sulfite transporter. We find selection for hypermutator variants in transposon insertion sequencing (TIS) data sets from diverse bacteria under various antibiotic treatments. Our results expand the range of biological functions linked to hypermutator phenotypes in bacteria and provide a workflow for the identification of putative hypermutators by TIS.IMPORTANCEAll organisms have the capacity for evolution through mutation. Bacteria with high mutation rates have a survival advantage in some stressful environments because they generate beneficial mutations more frequently. "Hypermutators" are bacterial strains that carry gene inactivations that increase general mutation rates. These variants are important in chronic infections, as their increased genetic diversity allows higher drug resistance and prolonged survival in the host. Only a few different hypermutator genes are known, and there is no high-throughput method for their identification. We have made the serendipitous finding that hypermutator genes can be identified by genome-wide mutant fitness screening under specific selection conditions. We have identified novel hypermutator alleles in the notorious hospital pathogen Acinetobacter baumannii and show that hypermutator variants can be detected in screens of a wide range of pathogens.	2025	40576344
8459	9	0.9948	A physical map of traits of agronomic importance based on potato and tomato genome sequences. Potato, tomato, pepper, and eggplant are worldwide important crop and vegetable species of the Solanaceae family. Molecular linkage maps of these plants have been constructed and used to map qualitative and quantitative traits of agronomic importance. This research has been undertaken with the vision to identify the molecular basis of agronomic characters on the one hand, and on the other hand, to assist the selection of improved varieties in breeding programs by providing DNA-based markers that are diagnostic for specific agronomic characters. Since 2011, whole genome sequences of tomato and potato became available in public databases. They were used to combine the results of several hundred mapping and map-based cloning studies of phenotypic characters between 1988 and 2022 in physical maps of the twelve tomato and potato chromosomes. The traits evaluated were qualitative and quantitative resistance to pathogenic oomycetes, fungi, bacteria, viruses, nematodes, and insects. Furthermore, quantitative trait loci for yield and sugar content of tomato fruits and potato tubers and maturity or earliness were physically mapped. Cloned genes for pathogen resistance, a few genes underlying quantitative trait loci for yield, sugar content, and maturity, and several hundred candidate genes for these traits were included in the physical maps. The comparison between the physical chromosome maps revealed, in addition to known intrachromosomal inversions, several additional inversions and translocations between the otherwise highly collinear tomato and potato genomes. The integration of the positional information from independent mapping studies revealed the colocalization of qualitative and quantitative loci for resistance to different types of pathogens, called resistance hotspots, suggesting a similar molecular basis. Synteny between potato and tomato with respect to genomic positions of quantitative trait loci was frequently observed, indicating eventual similarity between the underlying genes.	2023	37564870
4453	10	0.9947	dfrA trimethoprim resistance genes found in Gram-negative bacteria: compilation and unambiguous numbering. To track the spread of antibiotic resistance genes, accurate identification of individual genes is essential. Acquired trimethoprim resistance genes encoding trimethoprim-insensitive homologues of the sensitive dihydrofolate reductases encoded by the folA genes of bacteria are increasingly found in genome sequences. However, naming and numbering in publicly available records (journal publications or entries in the GenBank non-redundant DNA database) has not always been unambiguous. In addition, the nomenclature has evolved over time. Here, the changes in nomenclature and the most commonly encountered problems and pitfalls affecting dfrA gene identification arising from historically incorrect or inaccurate numbering are explained. The complete set of dfrA genes/DfrA proteins found in Gram-negative bacteria for which readily searchable sequence information is currently available has been compiled using less than 98% identity for both the gene and the derived protein sequence as the criteria for assignment of a new number. In most cases, trimethoprim resistance has been demonstrated. The gene context, predominantly in a gene cassette or near the ori end of CR1 or CR2, is also covered. The RefSeq database that underpins the programs used to automatically identify resistance genes in genome data sets has been curated to assign all sequences listed to the correct number. This led to the assignment of corrected or new gene numbers to several mis-assigned sequences. The unique numbers assigned for the dfrA/DfrA set are now listed in the RefSeq database, which we propose provides a way forward that should end future duplication of numbers and the confusion that causes.	2021	34180526
243	11	0.9947	Phylogenetic distribution of translational GTPases in bacteria. BACKGROUND: Translational GTPases are a family of proteins in which GTPase activity is stimulated by the large ribosomal subunit. Conserved sequence features allow members of this family to be identified. RESULTS: To achieve accurate protein identification and grouping we have developed a method combining searches with Hidden Markov Model profiles and tree based grouping. We found all the genes for translational GTPases in 191 fully sequenced bacterial genomes. The protein sequences were grouped into nine subfamilies. Analysis of the results shows that three translational GTPases, the translation factors EF-Tu, EF-G and IF2, are present in all organisms examined. In addition, several copies of the genes encoding EF-Tu and EF-G are present in some genomes. In the case of multiple genes for EF-Tu, the gene copies are nearly identical; in the case of multiple EF-G genes, the gene copies have been considerably diverged. The fourth translational GTPase, LepA, the function of which is currently unknown, is also nearly universally conserved in bacteria, being absent from only one organism out of the 191 analyzed. The translation regulator, TypA, is also present in most of the organisms examined, being absent only from bacteria with small genomes.Surprisingly, some of the well studied translational GTPases are present only in a very small number of bacteria. The translation termination factor RF3 is absent from many groups of bacteria with both small and large genomes. The specialized translation factor for selenocysteine incorporation--SelB--was found in only 39 organisms. Similarly, the tetracycline resistance proteins (Tet) are present only in a small number of species. Proteins of the CysN/NodQ subfamily have acquired functions in sulfur metabolism and production of signaling molecules. The genes coding for CysN/NodQ proteins were found in 74 genomes. This protein subfamily is not confined to Proteobacteria, as suggested previously but present also in many other groups of bacteria. CONCLUSION: Four of the translational GTPase subfamilies (IF2, EF-Tu, EF-G and LepA) are represented by at least one member in each bacterium studied, with one exception in LepA. This defines the set of translational GTPases essential for basic cell functions.	2007	17214893
8405	12	0.9947	Mapping Major Disease Resistance Genes in Soybean by Genome-Wide Association Studies. Soybean is one of the most valuable agricultural crops in the world. Besides, this legume is constantly attacked by a wide range of pathogens (fungi, bacteria, viruses, and nematodes) compromising yield and increasing production costs. One of the major disease management strategies is the genetic resistance provided by single genes and quantitative trait loci (QTL). Identifying the genomic regions underlying the resistance against these pathogens on soybean is one of the first steps performed by molecular breeders. In the past, genetic mapping studies have been widely used to discover these genomic regions. However, over the last decade, advances in next-generation sequencing technologies and their subsequent cost decreasing led to the development of cost-effective approaches to high-throughput genotyping. Thus, genome-wide association studies applying thousands of SNPs in large sets composed of diverse soybean accessions have been successfully done. In this chapter, a comprehensive review of the majority of GWAS for soybean diseases published since this approach was developed is provided. Important diseases caused by Heterodera glycines, Phytophthora sojae, and Sclerotinia sclerotiorum have been the focus of the several GWAS. However, other bacterial and fungi diseases also have been targets of GWAS. As such, this GWAS summary can serve as a guide for future studies of these diseases. The protocol begins by describing several considerations about the pathogens and bringing different procedures of molecular characterization of them. Advice to choose the best isolate/race to maximize the discovery of multiple R genes or to directly map an effective R gene is provided. A summary of protocols, methods, and tools to phenotyping the soybean panel is given to several diseases. We also give details of options of DNA extraction protocols and genotyping methods, and we describe parameters of SNP quality to soybean data. Websites and their online tools to obtain genotypic and phenotypic data for thousands of soybean accessions are highlighted. Finally, we report several tricks and tips in Subheading 4, especially related to composing the soybean panel as well as generating and analyzing the phenotype data. We hope this protocol will be helpful to achieve GWAS success in identifying resistance genes on soybean.	2022	35641772
64	13	0.9947	Mutational analysis of the Arabidopsis RPS2 disease resistance gene and the corresponding pseudomonas syringae avrRpt2 avirulence gene. Plants have evolved a large number of disease resistance genes that encode proteins containing conserved structural motifs that function to recognize pathogen signals and to initiate defense responses. The Arabidopsis RPS2 gene encodes a protein representative of the nucleotide-binding site-leucine-rich repeat (NBS-LRR) class of plant resistance proteins. RPS2 specifically recognizes Pseudomonas syringae pv. tomato strains expressing the avrRpt2 gene and initiates defense responses to bacteria carrying avrRpt2, including a hypersensitive cell death response (HR). We present an in planta mutagenesis experiment that resulted in the isolation of a series of rps2 and avrRpt2 alleles that disrupt the RPS2-avrRpt2 gene-for-gene interaction. Seven novel avrRpt2 alleles incapable of eliciting an RPS2-dependent HR all encode proteins with lesions in the C-terminal portion of AvrRpt2 previously shown to be sufficient for RPS2 recognition. Ten novel rps2 alleles were characterized with mutations in the NBS and the LRR. Several of these alleles code for point mutations in motifs that are conserved among NBS-LRR resistance genes, including the third LRR, which suggests the importance of these motifs for resistance gene function.	2001	11204781
3771	14	0.9947	RFPlasmid: predicting plasmid sequences from short-read assembly data using machine learning. Antimicrobial-resistance (AMR) genes in bacteria are often carried on plasmids and these plasmids can transfer AMR genes between bacteria. For molecular epidemiology purposes and risk assessment, it is important to know whether the genes are located on highly transferable plasmids or in the more stable chromosomes. However, draft whole-genome sequences are fragmented, making it difficult to discriminate plasmid and chromosomal contigs. Current methods that predict plasmid sequences from draft genome sequences rely on single features, like k-mer composition, circularity of the DNA molecule, copy number or sequence identity to plasmid replication genes, all of which have their drawbacks, especially when faced with large single-copy plasmids, which often carry resistance genes. With our newly developed prediction tool RFPlasmid, we use a combination of multiple features, including k-mer composition and databases with plasmid and chromosomal marker proteins, to predict whether the likely source of a contig is plasmid or chromosomal. The tool RFPlasmid supports models for 17 different bacterial taxa, including Campylobacter, Escherichia coli and Salmonella, and has a taxon agnostic model for metagenomic assemblies or unsupported organisms. RFPlasmid is available both as a standalone tool and via a web interface.	2021	34846288
9071	15	0.9947	RAC: Repository of Antibiotic resistance Cassettes. Antibiotic resistance in bacteria is often due to acquisition of resistance genes associated with different mobile genetic elements. In Gram-negative bacteria, many resistance genes are found as part of small mobile genetic elements called gene cassettes, generally found integrated into larger elements called integrons. Integrons carrying antibiotic resistance gene cassettes are often associated with mobile elements and here are designated 'mobile resistance integrons' (MRIs). More than one cassette can be inserted in the same integron to create arrays that contribute to the spread of multi-resistance. In many sequences in databases such as GenBank, only the genes within cassettes, rather than whole cassettes, are annotated and the same gene/cassette may be given different names in different entries, hampering analysis. We have developed the Repository of Antibiotic resistance Cassettes (RAC) website to provide an archive of gene cassettes that includes alternative gene names from multiple nomenclature systems and allows the community to contribute new cassettes. RAC also offers an additional function that allows users to submit sequences containing cassettes or arrays for annotation using the automatic annotation system Attacca. Attacca recognizes features (gene cassettes, integron regions) and identifies cassette arrays as patterns of features and can also distinguish minor cassette variants that may encode different resistance phenotypes (aacA4 cassettes and bla cassettes-encoding β-lactamases). Gaps in annotations are manually reviewed and those found to correspond to novel cassettes are assigned unique names. While there are other websites dedicated to integrons or antibiotic resistance genes, none includes a complete list of antibiotic resistance gene cassettes in MRI or offers consistent annotation and appropriate naming of all of these cassettes in submitted sequences. RAC thus provides a unique resource for researchers, which should reduce confusion and improve the quality of annotations of gene cassettes in integrons associated with antibiotic resistance. DATABASE URL: http://www2.chi.unsw.edu.au/rac.	2011	22140215
8759	16	0.9947	Genetic and transcriptomic dissection of host defense to Goss's bacterial wilt and leaf blight of maize. Goss's wilt, caused by the Gram-positive actinobacterium Clavibacter nebraskensis, is an important bacterial disease of maize. The molecular and genetic mechanisms of resistance to the bacterium, or, in general, Gram-positive bacteria causing plant diseases, remain poorly understood. Here, we examined the genetic basis of Goss's wilt through differential gene expression, standard genome-wide association mapping (GWAS), extreme phenotype (XP) GWAS using highly resistant (R) and highly susceptible (S) lines, and quantitative trait locus (QTL) mapping using 3 bi-parental populations, identifying 11 disease association loci. Three loci were validated using near-isogenic lines or recombinant inbred lines. Our analysis indicates that Goss's wilt resistance is highly complex and major resistance genes are not commonly present. RNA sequencing of samples separately pooled from R and S lines with or without bacterial inoculation was performed, enabling identification of common and differential gene responses in R and S lines. Based on expression, in both R and S lines, the photosynthesis pathway was silenced upon infection, while stress-responsive pathways and phytohormone pathways, namely, abscisic acid, auxin, ethylene, jasmonate, and gibberellin, were markedly activated. In addition, 65 genes showed differential responses (up- or down-regulated) to infection in R and S lines. Combining genetic mapping and transcriptional data, individual candidate genes conferring Goss's wilt resistance were identified. Collectively, aspects of the genetic architecture of Goss's wilt resistance were revealed, providing foundational data for mechanistic studies.	2023	37652038
8367	17	0.9946	A hybrid NRPS-PKS gene cluster related to the bleomycin family of antitumor antibiotics in Alteromonas macleodii strains. Although numerous marine bacteria are known to produce antibiotics via hybrid NRPS-PKS gene clusters, none have been previously described in an Alteromonas species. In this study, we describe in detail a novel hybrid NRPS-PKS cluster identified in the plasmid of the Alteromonasmacleodii strain AltDE1 and analyze its relatedness to other similar gene clusters in a sequence-based characterization. This is a mobile cluster, flanked by transposase-like genes, that has even been found inserted into the chromosome of some Alteromonasmacleodii strains. The cluster contains separate genes for NRPS and PKS activity. The sole PKS gene appears to carry a novel acyltransferase domain, quite divergent from those currently characterized. The predicted specificities of the adenylation domains of the NRPS genes suggest that the final compound has a backbone very similar to bleomycin related compounds. However, the lack of genes involved in sugar biosynthesis indicates that the final product is not a glycopeptide. Even in the absence of these genes, the presence of the cluster appears to confer complete or partial resistance to phleomycin, which may be attributed to a bleomycin-resistance-like protein identified within the cluster. This also suggests that the compound still shares significant structural similarity to bleomycin. Moreover, transcriptomic evidence indicates that the NRPS-PKS cluster is expressed. Such sequence-based approaches will be crucial to fully explore and analyze the diversity and potential of secondary metabolite production, especially from increasingly important sources like marine microbes.	2013	24069455
4342	18	0.9946	Evolution and diversity of clonal bacteria: the paradigm of Mycobacterium tuberculosis. BACKGROUND: Mycobacterium tuberculosis complex species display relatively static genomes and 99.9% nucleotide sequence identity. Studying the evolutionary history of such monomorphic bacteria is a difficult and challenging task. PRINCIPAL FINDINGS: We found that single-nucleotide polymorphism (SNP) analysis of DNA repair, recombination and replication (3R) genes in a comprehensive selection of M. tuberculosis complex strains from across the world, yielded surprisingly high levels of polymorphisms as compared to house-keeping genes, making it possible to distinguish between 80% of clinical isolates analyzed in this study. Bioinformatics analysis suggests that a large number of these polymorphisms are potentially deleterious. Site frequency spectrum comparison of synonymous and non-synonymous variants and Ka/Ks ratio analysis suggest a general negative/purifying selection acting on these sets of genes that may lead to suboptimal 3R system activity. In turn, the relaxed fidelity of 3R genes may allow the occurrence of adaptive variants, some of which will survive. Furthermore, 3R-based phylogenetic trees are a new tool for distinguishing between M. tuberculosis complex strains. CONCLUSIONS/SIGNIFICANCE: This situation, and the consequent lack of fidelity in genome maintenance, may serve as a starting point for the evolution of antibiotic resistance, fitness for survival and pathogenicity, possibly conferring a selective advantage in certain stressful situations. These findings suggest that 3R genes may play an important role in the evolution of highly clonal bacteria, such as M. tuberculosis. They also facilitate further epidemiological studies of these bacteria, through the development of high-resolution tools. With many more microbial genomes being sequenced, our results open the door to 3R gene-based studies of adaptation and evolution of other, highly clonal bacteria.	2008	18253486
9987	19	0.9946	Four genes essential for recombination define GInts, a new type of mobile genomic island widespread in bacteria. Integrases are a family of tyrosine recombinases that are highly abundant in bacterial genomes, actively disseminating adaptive characters such as pathogenicity determinants and antibiotics resistance. Using comparative genomics and functional assays, we identified a novel type of mobile genetic element, the GInt, in many diverse bacterial groups but not in archaea. Integrated as genomic islands, GInts show a tripartite structure consisting of the ginABCD operon, a cargo DNA region from 2.5 to at least 70 kb, and a short AT-rich 3' end. The gin operon is characteristic of GInts and codes for three putative integrases and a small putative helix-loop-helix protein, all of which are essential for integration and excision of the element. Genes in the cargo DNA are acquired mostly from phylogenetically related bacteria and often code for traits that might increase fitness, such as resistance to antimicrobials or virulence. GInts also tend to capture clusters of genes involved in complex processes, such as the biosynthesis of phaseolotoxin by Pseudomonas syringae. GInts integrate site-specifically, generating two flanking direct imperfect repeats, and excise forming circular molecules. The excision process generates sequence variants at the element attachment site, which can increase frequency of integration and drive target specificity.	2017	28393892