# | Rank | Similarity | Title + Abs. | Year | PMID |
|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 |
| 9079 | 0 | 1.0000 | Review, Evaluation, and Directions for Gene-Targeted Assembly for Ecological Analyses of Metagenomes. Shotgun metagenomics has greatly advanced our understanding of microbial communities over the last decade. Metagenomic analyses often include assembly and genome binning, computationally daunting tasks especially for big data from complex environments such as soil and sediments. In many studies, however, only a subset of genes and pathways involved in specific functions are of interest; thus, it is not necessary to attempt global assembly. In addition, methods that target genes can be computationally more efficient and produce more accurate assembly by leveraging rich databases, especially for those genes that are of broad interest such as those involved in biogeochemical cycles, biodegradation, and antibiotic resistance or used as phylogenetic markers. Here, we review six gene-targeted assemblers with unique algorithms for extracting and/or assembling targeted genes: Xander, MegaGTA, SAT-Assembler, HMM-GRASPx, GenSeed-HMM, and MEGAN. We tested these tools using two datasets with known genomes, a synthetic community of artificial reads derived from the genomes of 17 bacteria, shotgun sequence data from a mock community with 48 bacteria and 16 archaea genomes, and a large soil shotgun metagenomic dataset. We compared assemblies of a universal single copy gene (rplB) and two N cycle genes (nifH and nirK). We measured their computational efficiency, sensitivity, specificity, and chimera rate and found Xander and MegaGTA, which both use a probabilistic graph structure to model the genes, have the best overall performance with all three datasets, although MEGAN, a reference matching assembler, had better sensitivity with synthetic and mock community members chosen from its reference collection. Also, Xander and MegaGTA are the only tools that include post-assembly scripts tuned for common molecular ecology and diversity analyses. Additionally, we provide a mathematical model for estimating the probability of assembling targeted genes in a metagenome for estimating required sequencing depth. | 2019 | 31749830 |
| 9669 | 1 | 0.9994 | Genomic encyclopedia of bacteria and archaea: sequencing a myriad of type strains. Microbes hold the key to life. They hold the secrets to our past (as the descendants of the earliest forms of life) and the prospects for our future (as we mine their genes for solutions to some of the planet's most pressing problems, from global warming to antibiotic resistance). However, the piecemeal approach that has defined efforts to study microbial genetic diversity for over 20 years and in over 30,000 genome projects risks squandering that promise. These efforts have covered less than 20% of the diversity of the cultured archaeal and bacterial species, which represent just 15% of the overall known prokaryotic diversity. Here we call for the funding of a systematic effort to produce a comprehensive genomic catalog of all cultured Bacteria and Archaea by sequencing, where available, the type strain of each species with a validly published name (currently∼11,000). This effort will provide an unprecedented level of coverage of our planet's genetic diversity, allow for the large-scale discovery of novel genes and functions, and lead to an improved understanding of microbial evolution and function in the environment. | 2014 | 25093819 |
| 5101 | 2 | 0.9993 | Identification of Key Features Pivotal to the Characteristics and Functions of Gut Bacteria Taxa through Machine Learning Methods. BACKGROUND: Gut bacteria critically influence digestion, facilitate the breakdown of complex food substances, aid in essential nutrient synthesis, and contribute to immune system balance. However, current knowledge regarding intestinal bacteria remains insufficient. OBJECTIVE: This study aims to discover essential differences for different intestinal bacteria. METHODS: This study was conducted by investigating a total of 1478 gut bacterial samples comprising 235 Actinobacteria, 447 Bacteroidetes, and 796 Firmicutes, by utilizing sophisticated machine learning algorithms. By building on the dataset provided by Chen et al., we engaged sophisticated machine learning techniques to further investigate and analyze the gut bacterial samples. Each sample in the dataset was described by 993 unique features associated with gut bacteria, including 342 features annotated by the Antibiotic Resistance Genes Database, Comprehensive Antibiotic Research Database, Kyoto Encyclopedia of Genes and Genomes, and Virulence Factors of Pathogenic Bacteria. We employed incremental feature selection methods within a computational framework to identify the optimal features for classification. RESULTS: Eleven feature ranking algorithms selected several key features as pivotal to the characteristics and functions of gut bacteria. These features appear to facilitate the identification of specific gut bacterial species. Additionally, we established quantitative rules for identifying Actinobacteria, Bacteroidetes, and Firmicutes. CONCLUSION: This research underscores the significant potential of machine learning in studying gut microbes and enhances our understanding of the multifaceted roles of gut bacteria. | 2025 | 40671232 |
| 9080 | 3 | 0.9993 | Comparison of de-novo assembly tools for plasmid metagenome analysis. BACKGROUND: With the advent of next-generation sequencing techniques, culture-independent metagenome approaches have now made it possible to predict possible presence of genes in the environmental bacteria most of which may be non-cultivable. Short reads obtained from the deep sequencing can be assembled into long contigs some of which include plasmids. Plasmids are the circular double stranded DNA in bacteria and known as one of the major carriers of antibiotic resistance genes. OBJECTIVE: Metagenomic analyses, especially focused on plasmids, could help us predict dissemination mechanisms of antibiotic resistance genes in the environment. However, with the availability of a myriad of metagenomic assemblers, the selection of the most appropriate metagenome assembler for the plasmid metagenome study might be challenging. Therefore, in this study, we compared five open source assemblers to suggest most effective way of plasmid metagenome analysis. METHODS: IDBA-UD, MEGAHIT, SPAdes, SOAPdenovo2, and Velvet are compared for conducting plasmid metagenome analyses using two water samples. RESULTS: Our results clearly showed that abundance and types of antibiotic resistance genes on plasmids varied depending on the selection of assembly tools. IDBA-UD and MEGAHIT demonstrated the overall best assembly statistics with high N50 values with higher portion of longer contigs. CONCLUSION: These two assemblers also detected more diverse plasmids. Among the two, MEGAHIT showed more memory efficient assembly, therefore we suggest that the use of MEGAHIT for plasmid metagenome analysis may offer more diverse plasmids with less computer resource required. Here, we also summarized a fundamental plasmid metagenome work flow, especially for antibiotic resistance gene investigation. | 2019 | 31187446 |
| 8399 | 4 | 0.9992 | SYN-View: A Phylogeny-Based Synteny Exploration Tool for the Identification of Gene Clusters Linked to Antibiotic Resistance. The development of new antibacterial drugs has become one of the most important tasks of the century in order to overcome the posing threat of drug resistance in pathogenic bacteria. Many antibiotics originate from natural products produced by various microorganisms. Over the last decades, bioinformatical approaches have facilitated the discovery and characterization of these small compounds using genome mining methodologies. A key part of this process is the identification of the most promising biosynthetic gene clusters (BGCs), which encode novel natural products. In 2017, the Antibiotic Resistant Target Seeker (ARTS) was developed in order to enable an automated target-directed genome mining approach. ARTS identifies possible resistant target genes within antibiotic gene clusters, in order to detect promising BGCs encoding antibiotics with novel modes of action. Although ARTS can predict promising targets based on multiple criteria, it provides little information about the cluster structures of possible resistant genes. Here, we present SYN-view. Based on a phylogenetic approach, SYN-view allows for easy comparison of gene clusters of interest and distinguishing genes with regular housekeeping functions from genes functioning as antibiotic resistant targets. Our aim is to implement our proposed method into the ARTS web-server, further improving the target-directed genome mining strategy of the ARTS pipeline. | 2020 | 33396183 |
| 9670 | 5 | 0.9992 | An Approach to In Silico Dissection of Bacterial Intelligence Through Selective Genomic Tools. All the genetic potential and the intelligence a bacteria can showcase in a given environment are embedded in its genome. In this study, we have presented systematic guidelines to understand a bacterial genome with the relevant set of in silico tools using a novel bacteria as an example. This study presents a multi-dimensional approach from genome annotation to tracing genes and their network of metabolism operating in an organism. It also shows how the sequence can be used to mine the enzymes and construction of its 3-dimensional structure so that its functional behavior can be predicted and compared. The discriminating algorithm allows analysis of the promoter region and provides the insight in the regulation of genes in spite of the similarity in its sequences. The ecological niche specific bacterial behavior and adapted altered physiology can be understood through the presence of secondary metabolite, antibiotic resistance genes, and viral genes; and it helps in the valorization of genetic information for developing new biological application/processes. This study provides an in silico work plan and necessary steps for genome analysis of novel bacteria without any rigorous wet lab experiments. | 2018 | 30013271 |
| 7698 | 6 | 0.9992 | Detecting horizontal gene transfer with metagenomics co-barcoding sequencing. Horizontal gene transfer (HGT) is the process through which genetic information is transferred between different genomes and that played a crucial role in bacterial evolution. HGT can enable bacteria to rapidly acquire antibiotic resistance and bacteria that have acquired resistance is spreading within the microbiome. Conventional methods of characterizing HGT patterns include short-read metagenomic sequencing (short-reads mNGS), long-read sequencing, and single-cell sequencing. These approaches present several limitations, such as short-read fragments, high amounts of input DNA, and sequencing costs, respectively. Here, we attempt to circumvent present limitations to detect HGT by developing a metagenomics co-barcode sequencing workflow (MECOS) and applying it to the human and mouse gut microbiomes. In addition to that, we have over 10-fold increased contig length compared to short-reads mNGS; we also obtained exceeding 30 million paired reads with co-barcode information. Applying the novel bioinformatic pipeline, we integrated this co-barcoding information and the context information from long reads, and observed over 50-fold HGT events after we corrected the potential wrong HGT events. Specifically, we detected approximately 3,000 HGT blocks in individual samples, encompassing ~6,000 genes and ~100 taxonomic groups, including loci conferring tetracycline resistance through ribosomal protection. MECOS provides a valuable tool for investigating HGT and advance our understanding on the evolution of natural microbial communities within hosts.IMPORTANCEIn this study, to better identify horizontal gene transfer (HGT) in individual samples, we introduce a new co-barcoding sequencing system called metagenomics co-barcoding sequencing (MECOS), which has three significant improvements: (i) long DNA fragment extraction, (ii) a special transposome insertion, (iii) hybridization of DNA to barcode beads, and (4) an integrated bioinformatic pipeline. Using our approach, we have over 10-fold increased contig length compared to short-reads mNGS, and observed over 50-fold HGT events after we corrected the potential wrong HGT events. Our results indicate the presence of approximately 3,000 HGT blocks, involving roughly 6,000 genes and 100 taxonomic groups in individual samples. Notably, these HGT events are predominantly enriched in genes that confer tetracycline resistance via ribosomal protection. MECOS is a useful tool for investigating HGT and the evolution of natural microbial communities within hosts, thereby advancing our understanding of microbial ecology and evolution. | 2024 | 38315121 |
| 3776 | 7 | 0.9992 | FARME DB: a functional antibiotic resistance element database. Antibiotic resistance (AR) is a major global public health threat but few resources exist that catalog AR genes outside of a clinical context. Current AR sequence databases are assembled almost exclusively from genomic sequences derived from clinical bacterial isolates and thus do not include many microbial sequences derived from environmental samples that confer resistance in functional metagenomic studies. These environmental metagenomic sequences often show little or no similarity to AR sequences from clinical isolates using standard classification criteria. In addition, existing AR databases provide no information about flanking sequences containing regulatory or mobile genetic elements. To help address this issue, we created an annotated database of DNA and protein sequences derived exclusively from environmental metagenomic sequences showing AR in laboratory experiments. Our Functional Antibiotic Resistant Metagenomic Element (FARME) database is a compilation of publically available DNA sequences and predicted protein sequences conferring AR as well as regulatory elements, mobile genetic elements and predicted proteins flanking antibiotic resistant genes. FARME is the first database to focus on functional metagenomic AR gene elements and provides a resource to better understand AR in the 99% of bacteria which cannot be cultured and the relationship between environmental AR sequences and antibiotic resistant genes derived from cultured isolates.Database URL: http://staff.washington.edu/jwallace/farme. | 2017 | 28077567 |
| 9671 | 8 | 0.9992 | Genome-scale genetic manipulation methods for exploring bacterial molecular biology. Bacteria are diverse and abundant, playing key roles in human health and disease, the environment, and biotechnology. Despite progress in genome sequencing and bioengineering, much remains unknown about the functional organization of prokaryotes. For instance, roughly a third of the protein-coding genes of the best-studied model bacterium, Escherichia coli, currently lack experimental annotations. Systems-level experimental approaches for investigating the functional associations of bacterial genes and genetic structures are essential for defining the fundamental molecular biology of microbes, preventing the spread of antibacterial resistance in the clinic, and driving the development of future biotechnological applications. This review highlights recently introduced large-scale genetic manipulation and screening procedures for the systematic exploration of bacterial gene functions, molecular relationships, and the global organization of bacteria at the gene, pathway, and genome levels. | 2012 | 22517266 |
| 9081 | 9 | 0.9992 | Identification and reconstruction of novel antibiotic resistance genes from metagenomes. BACKGROUND: Environmental and commensal bacteria maintain a diverse and largely unknown collection of antibiotic resistance genes (ARGs) that, over time, may be mobilized and transferred to pathogens. Metagenomics enables cultivation-independent characterization of bacterial communities but the resulting data is noisy and highly fragmented, severely hampering the identification of previously undescribed ARGs. We have therefore developed fARGene, a method for identification and reconstruction of ARGs directly from shotgun metagenomic data. RESULTS: fARGene uses optimized gene models and can therefore with high accuracy identify previously uncharacterized resistance genes, even if their sequence similarity to known ARGs is low. By performing the analysis directly on the metagenomic fragments, fARGene also circumvents the need for a high-quality assembly. To demonstrate the applicability of fARGene, we reconstructed β-lactamases from five billion metagenomic reads, resulting in 221 ARGs, of which 58 were previously not reported. Based on 38 ARGs reconstructed by fARGene, experimental verification showed that 81% provided a resistance phenotype in Escherichia coli. Compared to other methods for detecting ARGs in metagenomic data, fARGene has superior sensitivity and the ability to reconstruct previously unknown genes directly from the sequence reads. CONCLUSIONS: We conclude that fARGene provides an efficient and reliable way to explore the unknown resistome in bacterial communities. The method is applicable to any type of ARGs and is freely available via GitHub under the MIT license. | 2019 | 30935407 |
| 9657 | 10 | 0.9992 | Machine Learning Leveraging Genomes from Metagenomes Identifies Influential Antibiotic Resistance Genes in the Infant Gut Microbiome. Antibiotic resistance in pathogens is extensively studied, and yet little is known about how antibiotic resistance genes of typical gut bacteria influence microbiome dynamics. Here, we leveraged genomes from metagenomes to investigate how genes of the premature infant gut resistome correspond to the ability of bacteria to survive under certain environmental and clinical conditions. We found that formula feeding impacts the resistome. Random forest models corroborated by statistical tests revealed that the gut resistome of formula-fed infants is enriched in class D beta-lactamase genes. Interestingly, Clostridium difficile strains harboring this gene are at higher abundance in formula-fed infants than C. difficile strains lacking this gene. Organisms with genes for major facilitator superfamily drug efflux pumps have higher replication rates under all conditions, even in the absence of antibiotic therapy. Using a machine learning approach, we identified genes that are predictive of an organism's direction of change in relative abundance after administration of vancomycin and cephalosporin antibiotics. The most accurate results were obtained by reducing annotated genomic data to five principal components classified by boosted decision trees. Among the genes involved in predicting whether an organism increased in relative abundance after treatment are those that encode subclass B2 beta-lactamases and transcriptional regulators of vancomycin resistance. This demonstrates that machine learning applied to genome-resolved metagenomics data can identify key genes for survival after antibiotics treatment and predict how organisms in the gut microbiome will respond to antibiotic administration. IMPORTANCE The process of reconstructing genomes from environmental sequence data (genome-resolved metagenomics) allows unique insight into microbial systems. We apply this technique to investigate how the antibiotic resistance genes of bacteria affect their ability to flourish in the gut under various conditions. Our analysis reveals that strain-level selection in formula-fed infants drives enrichment of beta-lactamase genes in the gut resistome. Using genomes from metagenomes, we built a machine learning model to predict how organisms in the gut microbial community respond to perturbation by antibiotics. This may eventually have clinical applications. | 2018 | 29359195 |
| 5102 | 11 | 0.9992 | Pipeline for Antimicrobial Resistance Gene Quantification from Host Tissue. Antibiotics are frequently used in food production animals to control disease and improve productivity, but this promotes the development of antimicrobial resistance (AMR) and subsequent broader spread of AMR bacteria throughout food chain, endangering the well-being and health of both animals and humans. In humans, the gut microbiome harbors a diverse range of AMR bacteria, known as the resistome. To effectively mitigate AMR in food animals requires first determining the expression and abundance of AMR-related genes in the gut resistome. Currently, such knowledge in regard to food animals is largely lacking. Gut tissue RNA sequencing (GTRS) can capture metabolically active transcripts from both the host and the microbes attached to the gut epithelium. Ideally, AMR genes can be quantified using GTRS data, making it possible to study the relationship between host and microbe. For the majority of these GTRS studies, only host transcriptome changes have been reported, while the microbial AMR remains largely unexamined, mainly due to the lack of easily implementable bioinformatics tools. Here we present a straightforward workflow to accomplish that using common command-line bioinformatics tools. With this pipeline, the host is considered noise, and host data are filtered out from the microbial reads. Transcript quantification of the AMR genes is then performed. The pipeline then continues through AMR transcript quantification, differential gene expression, and SNP analysis. Using open-source tools, we made this analytical pipeline easy to implement and able to generate results ready to be incorporated into publishable reports. Published 2025. This article is a U.S. Government work and is in the public domain in the USA. Basic Protocol: Running the gene quantification pipeline Support Protocol 1: Downloading FASTQ files from the NCBI database Support Protocol 2: Building a genome reference index of the host Support Protocol 3: Differential gene expression analysis Support Protocol 4: Single-nucleotide polymorphism (SNP) analysis. | 2025 | 40145236 |
| 8711 | 12 | 0.9992 | Novel soil bacteria possess diverse genes for secondary metabolite biosynthesis. In soil ecosystems, microorganisms produce diverse secondary metabolites such as antibiotics, antifungals and siderophores that mediate communication, competition and interactions with other organisms and the environment(1,2). Most known antibiotics are derived from a few culturable microbial taxa (3) , and the biosynthetic potential of the vast majority of bacteria in soil has rarely been investigated (4) . Here we reconstruct hundreds of near-complete genomes from grassland soil metagenomes and identify microorganisms from previously understudied phyla that encode diverse polyketide and nonribosomal peptide biosynthetic gene clusters that are divergent from well-studied clusters. These biosynthetic loci are encoded by newly identified members of the Acidobacteria, Verrucomicobia and Gemmatimonadetes, and the candidate phylum Rokubacteria. Bacteria from these groups are highly abundant in soils(5-7), but have not previously been genomically linked to secondary metabolite production with confidence. In particular, large numbers of biosynthetic genes were characterized in newly identified members of the Acidobacteria, which is the most abundant bacterial phylum across soil biomes (5) . We identify two acidobacterial genomes from divergent lineages, each of which encodes an unusually large repertoire of biosynthetic genes with up to fifteen large polyketide and nonribosomal peptide biosynthetic loci per genome. To track gene expression of genes encoding polyketide synthases and nonribosomal peptide synthetases in the soil ecosystem that we studied, we sampled 120 time points in a microcosm manipulation experiment and, using metatranscriptomics, found that gene clusters were differentially co-expressed in response to environmental perturbations. Transcriptional co-expression networks for specific organisms associated biosynthetic genes with two-component systems, transcriptional activation, putative antimicrobial resistance and iron regulation, linking metabolite biosynthesis to processes of environmental sensing and ecological competition. We conclude that the biosynthetic potential of abundant and phylogenetically diverse soil microorganisms has previously been underestimated. These organisms may represent a source of natural products that can address needs for new antibiotics and other pharmaceutical compounds. | 2018 | 29899444 |
| 9668 | 13 | 0.9992 | Genomic and functional techniques to mine the microbiome for novel antimicrobials and antimicrobial resistance genes. Microbial communities contain diverse bacteria that play important roles in every environment. Advances in sequencing and computational methodologies over the past decades have illuminated the phylogenetic and functional diversity of microbial communities from diverse habitats. Among the activities encoded in microbiomes are the abilities to synthesize and resist small molecules, yielding antimicrobial activity. These functions are of particular interest when viewed in light of the public health emergency posed by the increase in clinical antimicrobial resistance and the dwindling antimicrobial discovery and approval pipeline, and given the intimate ecological and evolutionary relationship between antimicrobial biosynthesis and resistance. Here, we review genomic and functional methods that have been developed for accessing the antimicrobial biosynthesis and resistance capacity of microbiomes and highlight outstanding examples of their applications. | 2017 | 27768825 |
| 8377 | 14 | 0.9992 | Genome-Wide Association Analyses in the Model Rhizobium Ensifer meliloti. Genome-wide association studies (GWAS) can identify genetic variants responsible for naturally occurring and quantitative phenotypic variation. Association studies therefore provide a powerful complement to approaches that rely on de novo mutations for characterizing gene function. Although bacteria should be amenable to GWAS, few GWAS have been conducted on bacteria, and the extent to which nonindependence among genomic variants (e.g., linkage disequilibrium [LD]) and the genetic architecture of phenotypic traits will affect GWAS performance is unclear. We apply association analyses to identify candidate genes underlying variation in 20 biochemical, growth, and symbiotic phenotypes among 153 strains of Ensifer meliloti For 11 traits, we find genotype-phenotype associations that are stronger than expected by chance, with the candidates in relatively small linkage groups, indicating that LD does not preclude resolving association candidates to relatively small genomic regions. The significant candidates show an enrichment for nucleotide polymorphisms (SNPs) over gene presence-absence variation (PAV), and for five traits, candidates are enriched in large linkage groups, a possible signature of epistasis. Many of the variants most strongly associated with symbiosis phenotypes were in genes previously identified as being involved in nitrogen fixation or nodulation. For other traits, apparently strong associations were not stronger than the range of associations detected in permuted data. In sum, our data show that GWAS in bacteria may be a powerful tool for characterizing genetic architecture and identifying genes responsible for phenotypic variation. However, careful evaluation of candidates is necessary to avoid false signals of association.IMPORTANCE Genome-wide association analyses are a powerful approach for identifying gene function. These analyses are becoming commonplace in studies of humans, domesticated animals, and crop plants but have rarely been conducted in bacteria. We applied association analyses to 20 traits measured in Ensifer meliloti, an agriculturally and ecologically important bacterium because it fixes nitrogen when in symbiosis with leguminous plants. We identified candidate alleles and gene presence-absence variants underlying variation in symbiosis traits, antibiotic resistance, and use of various carbon sources; some of these candidates are in genes previously known to affect these traits whereas others were in genes that have not been well characterized. Our results point to the potential power of association analyses in bacteria, but also to the need to carefully evaluate the potential for false associations. | 2018 | 30355664 |
| 7699 | 15 | 0.9992 | Effects of different assembly strategies on gene annotation in activated sludge. Activated sludge comprises diverse bacteria, fungi, and other microorganisms, featuring a rich repertoire of genes involved in antibiotic resistance, pollutant degradation, and elemental cycling. In this regard, hybrid assembly technology can revolutionize metagenomics by detecting greater gene diversity in environmental samples. Nonetheless, the optimal utilization and comparability of genomic information between hybrid assembly and short- or long-read technology remain unclear. To address this gap, we compared the performance of the hybrid assembly, short- and long-read technologies, abundance and diversity of annotated genes, and taxonomic diversity by analysing 46, 161, and 45 activated sludge metagenomic datasets, respectively. The results revealed that hybrid assembly technology exhibited the best performance, generating the most contiguous and longest contigs but with a lower proportion of high-quality metagenome-assembled genomes than short-read technology. Compared with short- or long-read technologies, hybrid assembly technology can detect a greater diversity of microbiota and antibiotic resistance genes, as well as a wider range of potential hosts. However, this approach may yield lower gene abundance and pathogen detection. Our study revealed the specific advantages and disadvantages of hybrid assembly and short- and long-read applications in wastewater treatment plants, and our approach could serve as a blueprint to be extended to terrestrial environments. | 2024 | 38734289 |
| 9666 | 16 | 0.9991 | The comprehensive antibiotic resistance database. The field of antibiotic drug discovery and the monitoring of new antibiotic resistance elements have yet to fully exploit the power of the genome revolution. Despite the fact that the first genomes sequenced of free living organisms were those of bacteria, there have been few specialized bioinformatic tools developed to mine the growing amount of genomic data associated with pathogens. In particular, there are few tools to study the genetics and genomics of antibiotic resistance and how it impacts bacterial populations, ecology, and the clinic. We have initiated development of such tools in the form of the Comprehensive Antibiotic Research Database (CARD; http://arpcard.mcmaster.ca). The CARD integrates disparate molecular and sequence data, provides a unique organizing principle in the form of the Antibiotic Resistance Ontology (ARO), and can quickly identify putative antibiotic resistance genes in new unannotated genome sequences. This unique platform provides an informatic tool that bridges antibiotic resistance concerns in health care, agriculture, and the environment. | 2013 | 23650175 |
| 9667 | 17 | 0.9991 | Novel resistance functions uncovered using functional metagenomic investigations of resistance reservoirs. Rates of infection with antibiotic-resistant bacteria have increased precipitously over the past several decades, with far-reaching healthcare and societal costs. Recent evidence has established a link between antibiotic resistance genes in human pathogens and those found in non-pathogenic, commensal, and environmental organisms, prompting deeper investigation of natural and human-associated reservoirs of antibiotic resistance. Functional metagenomic selections, in which shotgun-cloned DNA fragments are selected for their ability to confer survival to an indicator host, have been increasingly applied to the characterization of many antibiotic resistance reservoirs. These experiments have demonstrated that antibiotic resistance genes are highly diverse and widely distributed, many times bearing little to no similarity to known sequences. Through unbiased selections for survival to antibiotic exposure, functional metagenomics can improve annotations by reducing the discovery of false-positive resistance and by allowing for the identification of previously unrecognizable resistance genes. In this review, we summarize the novel resistance functions uncovered using functional metagenomic investigations of natural and human-impacted resistance reservoirs. Examples of novel antibiotic resistance genes include those highly divergent from known sequences, those for which sequence is entirely unable to predict resistance function, bifunctional resistance genes, and those with unconventional, atypical resistance mechanisms. Overcoming antibiotic resistance in the clinic will require a better understanding of existing resistance reservoirs and the dissemination networks that govern horizontal gene exchange, informing best practices to limit the spread of resistance-conferring genes to human pathogens. | 2013 | 23760651 |
| 4347 | 18 | 0.9991 | Going through phages: a computational approach to revealing the role of prophage in Staphylococcus aureus. Prophages have important roles in virulence, antibiotic resistance, and genome evolution in Staphylococcus aureus . Rapid growth in the number of sequenced S. aureus genomes allows for an investigation of prophage sequences at an unprecedented scale. We developed a novel computational pipeline for phage discovery and annotation. We combined PhiSpy, a phage discovery tool, with VGAS and PROKKA, genome annotation tools to detect and analyse prophage sequences in nearly 10 011 S . aureus genomes, discovering thousands of putative prophage sequences with genes encoding virulence factors and antibiotic resistance. To our knowledge, this is the first large-scale application of PhiSpy on a large-scale set of genomes (10 011 S . aureus ). Determining the presence of virulence and resistance encoding genes in prophage has implications for the potential transfer of these genes/functions to other bacteria via transduction and thus can provide insight into the evolution and spread of these genes/functions between bacterial strains. While the phage we have identified may be known, these phages were not necessarily known or characterized in S. aureus and the clustering and comparison we did for phage based on their gene content is novel. Moreover, the reporting of these genes with the S. aureus genomes is novel. | 2023 | 37424556 |
| 9349 | 19 | 0.9991 | Gene essentiality analysis based on DEG, a database of essential genes. Essential genes are the genes that are indispensable for the survival of an organism. The genome-scale identification of essential genes has been performed in various organisms, and we consequently constructed DEG, a Database that contains currently available essential genes. Here we analyzed functional distributions of essential genes in DEG, and found that some essential-gene functions are even conserved between the prokaryote (bacteria) and the eukaryote (yeast), e.g., genes involved in information storage and processing are overrepresented, whereas those involved in metabolism are underrepresented in essential genes compared with non-essential ones. In bacteria, species specificity in functional distribution of essential genes is mainly due to those involved in cellular processes. Furthermore, within the category of information storage and processing, function of translation, ribosomal structure, and biogenesis are predominant in essential genes. Finally, some potential pitfalls for analyzing gene essentiality based on DEG are discussed. | 2008 | 18392983 |