# | Rank | Similarity | Title + Abs. | Year | PMID |
|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 |
| 5101 | 0 | 1.0000 | Identification of Key Features Pivotal to the Characteristics and Functions of Gut Bacteria Taxa through Machine Learning Methods. BACKGROUND: Gut bacteria critically influence digestion, facilitate the breakdown of complex food substances, aid in essential nutrient synthesis, and contribute to immune system balance. However, current knowledge regarding intestinal bacteria remains insufficient. OBJECTIVE: This study aims to discover essential differences for different intestinal bacteria. METHODS: This study was conducted by investigating a total of 1478 gut bacterial samples comprising 235 Actinobacteria, 447 Bacteroidetes, and 796 Firmicutes, by utilizing sophisticated machine learning algorithms. By building on the dataset provided by Chen et al., we engaged sophisticated machine learning techniques to further investigate and analyze the gut bacterial samples. Each sample in the dataset was described by 993 unique features associated with gut bacteria, including 342 features annotated by the Antibiotic Resistance Genes Database, Comprehensive Antibiotic Research Database, Kyoto Encyclopedia of Genes and Genomes, and Virulence Factors of Pathogenic Bacteria. We employed incremental feature selection methods within a computational framework to identify the optimal features for classification. RESULTS: Eleven feature ranking algorithms selected several key features as pivotal to the characteristics and functions of gut bacteria. These features appear to facilitate the identification of specific gut bacterial species. Additionally, we established quantitative rules for identifying Actinobacteria, Bacteroidetes, and Firmicutes. CONCLUSION: This research underscores the significant potential of machine learning in studying gut microbes and enhances our understanding of the multifaceted roles of gut bacteria. | 2025 | 40671232 |
| 9657 | 1 | 0.9998 | Machine Learning Leveraging Genomes from Metagenomes Identifies Influential Antibiotic Resistance Genes in the Infant Gut Microbiome. Antibiotic resistance in pathogens is extensively studied, and yet little is known about how antibiotic resistance genes of typical gut bacteria influence microbiome dynamics. Here, we leveraged genomes from metagenomes to investigate how genes of the premature infant gut resistome correspond to the ability of bacteria to survive under certain environmental and clinical conditions. We found that formula feeding impacts the resistome. Random forest models corroborated by statistical tests revealed that the gut resistome of formula-fed infants is enriched in class D beta-lactamase genes. Interestingly, Clostridium difficile strains harboring this gene are at higher abundance in formula-fed infants than C. difficile strains lacking this gene. Organisms with genes for major facilitator superfamily drug efflux pumps have higher replication rates under all conditions, even in the absence of antibiotic therapy. Using a machine learning approach, we identified genes that are predictive of an organism's direction of change in relative abundance after administration of vancomycin and cephalosporin antibiotics. The most accurate results were obtained by reducing annotated genomic data to five principal components classified by boosted decision trees. Among the genes involved in predicting whether an organism increased in relative abundance after treatment are those that encode subclass B2 beta-lactamases and transcriptional regulators of vancomycin resistance. This demonstrates that machine learning applied to genome-resolved metagenomics data can identify key genes for survival after antibiotics treatment and predict how organisms in the gut microbiome will respond to antibiotic administration. IMPORTANCE The process of reconstructing genomes from environmental sequence data (genome-resolved metagenomics) allows unique insight into microbial systems. We apply this technique to investigate how the antibiotic resistance genes of bacteria affect their ability to flourish in the gut under various conditions. Our analysis reveals that strain-level selection in formula-fed infants drives enrichment of beta-lactamase genes in the gut resistome. Using genomes from metagenomes, we built a machine learning model to predict how organisms in the gut microbial community respond to perturbation by antibiotics. This may eventually have clinical applications. | 2018 | 29359195 |
| 8409 | 2 | 0.9997 | Comparative genomics reveals key adaptive mechanisms in pathogen host-niche specialization. INTRODUCTION: Understanding the key factors that enable bacterial pathogens to adapt to new hosts is crucial, as host-microbe interactions not only influence host health but also drive bacterial genome diversification, thereby enhancing pathogen survival in various ecological niches. METHODS: We conducted a comparative genomic analysis of 4,366 high-quality bacterial genomes isolated from various hosts and environments. Bioinformatics databases and machine learning approaches were used to identify genomic differences in functional categories, virulence factors, and antibiotic resistance genes across different ecological niches. RESULTS: Significant variability in bacterial adaptive strategies was observed. Human-associated bacteria, particularly from the phylum Pseudomonadota, exhibited higher detection rates of carbohydrate-active enzyme genes and virulence factors related to immune modulation and adhesion, indicating co-evolution with the human host. In contrast, bacteria from environmental sources, particularly those from the phyla Bacillota and Actinomycetota, showed greater enrichment in genes related to metabolism and transcriptional regulation, highlighting their high adaptability to diverse environments. Bacteria from clinical settings had higher detection rates of antibiotic resistance genes, particularly those related to fluoroquinolone resistance. Animal hosts were identified as important reservoirs of resistance genes. Key host-specific bacterial genes, such as hypB, were found to potentially play crucial roles in regulating metabolism and immune adaptation in human-associated bacteria. DISCUSSION: These findings highlight niche-specific genomic features and adaptive mechanisms of bacterial pathogens. This study provides valuable insights into the genetic basis of host-pathogen interactions and offers evidence to inform pathogen transmission control, infection management, and antibiotic stewardship. | 2025 | 40547794 |
| 5102 | 3 | 0.9997 | Pipeline for Antimicrobial Resistance Gene Quantification from Host Tissue. Antibiotics are frequently used in food production animals to control disease and improve productivity, but this promotes the development of antimicrobial resistance (AMR) and subsequent broader spread of AMR bacteria throughout food chain, endangering the well-being and health of both animals and humans. In humans, the gut microbiome harbors a diverse range of AMR bacteria, known as the resistome. To effectively mitigate AMR in food animals requires first determining the expression and abundance of AMR-related genes in the gut resistome. Currently, such knowledge in regard to food animals is largely lacking. Gut tissue RNA sequencing (GTRS) can capture metabolically active transcripts from both the host and the microbes attached to the gut epithelium. Ideally, AMR genes can be quantified using GTRS data, making it possible to study the relationship between host and microbe. For the majority of these GTRS studies, only host transcriptome changes have been reported, while the microbial AMR remains largely unexamined, mainly due to the lack of easily implementable bioinformatics tools. Here we present a straightforward workflow to accomplish that using common command-line bioinformatics tools. With this pipeline, the host is considered noise, and host data are filtered out from the microbial reads. Transcript quantification of the AMR genes is then performed. The pipeline then continues through AMR transcript quantification, differential gene expression, and SNP analysis. Using open-source tools, we made this analytical pipeline easy to implement and able to generate results ready to be incorporated into publishable reports. Published 2025. This article is a U.S. Government work and is in the public domain in the USA. Basic Protocol: Running the gene quantification pipeline Support Protocol 1: Downloading FASTQ files from the NCBI database Support Protocol 2: Building a genome reference index of the host Support Protocol 3: Differential gene expression analysis Support Protocol 4: Single-nucleotide polymorphism (SNP) analysis. | 2025 | 40145236 |
| 4642 | 4 | 0.9997 | Characterization of antibiotic resistance and host-microbiome interactions in the human upper respiratory tract during influenza infection. BACKGROUND: The abundance and diversity of antibiotic resistance genes (ARGs) in the human respiratory microbiome remain poorly characterized. In the context of influenza virus infection, interactions between the virus, the host, and resident bacteria with pathogenic potential are known to complicate and worsen disease, resulting in coinfection and increased morbidity and mortality of infected individuals. When pathogenic bacteria acquire antibiotic resistance, they are more difficult to treat and of global health concern. Characterization of ARG expression in the upper respiratory tract could help better understand the role antibiotic resistance plays in the pathogenesis of influenza-associated bacterial secondary infection. RESULTS: Thirty-seven individuals participating in the Household Influenza Transmission Study (HITS) in Managua, Nicaragua, were selected for this study. We performed metatranscriptomics and 16S rRNA gene sequencing analyses on nasal and throat swab samples, and host transcriptome profiling on blood samples. Individuals clustered into two groups based on their microbial gene expression profiles, with several microbial pathways enriched with genes differentially expressed between groups. We also analyzed antibiotic resistance gene expression and determined that approximately 25% of the sequence reads that corresponded to antibiotic resistance genes mapped to Streptococcus pneumoniae and Staphylococcus aureus. Following construction of an integrated network of ARG expression with host gene co-expression, we identified several host key regulators involved in the host response to influenza virus and bacterial infections, and host gene pathways associated with specific antibiotic resistance genes. CONCLUSIONS: This study indicates the host response to influenza infection could indirectly affect antibiotic resistance gene expression in the respiratory tract by impacting the microbial community structure and overall microbial gene expression. Interactions between the host systemic responses to influenza infection and antibiotic resistance gene expression highlight the importance of viral-bacterial co-infection in acute respiratory infections like influenza. Video abstract. | 2020 | 32178738 |
| 7707 | 5 | 0.9997 | Exploring the dynamics of gut microbiota, antibiotic resistance, and chemotherapy impact in acute leukemia patients: A comprehensive metagenomic analysis. Leukemia poses significant challenges to its treatment, and understanding its complex pathogenesis is crucial. This study used metagenomic sequencing to investigate the interplay between chemotherapy, gut microbiota, and antibiotic resistance in patients with acute leukemia (AL). Pre- and post-chemotherapy stool samples from patients revealed alterations in microbial richness, taxa, and antibiotic resistance genes (ARGs). The analysis revealed a decreased alpha diversity, increased dispersion in post-chemotherapy samples, and changes in the abundance of specific bacteria. Key bacteria such as Enterococcus, Klebsiella, and Escherichia coli have been identified as prevalent ARG carriers. Correlation analysis between gut microbiota and blood indicators revealed potential links between microbial species and inflammatory biomarkers, including C-reactive protein (CRP) and adenosine deaminase (ADA). This study investigated the impact of antibiotic dosage on microbiota and ARGs, revealing networks connecting co-occurring ARGs with microbial species (179 nodes, 206 edges), and networks associated with ARGs and antibiotic dosages (50 nodes, 50 edges). Antibiotics such as cephamycin and sulfonamide led to multidrug-resistant Klebsiella colonization. Our analyses revealed distinct microbial profiles with Salmonella enterica elevated post-chemotherapy in NF patients and Akkermansia muciniphila elevated pre-chemotherapy. These microbial signatures could inform strategies to modulate the gut microbiome, potentially mitigating the risk of neutropenic fever in patients undergoing chemotherapy. Finally, a comprehensive analysis of KEGG modules shed light on disrupted metabolic pathways after chemotherapy, providing insights into potential targets for managing side effects. Overall, this study revealed intricate relationships between gut microbiota, chemotherapy, and antibiotic resistance, providing new insights into improving therapy and enhancing patient outcomes. | 2024 | 39620486 |
| 4296 | 6 | 0.9997 | Twenty-first century molecular methods for analyzing antimicrobial resistance in surface waters to support One Health assessments. Antimicrobial resistance (AMR) in the environment is a growing global health concern, especially the dissemination of AMR into surface waters due to human and agricultural inputs. Within recent years, research has focused on trying to understand the impact of AMR in surface waters on human, agricultural and ecological health (One Health). While surface water quality assessments and surveillance of AMR have historically utilized culture-based methods, culturing bacteria has limitations due to difficulty in isolating environmental bacteria and the need for a priori information about the bacteria for selective isolation. The use of molecular techniques to analyze AMR at the genetic level has helped to overcome the difficulties with culture-based techniques since they do not require advance knowledge of the bacterial population and can analyze uncultivable environmental bacteria. The aim of this review is to provide an overview of common contemporary molecular methods available for analyzing AMR in surface waters, which include high throughput real-time polymerase chain reaction (HT-qPCR), metagenomics, and whole genome sequencing. This review will also feature how these methods may provide information on human and animal health risks. HT-qPCR works at the nanoliter scale, requires only a small amount of DNA, and can analyze numerous gene targets simultaneously, but may lack in analytical sensitivity and the ability to optimize individual assays compared to conventional qPCR. Metagenomics offers more detailed genomic information and taxonomic resolution than PCR by sequencing all the microbial genomes within a sample. Its open format allows for the discovery of new antibiotic resistance genes; however, the quantity of DNA necessary for this technique can be a limiting factor for surface water samples that typically have low numbers of bacteria per sample volume. Whole genome sequencing provides the complete genomic profile of a single environmental isolate and can identify all genetic elements that may confer AMR. However, a main disadvantage of this technique is that it only provides information about one bacterial isolate and is challenging to utilize for community analysis. While these contemporary techniques can quickly provide a vast array of information about AMR in surface waters, one technique does not fully characterize AMR nor its potential risks to human, animal, or ecological health. Rather, a combination of techniques (including both molecular- and culture-based) are necessary to fully understand AMR in surface waters from a One Health perspective. | 2021 | 33774111 |
| 5114 | 7 | 0.9997 | Datasets for benchmarking antimicrobial resistance genes in bacterial metagenomic and whole genome sequencing. Whole genome sequencing (WGS) is a key tool in identifying and characterising disease-associated bacteria across clinical, agricultural, and environmental contexts. One increasingly common use of genomic and metagenomic sequencing is in identifying the type and range of antimicrobial resistance (AMR) genes present in bacterial isolates in order to make predictions regarding their AMR phenotype. However, there are a large number of alternative bioinformatics software and pipelines available, which can lead to dissimilar results. It is, therefore, vital that researchers carefully evaluate their genomic and metagenomic AMR analysis methods using a common dataset. To this end, as part of the Microbial Bioinformatics Hackathon and Workshop 2021, a 'gold standard' reference genomic and simulated metagenomic dataset was generated containing raw sequence reads mapped against their corresponding reference genome from a range of 174 potentially pathogenic bacteria. These datasets and their accompanying metadata are freely available for use in benchmarking studies of bacteria and their antimicrobial resistance genes and will help improve tool development for the identification of AMR genes in complex samples. | 2022 | 35705638 |
| 5099 | 8 | 0.9997 | A machine learning-based strategy to elucidate the identification of antibiotic resistance in bacteria. Microorganisms, crucial for environmental equilibrium, could be destructive, resulting in detrimental pathophysiology to the human host. Moreover, with the emergence of antibiotic resistance (ABR), the microbial communities pose the century's largest public health challenges in terms of effective treatment strategies. Furthermore, given the large diversity and number of known bacterial strains, describing treatment choices for infected patients using experimental methodologies is time-consuming. An alternative technique, gaining popularity as sequencing prices fall and technology advances, is to use bacterial genotype rather than phenotype to determine ABR. Complementing machine learning into clinical practice provides a data-driven platform for categorization and interpretation of bacterial datasets. In the present study, k-mers were generated from nucleotide sequences of pathogenic bacteria resistant to antibiotics. Subsequently, they were clustered into groups of bacteria sharing similar genomic features using the Affinity propagation algorithm with a Silhouette coefficient of 0.82. Thereafter, a prediction model based on Random Forest algorithm was developed to explore the prediction capability of the k-mers. It yielded an overall specificity of 0.99 and a sensitivity of 0.98. Additionally, the genes and ABR drivers related to the k-mers were identified to explore their biological relevance. Furthermore, a multilayer perceptron model with a hamming loss of 0.05 was built to classify the bacterial strains into resistant and non-resistant strains against various antibiotics. Segregating pathogenic bacteria based on genomic similarities could be a valuable approach for assessing the severity of diseases caused by new bacterial strains. Utilization of this strategy could aid in enhancing our understanding of ABR patterns, paving the way for more informed and effective treatment options. | 2024 | 39816256 |
| 3874 | 9 | 0.9997 | Culture-enriched human gut microbiomes reveal core and accessory resistance genes. BACKGROUND: Low-abundance microorganisms of the gut microbiome are often referred to as a reservoir for antibiotic resistance genes. Unfortunately, these less-abundant bacteria can be overlooked by deep shotgun sequencing. In addition, it is a challenge to associate the presence of resistance genes with their risk of acquisition by pathogens. In this study, we used liquid culture enrichment of stools to assemble the genome of lower-abundance bacteria from fecal samples. We then investigated the gene content recovered from these culture-enriched and culture-independent metagenomes in relation with their taxonomic origin, specifically antibiotic resistance genes. We finally used a pangenome approach to associate resistance genes with the core or accessory genome of Enterobacteriaceae and inferred their propensity to horizontal gene transfer. RESULTS: Using culture-enrichment approaches with stools allowed assembly of 187 bacterial species with an assembly size greater than 1 million nucleotides. Of these, 67 were found only in culture-enriched conditions, and 22 only in culture-independent microbiomes. These assembled metagenomes allowed the evaluation of the gene content of specific subcommunities of the gut microbiome. We observed that differentially distributed metabolic enzymes were associated with specific culture conditions and, for the most part, with specific taxa. Gene content differences between microbiomes, for example, antibiotic resistance, were for the most part not associated with metabolic enzymes, but with other functions. We used a pangenome approach to determine if the resistance genes found in Enterobacteriaceae, specifically E. cloacae or E. coli, were part of the core genome or of the accessory genome of this species. In our healthy volunteer cohort, we found that E. cloacae contigs harbored resistance genes that were part of the core genome of the species, while E. coli had a large accessory resistome proximal to mobile elements. CONCLUSION: Liquid culture of stools contributed to an improved functional and comparative genomics study of less-abundant gut bacteria, specifically those associated with antibiotic resistance. Defining whether a gene is part of the core genome of a species helped in interpreting the genomes recovered from culture-independent or culture-enriched microbiomes. | 2019 | 30953542 |
| 9653 | 10 | 0.9997 | Evaluating the mobility potential of antibiotic resistance genes in environmental resistomes without metagenomics. Antibiotic resistance genes are ubiquitous in the environment. However, only a fraction of them are mobile and able to spread to pathogenic bacteria. Until now, studying the mobility of antibiotic resistance genes in environmental resistomes has been challenging due to inadequate sensitivity and difficulties in contig assembly of metagenome based methods. We developed a new cost and labor efficient method based on Inverse PCR and long read sequencing for studying mobility potential of environmental resistance genes. We applied Inverse PCR on sediment samples and identified 79 different MGE clusters associated with the studied resistance genes, including novel mobile genetic elements, co-selected resistance genes and a new putative antibiotic resistance gene. The results show that the method can be used in antibiotic resistance early warning systems. In comparison to metagenomics, Inverse PCR was markedly more sensitive and provided more data on resistance gene mobility and co-selected resistances. | 2016 | 27767072 |
| 6597 | 11 | 0.9997 | Exploiting a targeted resistome sequencing approach in assessing antimicrobial resistance in retail foods. BACKGROUND: With the escalating risk of antimicrobial resistance (AMR), there are limited analytical options available that can comprehensively assess the burden of AMR carried by clinical/environmental samples. Food can be a potential source of AMR bacteria for humans, but its significance in driving the clinical spread of AMR remains unclear, largely due to the lack of holistic-yet-sensitive tools for surveillance and evaluation. Metagenomics is a culture-independent approach well suited for uncovering genetic determinants of defined microbial traits, such as AMR, present within unknown bacterial communities. Despite its popularity, the conventional approach of non-selectively sequencing a sample's metagenome (namely, shotgun-metagenomics) has several technical drawbacks that lead to uncertainty about its effectiveness for AMR assessment; for instance, the low discovery rate of resistance-associated genes due to their naturally small genomic footprint within the vast metagenome. Here, we describe the development of a targeted resistome sequencing method and demonstrate its application in the characterization of the AMR gene profile of bacteria associated with several retail foods. RESULT: A targeted-metagenomic sequencing workflow using a customized bait-capture system targeting over 4,000 referenced AMR genes and 263 plasmid replicon sequences was validated against both mock and sample-derived bacterial community preparations. Compared to shotgun-metagenomics, the targeted method consistently provided for improved recovery of resistance gene targets with a much-improved target detection efficiency (> 300-fold). Targeted resistome analyses conducted on 36 retail-acquired food samples (fresh sprouts, n = 10; ground meat, n = 26) and their corresponding bacterial enrichment cultures (n = 36) reveals in-depth features regarding the identity and diversity of AMR genes, most of which were otherwise undetected by the whole-metagenome shotgun sequencing method. Furthermore, our findings suggest that foodborne Gammaproteobacteria could be the major reservoir of food-associated AMR genetic determinants, and that the resistome structure of the selected high-risk food commodities are, to a large extent, dictated by microbiome composition. CONCLUSIONS: For metagenomic sequencing-based surveillance of AMR, the target-capture method presented herein represents a more sensitive and efficient approach to evaluate the resistome profile of complex food or environmental samples. This study also further implicates retail foods as carriers of diverse resistance-conferring genes indicating a potential impact on the dissemination of AMR. | 2023 | 36991496 |
| 9654 | 12 | 0.9997 | Studying the Association between Antibiotic Resistance Genes and Insertion Sequences in Metagenomes: Challenges and Pitfalls. Antibiotic resistance is an issue in many areas of human activity. The mobilization of antibiotic resistance genes within the bacterial community makes it difficult to study and control the phenomenon. It is known that certain insertion sequences, which are mobile genetic elements, can participate in the mobilization of antibiotic resistance genes and in the expression of these genes. However, the magnitude of the contribution of insertion sequences to the mobility of antibiotic resistance genes remains understudied. In this study, the relationships between insertion sequences and antibiotic resistance genes present in the microbiome were investigated using two public datasets. The first made it possible to analyze the effects of different antibiotics in a controlled mouse model. The second dataset came from a study of the differences between conventional and organic-raised cattle. Although it was possible to find statistically significant correlations between the insertion sequences and antibiotic resistance genes in both datasets, several challenges remain to better understand the contribution of insertion sequences to the motility of antibiotic resistance genes. Obtaining more complete and less fragmented metagenomes with long-read sequencing technologies could make it possible to understand the mechanisms favoring horizontal transfers within the microbiome with greater precision. | 2023 | 36671375 |
| 6595 | 13 | 0.9997 | Methodological aspects of investigating the resistome in pig farm environments. A typical One Health issue, antimicrobial resistance (AMR) development and its spread among people, animals, and the environment attracts significant research attention. The animal sector is one of the major contributors to the development and dissemination of AMR and accounts for more than 50 % of global antibiotics usage. The use of antibiotics exerts a selective pressure for resistant bacteria in the exposed microbiome, but many questions about the epidemiology of AMR in farm environments remain unanswered. This is connected to several methodological challenges and limitations, such as inconsistent sampling methods, complexity of farm environment samples and the lack of standardized protocols for sample collection, processing and bioinformatical analysis. In this project, we combined metagenomics and bioinformatics to optimise the methodology for reproducible research on the resistome in complex samples from the indoor farm environment. The work included optimizing sample collection, transportation, and storage, as well as DNA extraction, sequencing, and bioinformatic analysis, such as metagenome assembly and antibiotic resistance gene (ARG) detection. Our studies suggest that the current most optimal and cost-effective pipeline for ARG search should be based on Illumina sequencing of sock sample material at high depth (at least 25 M 250 bp PE for AMR gene families and 43 M for gene variants). We present a computational analysis utilizing MEGAHIT assembly to balance the identification of bacteria carrying ARGs with the potential loss of diversity and abundance of resistance genes. Our findings indicate that searching against multiple ARG databases is essential for detecting the highest diversity of ARGs. | 2025 | 39954816 |
| 4052 | 14 | 0.9997 | Functional metagenomics for the investigation of antibiotic resistance. Antibiotic resistance is a major threat to human health and well-being. To effectively combat this problem we need to understand the range of different resistance genes that allow bacteria to resist antibiotics. To do this the whole microbiota needs to be investigated. As most bacteria cannot be cultivated in the laboratory, the reservoir of antibiotic resistance genes in the non-cultivatable majority remains relatively unexplored. Currently the only way to study antibiotic resistance in these organisms is to use metagenomic approaches. Furthermore, the only method that does not require any prior knowledge about the resistance genes is functional metagenomics, which involves expressing genes from metagenomic clones in surrogate hosts. In this review the methods and limitations of functional metagenomics to isolate new antibiotic resistance genes and the mobile genetic elements that mediate their spread are explored. | 2014 | 24556726 |
| 4638 | 15 | 0.9997 | Comprehensive Scanning of Prophages in Lactobacillus: Distribution, Diversity, Antibiotic Resistance Genes, and Linkages with CRISPR-Cas Systems. Prophage integration, release, and dissemination exert various effects on host bacteria. In the genus Lactobacillus, they may cause bacteriophage contamination during fermentation and even regulate bacterial populations in the gut. However, little is known about their distribution, genetic architecture, and relationships with their hosts. Here, we conducted prophage prediction analysis on 1,472 genomes from 16 different Lactobacillus species and found prophage fragments in almost all lactobacilli (99.8%), with 1,459 predicted intact prophages identified in 64.1% of the strains. We present an uneven prophage distribution among Lactobacillus species; multihabitat species retained more prophages in their genomes than restricted-habitat species. Characterization of the genome features, average nucleotide identity, and landscape visualization presented a high genome diversity of Lactobacillus prophages. We detected antibiotic resistance genes in more than 10% of Lactobacillus prophages and validated that the occurrence of resistance genes conferred by prophage integration was possibly associated with phenotypic resistance in Lactobacillus plantarum. Furthermore, our broad and comprehensive examination of the distribution of CRISPR-Cas systems across the genomes predicted type I and type III systems as potential antagonistic elements of Lactobacillus prophage. IMPORTANCE Lactobacilli are inherent microorganisms in the human gut and are widely used in the food processing industries due to their probiotic properties. Prophages were reportedly hidden in numerous Lactobacillus genomes and can potentially contaminate entire batches of fermentation or modulate the intestinal microecology once they are released. Therefore, a comprehensive scanning of prophages in Lactobacillus is essential for the safety evaluation and application development of probiotic candidates. We show that prophages are widely distributed among lactobacilli; however, intact prophages are more common in multihabitat species and display wide variations in genome feature, integration site, and genomic organization. Our data of the prophage-mediated antibiotic resistance genes (ARGs) and the resistance phenotype of lactobacilli provide evidence for deciphering the putative role of prophages as vectors of the ARGs. Furthermore, understanding the association between prophages and CRISPR-Cas systems is crucial to appreciate the coevolution of phages and Lactobacillus. | 2021 | 34060909 |
| 4036 | 16 | 0.9997 | Man-made microbial resistances in built environments. Antimicrobial resistance is a serious threat to global public health, but little is known about the effects of microbial control on the microbiota and its associated resistome. Here we compare the microbiota present on surfaces of clinical settings with other built environments. Using state-of-the-art metagenomics approaches and genome and plasmid reconstruction, we show that increased confinement and cleaning is associated with a loss of microbial diversity and a shift from Gram-positive bacteria, such as Actinobacteria and Firmicutes, to Gram-negative such as Proteobacteria. Moreover, the microbiome of highly maintained built environments has a different resistome when compared to other built environments, as well as a higher diversity in resistance genes. Our results highlight that the loss of microbial diversity correlates with an increase in resistance, and the need for implementing strategies to restore bacterial diversity in certain built environments. | 2019 | 30814504 |
| 5100 | 17 | 0.9997 | DeepPBI-KG: a deep learning method for the prediction of phage-bacteria interactions based on key genes. Phages, the natural predators of bacteria, were discovered more than 100 years ago. However, increasing antimicrobial resistance rates have revitalized phage research. Methods that are more time-consuming and efficient than wet-laboratory experiments are needed to help screen phages quickly for therapeutic use. Traditional computational methods usually ignore the fact that phage-bacteria interactions are achieved by key genes and proteins. Methods for intraspecific prediction are rare since almost all existing methods consider only interactions at the species and genus levels. Moreover, most strains in existing databases contain only partial genome information because whole-genome information for species is difficult to obtain. Here, we propose a new approach for interaction prediction by constructing new features from key genes and proteins via the application of K-means sampling to select high-quality negative samples for prediction. Finally, we develop DeepPBI-KG, a corresponding prediction tool based on feature selection and a deep neural network. The results show that the average area under the curve for prediction reached 0.93 for each strain, and the overall AUC and area under the precision-recall curve reached 0.89 and 0.92, respectively, on the independent test set; these values are greater than those of other existing prediction tools. The forward and reverse validation results indicate that key genes and key proteins regulate and influence the interaction, which supports the reliability of the model. In addition, intraspecific prediction experiments based on Klebsiella pneumoniae data demonstrate the potential applicability of DeepPBI-KG for intraspecific prediction. In summary, the feature engineering and interaction prediction approaches proposed in this study can effectively improve the robustness and stability of interaction prediction, can achieve high generalizability, and may provide new directions and insights for rapid phage screening for therapy. | 2024 | 39344712 |
| 7699 | 18 | 0.9997 | Effects of different assembly strategies on gene annotation in activated sludge. Activated sludge comprises diverse bacteria, fungi, and other microorganisms, featuring a rich repertoire of genes involved in antibiotic resistance, pollutant degradation, and elemental cycling. In this regard, hybrid assembly technology can revolutionize metagenomics by detecting greater gene diversity in environmental samples. Nonetheless, the optimal utilization and comparability of genomic information between hybrid assembly and short- or long-read technology remain unclear. To address this gap, we compared the performance of the hybrid assembly, short- and long-read technologies, abundance and diversity of annotated genes, and taxonomic diversity by analysing 46, 161, and 45 activated sludge metagenomic datasets, respectively. The results revealed that hybrid assembly technology exhibited the best performance, generating the most contiguous and longest contigs but with a lower proportion of high-quality metagenome-assembled genomes than short-read technology. Compared with short- or long-read technologies, hybrid assembly technology can detect a greater diversity of microbiota and antibiotic resistance genes, as well as a wider range of potential hosts. However, this approach may yield lower gene abundance and pathogen detection. Our study revealed the specific advantages and disadvantages of hybrid assembly and short- and long-read applications in wastewater treatment plants, and our approach could serve as a blueprint to be extended to terrestrial environments. | 2024 | 38734289 |
| 4644 | 19 | 0.9997 | Longitudinal metatranscriptomic sequencing of Southern California wastewater representing 16 million people from August 2020-21 reveals widespread transcription of antibiotic resistance genes. Municipal wastewater provides a representative sample of human fecal waste across a catchment area and contains a wide diversity of microbes. Sequencing wastewater samples provides information about human-associated and medically-important microbial populations, and may be useful to assay disease prevalence and antimicrobial resistance (AMR). Here, we present a study in which we used untargeted metatranscriptomic sequencing on RNA extracted from 275 sewage influent samples obtained from eight wastewater treatment plants (WTPs) representing approximately 16 million people in Southern California between August 2020 - August 2021. We characterized bacterial and viral transcripts, assessed metabolic pathway activity, and identified over 2,000 AMR genes/variants across all samples. Because we did not deplete ribosomal RNA, we have a unique window into AMR carried as ribosomal mutants. We show that AMR diversity varied between WTPs and that the relative abundance of many individual AMR genes/variants increased over time and may be connected to antibiotic use during the COVID-19 pandemic. Similarly, we detected transcripts mapping to human pathogenic bacteria and viruses suggesting RNA sequencing is a powerful tool for wastewater-based epidemiology and that there are geographical signatures to microbial transcription. We captured the transcription of gene pathways common to bacterial cell processes, including central carbon metabolism, nucleotide synthesis/salvage, and amino acid biosynthesis. We also posit that due to the ubiquity of many viruses and bacteria in wastewater, new biological targets for microbial water quality assessment can be developed. To the best of our knowledge, our study provides the most complete longitudinal metatranscriptomic analysis of a large population's wastewater to date and demonstrates our ability to monitor the presence and activity of microbes in complex samples. By sequencing RNA, we can track the relative abundance of expressed AMR genes/variants and metabolic pathways, increasing our understanding of AMR activity across large human populations and sewer sheds. | 2022 | 35982656 |