# | Rank | Similarity | Title + Abs. | Year | PMID |
|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 |
| 5125 | 0 | 0.9950 | Do we still need Illumina sequencing data? Evaluating Oxford Nanopore Technologies R10.4.1 flow cells and the Rapid v14 library prep kit for Gram negative bacteria whole genome assemblies. The best whole genome assemblies are currently built from a combination of highly accurate short-read sequencing data and long-read sequencing data that can bridge repetitive and problematic regions. Oxford Nanopore Technologies (ONT) produce long-read sequencing platforms and they are continually improving their technology to obtain higher quality read data that is approaching the quality obtained from short-read platforms such as Illumina. As these innovations continue, we evaluated how much ONT read coverage produced by the Rapid Barcoding Kit v14 (SQK-RBK114) is necessary to generate high-quality hybrid and long-read-only genome assemblies for a panel of carbapenemase-producing Enterobacterales bacterial isolates. We found that 30× long-read coverage is sufficient if Illumina data are available, and that more (at least 100× long-read coverage is recommended for long-read-only assemblies. Illumina polishing is still improving single nucleotide variants (SNVs) and INDELs in long-read-only assemblies. We also examined if antimicrobial resistance genes could be accurately identified in long-read-only data, and found that Flye assemblies regardless of ONT coverage detected >96% of resistance genes at 100% identity and length. Overall, the Rapid Barcoding Kit v14 and long-read-only assemblies can be an optimal sequencing strategy (i.e., plasmid characterization and AMR detection) but finer-scale analyses (i.e., SNV) still benefit from short-read data. | 2024 | 38354391 |
| 9082 | 1 | 0.9949 | GeneMates: an R package for detecting horizontal gene co-transfer between bacteria using gene-gene associations controlled for population structure. BACKGROUND: Horizontal gene transfer contributes to bacterial evolution through mobilising genes across various taxonomical boundaries. It is frequently mediated by mobile genetic elements (MGEs), which may capture, maintain, and rearrange mobile genes and co-mobilise them between bacteria, causing horizontal gene co-transfer (HGcoT). This physical linkage between mobile genes poses a great threat to public health as it facilitates dissemination and co-selection of clinically important genes amongst bacteria. Although rapid accumulation of bacterial whole-genome sequencing data since the 2000s enables study of HGcoT at the population level, results based on genetic co-occurrence counts and simple association tests are usually confounded by bacterial population structure when sampled bacteria belong to the same species, leading to spurious conclusions. RESULTS: We have developed a network approach to explore WGS data for evidence of intraspecies HGcoT and have implemented it in R package GeneMates ( github.com/wanyuac/GeneMates ). The package takes as input an allelic presence-absence matrix of interested genes and a matrix of core-genome single-nucleotide polymorphisms, performs association tests with linear mixed models controlled for population structure, produces a network of significantly associated alleles, and identifies clusters within the network as plausible co-transferred alleles. GeneMates users may choose to score consistency of allelic physical distances measured in genome assemblies using a novel approach we have developed and overlay scores to the network for further evidence of HGcoT. Validation studies of GeneMates on known acquired antimicrobial resistance genes in Escherichia coli and Salmonella Typhimurium show advantages of our network approach over simple association analysis: (1) distinguishing between allelic co-occurrence driven by HGcoT and that driven by clonal reproduction, (2) evaluating effects of population structure on allelic co-occurrence, and (3) direct links between allele clusters in the network and MGEs when physical distances are incorporated. CONCLUSION: GeneMates offers an effective approach to detection of intraspecies HGcoT using WGS data. | 2020 | 32972363 |
| 9076 | 2 | 0.9949 | ResiDB: An automated database manager for sequence data. The amount of publicly available DNA sequence data is drastically increasing, making it a tedious task to create sequence databases necessary for the design of diagnostic assays. The selection of appropriate sequences is especially challenging in genes affected by frequent point mutations such as antibiotic resistance genes. To overcome this issue, we have designed the webtool resiDB, a rapid and user-friendly sequence database manager for bacteria, fungi, viruses, protozoa, invertebrates, plants, archaea, environmental and whole genome shotgun sequence data. It automatically identifies and curates sequence clusters to create custom sequence databases based on user-defined input sequences. A collection of helpful visualization tools gives the user the opportunity to easily access, evaluate, edit, and download the newly created database. Consequently, researchers do no longer have to manually manage sequence data retrieval, deal with hardware limitations, and run multiple independent software tools, each having its own requirements, input and output formats. Our tool was developed within the H2020 project FAPIC aiming to develop a single diagnostic assay targeting all sepsis-relevant pathogens and antibiotic resistance mechanisms. ResiDB is freely accessible to all users through https://residb.ait.ac.at/. | 2021 | 33495705 |
| 9074 | 3 | 0.9949 | BacAnt: A Combination Annotation Server for Bacterial DNA Sequences to Identify Antibiotic Resistance Genes, Integrons, and Transposable Elements. Whole genome sequencing (WGS) of bacteria has become a routine method in diagnostic laboratories. One of the clinically most useful advantages of WGS is the ability to predict antimicrobial resistance genes (ARGs) and mobile genetic elements (MGEs) in bacterial sequences. This allows comprehensive investigations of such genetic features but can also be used for epidemiological studies. A plethora of software programs have been developed for the detailed annotation of bacterial DNA sequences, such as rapid annotation using subsystem technology (RAST), Resfinder, ISfinder, INTEGRALL and The Transposon Registry. Unfortunately, to this day, a reliable annotation tool of the combination of ARGs and MGEs is not available, and the generation of genbank files requires much manual input. Here, we present a new webserver which allows the annotation of ARGs, integrons and transposable elements at the same time. The pipeline generates genbank files automatically, which are compatible with Easyfig for comparative genomic analysis. Our BacAnt code and standalone software package are available at https://github.com/xthua/bacant with an accompanying web application at http://bacant.net. | 2021 | 34367079 |
| 9075 | 4 | 0.9948 | CamPype: an open-source workflow for automated bacterial whole-genome sequencing analysis focused on Campylobacter. BACKGROUND: The rapid expansion of Whole-Genome Sequencing has revolutionized the fields of clinical and food microbiology. However, its implementation as a routine laboratory technique remains challenging due to the growth of data at a faster rate than can be effectively analyzed and critical gaps in bioinformatics knowledge. RESULTS: To address both issues, CamPype was developed as a new bioinformatics workflow for the genomics analysis of sequencing data of bacteria, especially Campylobacter, which is the main cause of gastroenteritis worldwide making a negative impact on the economy of the public health systems. CamPype allows fully customization of stages to run and tools to use, including read quality control filtering, read contamination, reads extension and assembly, bacterial typing, genome annotation, searching for antibiotic resistance genes, virulence genes and plasmids, pangenome construction and identification of nucleotide variants. All results are processed and resumed in an interactive HTML report for best data visualization and interpretation. CONCLUSIONS: The minimal user intervention of CamPype makes of this workflow an attractive resource for microbiology laboratories with no expertise in bioinformatics as a first line method for bacterial typing and epidemiological analyses, that would help to reduce the costs of disease outbreaks, or for comparative genomic analyses. CamPype is publicly available at https://github.com/JoseBarbero/CamPype . | 2023 | 37474912 |
| 9083 | 5 | 0.9948 | ARGNet: using deep neural networks for robust identification and classification of antibiotic resistance genes from sequences. BACKGROUND: Emergence of antibiotic resistance in bacteria is an important threat to global health. Antibiotic resistance genes (ARGs) are some of the key components to define bacterial resistance and their spread in different environments. Identification of ARGs, particularly from high-throughput sequencing data of the specimens, is the state-of-the-art method for comprehensively monitoring their spread and evolution. Current computational methods to identify ARGs mainly rely on alignment-based sequence similarities with known ARGs. Such approaches are limited by choice of reference databases and may potentially miss novel ARGs. The similarity thresholds are usually simple and could not accommodate variations across different gene families and regions. It is also difficult to scale up when sequence data are increasing. RESULTS: In this study, we developed ARGNet, a deep neural network that incorporates an unsupervised learning autoencoder model to identify ARGs and a multiclass classification convolutional neural network to classify ARGs that do not depend on sequence alignment. This approach enables a more efficient discovery of both known and novel ARGs. ARGNet accepts both amino acid and nucleotide sequences of variable lengths, from partial (30-50 aa; 100-150 nt) sequences to full-length protein or genes, allowing its application in both target sequencing and metagenomic sequencing. Our performance evaluation showed that ARGNet outperformed other deep learning models including DeepARG and HMD-ARG in most of the application scenarios especially quasi-negative test and the analysis of prediction consistency with phylogenetic tree. ARGNet has a reduced inference runtime by up to 57% relative to DeepARG. CONCLUSIONS: ARGNet is flexible, efficient, and accurate at predicting a broad range of ARGs from the sequencing data. ARGNet is freely available at https://github.com/id-bioinfo/ARGNet , with an online service provided at https://ARGNet.hku.hk . Video Abstract. | 2024 | 38725076 |
| 5115 | 6 | 0.9947 | Search Engine for Antimicrobial Resistance: A Cloud Compatible Pipeline and Web Interface for Rapidly Detecting Antimicrobial Resistance Genes Directly from Sequence Data. BACKGROUND: Antimicrobial resistance remains a growing and significant concern in human and veterinary medicine. Current laboratory methods for the detection and surveillance of antimicrobial resistant bacteria are limited in their effectiveness and scope. With the rapidly developing field of whole genome sequencing beginning to be utilised in clinical practice, the ability to interrogate sequencing data quickly and easily for the presence of antimicrobial resistance genes will become increasingly important and useful for informing clinical decisions. Additionally, use of such tools will provide insight into the dynamics of antimicrobial resistance genes in metagenomic samples such as those used in environmental monitoring. RESULTS: Here we present the Search Engine for Antimicrobial Resistance (SEAR), a pipeline and web interface for detection of horizontally acquired antimicrobial resistance genes in raw sequencing data. The pipeline provides gene information, abundance estimation and the reconstructed sequence of antimicrobial resistance genes; it also provides web links to additional information on each gene. The pipeline utilises clustering and read mapping to annotate full-length genes relative to a user-defined database. It also uses local alignment of annotated genes to a range of online databases to provide additional information. We demonstrate SEAR's application in the detection and abundance estimation of antimicrobial resistance genes in two novel environmental metagenomes, 32 human faecal microbiome datasets and 126 clinical isolates of Shigella sonnei. CONCLUSIONS: We have developed a pipeline that contributes to the improved capacity for antimicrobial resistance detection afforded by next generation sequencing technologies, allowing for rapid detection of antimicrobial resistance genes directly from sequencing data. SEAR uses raw sequencing data via an intuitive interface so can be run rapidly without requiring advanced bioinformatic skills or resources. Finally, we show that SEAR is effective in detecting antimicrobial resistance genes in metagenomic and isolate sequencing data from both environmental metagenomes and sequencing data from clinical isolates. | 2015 | 26197475 |
| 5124 | 7 | 0.9947 | Oxford nanopore long-read sequencing enables the generation of complete bacterial and plasmid genomes without short-read sequencing. INTRODUCTION: Genome-based analysis is crucial in monitoring antibiotic-resistant bacteria (ARB)and antibiotic-resistance genes (ARGs). Short-read sequencing is typically used to obtain incomplete draft genomes, while long-read sequencing can obtain genomes of multidrug resistance (MDR) plasmids and track the transmission of plasmid-borne antimicrobial resistance genes in bacteria. However, long-read sequencing suffers from low-accuracy base calling, and short-read sequencing is often required to improve genome accuracy. This increases costs and turnaround time. METHODS: In this study, a novel ONT sequencing method is described, which uses the latest ONT chemistry with improved accuracy to assemble genomes of MDR strains and plasmids from long-read sequencing data only. Three strains of Salmonella carrying MDR plasmids were sequenced using the ONT SQK-LSK114 kit with flow cell R10.4.1, and de novo genome assembly was performed with average read accuracy (Q > 10) of 98.9%. RESULTS AND DISCUSSION: For a 5-Mb-long bacterial genome, finished genome sequences with accuracy of >99.99% could be obtained at 75× sequencing coverage depth using Flye and Medaka software. Thus, this new ONT method greatly improves base-calling accuracy, allowing for the de novo assembly of high-quality finished bacterial or plasmid genomes without the need for short-read sequencing. This saves both money and time and supports the application of ONT data in critical genome-based epidemiological analyses. The novel ONT approach described in this study can take the place of traditional combination genome assembly based on short- and long-read sequencing, enabling pangenomic analyses based on high-quality complete bacterial and plasmid genomes to monitor the spread of antibiotic-resistant bacteria and antibiotic resistance genes. | 2023 | 37256057 |
| 5114 | 8 | 0.9945 | Datasets for benchmarking antimicrobial resistance genes in bacterial metagenomic and whole genome sequencing. Whole genome sequencing (WGS) is a key tool in identifying and characterising disease-associated bacteria across clinical, agricultural, and environmental contexts. One increasingly common use of genomic and metagenomic sequencing is in identifying the type and range of antimicrobial resistance (AMR) genes present in bacterial isolates in order to make predictions regarding their AMR phenotype. However, there are a large number of alternative bioinformatics software and pipelines available, which can lead to dissimilar results. It is, therefore, vital that researchers carefully evaluate their genomic and metagenomic AMR analysis methods using a common dataset. To this end, as part of the Microbial Bioinformatics Hackathon and Workshop 2021, a 'gold standard' reference genomic and simulated metagenomic dataset was generated containing raw sequence reads mapped against their corresponding reference genome from a range of 174 potentially pathogenic bacteria. These datasets and their accompanying metadata are freely available for use in benchmarking studies of bacteria and their antimicrobial resistance genes and will help improve tool development for the identification of AMR genes in complex samples. | 2022 | 35705638 |
| 3771 | 9 | 0.9945 | RFPlasmid: predicting plasmid sequences from short-read assembly data using machine learning. Antimicrobial-resistance (AMR) genes in bacteria are often carried on plasmids and these plasmids can transfer AMR genes between bacteria. For molecular epidemiology purposes and risk assessment, it is important to know whether the genes are located on highly transferable plasmids or in the more stable chromosomes. However, draft whole-genome sequences are fragmented, making it difficult to discriminate plasmid and chromosomal contigs. Current methods that predict plasmid sequences from draft genome sequences rely on single features, like k-mer composition, circularity of the DNA molecule, copy number or sequence identity to plasmid replication genes, all of which have their drawbacks, especially when faced with large single-copy plasmids, which often carry resistance genes. With our newly developed prediction tool RFPlasmid, we use a combination of multiple features, including k-mer composition and databases with plasmid and chromosomal marker proteins, to predict whether the likely source of a contig is plasmid or chromosomal. The tool RFPlasmid supports models for 17 different bacterial taxa, including Campylobacter, Escherichia coli and Salmonella, and has a taxon agnostic model for metagenomic assemblies or unsupported organisms. RFPlasmid is available both as a standalone tool and via a web interface. | 2021 | 34846288 |
| 5126 | 10 | 0.9944 | Blanket antimicrobial resistance gene database with structural information, BOARDS, provides insights on historical landscape of resistance prevalence and effects of mutations in enzyme structure. Antimicrobial resistance (AMR) in pathogenic bacteria poses a significant threat to public health, yet there is still a need for development in the tools to deeply understand AMR genes based on genetic or structural information. In this study, we present an interactive web database named Blanket Overarching Antimicrobial-Resistance gene Database with Structural information (BOARDS, sbml.unist.ac.kr), a database that comprehensively includes 3,943 reported AMR gene information for 1,997 extended spectrum beta-lactamase (ESBL) and 1,946 other genes as well as a total of 27,395 predicted protein structures. These structures, which include both wild-type AMR genes and their mutants, were derived from 80,094 publicly available whole-genome sequences. In addition, we developed the rapid analysis and detection tool of antimicrobial-resistance (RADAR), a one-stop analysis pipeline to detect AMR genes across whole-genome sequencing (WGSs). By integrating BOARDS and RADAR, the AMR prevalence landscape for eight multi-drug resistant pathogens was reconstructed, leading to unexpected findings such as the pre-existence of the MCR genes before their official reports. Enzymatic structure prediction-based analysis revealed that the occurrence of mutations found in some ESBL genes was found to be closely related to the binding affinities with their antibiotic substrates. Overall, BOARDS can play a significant role in performing in-depth analysis on AMR.IMPORTANCEWhile the increasing antibiotic resistance (AMR) in pathogen has been a burden on public health, effective tools for deep understanding of AMR based on genetic or structural information remain limited. In this study, a blanket overarching antimicrobial-resistance gene database with structure information (BOARDS)-a web-based database that comprehensively collected AMR gene data with predictive protein structural information was constructed. Additionally, we report the development of a RADAR pipeline that can analyze whole-genome sequences as well. BOARDS, which includes sequence and structural information, has shown the historical landscape and prevalence of the AMR genes and can provide insight into single-nucleotide polymorphism effects on antibiotic degrading enzymes within protein structures. | 2024 | 38085058 |
| 7698 | 11 | 0.9944 | Detecting horizontal gene transfer with metagenomics co-barcoding sequencing. Horizontal gene transfer (HGT) is the process through which genetic information is transferred between different genomes and that played a crucial role in bacterial evolution. HGT can enable bacteria to rapidly acquire antibiotic resistance and bacteria that have acquired resistance is spreading within the microbiome. Conventional methods of characterizing HGT patterns include short-read metagenomic sequencing (short-reads mNGS), long-read sequencing, and single-cell sequencing. These approaches present several limitations, such as short-read fragments, high amounts of input DNA, and sequencing costs, respectively. Here, we attempt to circumvent present limitations to detect HGT by developing a metagenomics co-barcode sequencing workflow (MECOS) and applying it to the human and mouse gut microbiomes. In addition to that, we have over 10-fold increased contig length compared to short-reads mNGS; we also obtained exceeding 30 million paired reads with co-barcode information. Applying the novel bioinformatic pipeline, we integrated this co-barcoding information and the context information from long reads, and observed over 50-fold HGT events after we corrected the potential wrong HGT events. Specifically, we detected approximately 3,000 HGT blocks in individual samples, encompassing ~6,000 genes and ~100 taxonomic groups, including loci conferring tetracycline resistance through ribosomal protection. MECOS provides a valuable tool for investigating HGT and advance our understanding on the evolution of natural microbial communities within hosts.IMPORTANCEIn this study, to better identify horizontal gene transfer (HGT) in individual samples, we introduce a new co-barcoding sequencing system called metagenomics co-barcoding sequencing (MECOS), which has three significant improvements: (i) long DNA fragment extraction, (ii) a special transposome insertion, (iii) hybridization of DNA to barcode beads, and (4) an integrated bioinformatic pipeline. Using our approach, we have over 10-fold increased contig length compared to short-reads mNGS, and observed over 50-fold HGT events after we corrected the potential wrong HGT events. Our results indicate the presence of approximately 3,000 HGT blocks, involving roughly 6,000 genes and 100 taxonomic groups in individual samples. Notably, these HGT events are predominantly enriched in genes that confer tetracycline resistance via ribosomal protection. MECOS is a useful tool for investigating HGT and the evolution of natural microbial communities within hosts, thereby advancing our understanding of microbial ecology and evolution. | 2024 | 38315121 |
| 4778 | 12 | 0.9943 | DNA extraction of microbial DNA directly from infected tissue: an optimized protocol for use in nanopore sequencing. Identification of bacteria causing tissue infections can be comprehensive and, in the cases of non- or slow-growing bacteria, near impossible with conventional methods. Performing shotgun metagenomic sequencing on bacterial DNA extracted directly from the infected tissue may improve time to diagnosis and targeted treatment considerably. However, infected tissue consists mainly of human DNA (hDNA) which hampers bacterial identification. In this proof of concept study, we present a modified version of the Ultra-Deep Microbiome Prep kit for DNA extraction procedure, removing additional human DNA. Tissue biopsies from 3 patients with orthopedic implant-related infections containing varying degrees of Staphylococcus aureus were included. Subsequent DNA shotgun metagenomic sequencing using Oxford Nanopore Technologies' (ONT) MinION platform and ONTs EPI2ME WIMP and ARMA bioinformatic workflows for microbe and antibiotic resistance genes identification, respectively. The modified DNA extraction protocol led to an additional ~10-fold reduction of human DNA while preserving S. aureus DNA. Including the DNA sequencing and bioinformatics analyses, the presented protocol has the potential of identifying the infection-causing pathogen in infected tissue within 7 hours after biopsy. However, due to low number of S. aureus reads, positive identification of antibiotic resistance genes was not possible. | 2020 | 32076089 |
| 5110 | 13 | 0.9943 | Surveillance of carbapenem-resistant organisms using next-generation sequencing. The genomic data generated from next-generation sequencing (NGS) provides nucleotide-level resolution of bacterial genomes which is critical for disease surveillance and the implementation of prevention strategies to interrupt the spread of antimicrobial resistance (AMR) bacteria. Infection with AMR bacteria, including Gram-negative Carbapenem-Resistant Organisms (CRO), may be acute and recurrent-once they have colonized a patient, they are notoriously difficult to eradicate. Through phylogenetic tools that assess the single nucleotide polymorphisms (SNPs) within a pathogen genome dataset, public health scientists can estimate the genetic identity between isolates. This information is used as an epidemiologic proxy of a putative outbreak. Pathogens with minimal to no differences in SNPs are likely to be the same strain attributable to a common source or transmission between cases. These genomic comparisons enhance public health response by prompting targeted intervention and infection control measures. This methodology overview demonstrates the utility of phenotypic and molecular assays, antimicrobial susceptibility testing (AST), NGS, publicly available genomics databases, and open-source bioinformatics pipelines for a tiered workflow to detect resistance genes and potential clusters of illness. These methods, when used in combination, facilitate a genomic surveillance workflow for detecting potential AMR bacterial outbreaks to inform epidemiologic investigations. Use of this workflow helps to target and focus epidemiologic resources to the cases with the highest likelihood of being related. | 2023 | 37255756 |
| 5121 | 14 | 0.9943 | Rapid Nanopore Whole-Genome Sequencing for Anthrax Emergency Preparedness. Human anthrax cases necessitate rapid response. We completed Bacillus anthracis nanopore whole-genome sequencing in our high-containment laboratory from a human anthrax isolate hours after receipt. The de novo assembled genome showed no evidence of known antimicrobial resistance genes or introduced plasmid(s). Same-day genomic characterization enhances public health emergency response. | 2020 | 31961318 |
| 5122 | 15 | 0.9943 | Clinical long-read metagenomic sequencing of culture-negative infective endocarditis reveals genomic features and antimicrobial resistance. BACKGROUND: Infective endocarditis (IE) poses significant diagnostic challenges, particularly in blood culture-negative cases where fastidious bacteria evade detection. Metagenomic-based nanopore sequencing enables rapid pathogen detection and provides a new approach for the diagnosis of IE. METHOD: Two cases of blood culture-negative infective endocarditis (IE) were analyzed using nanopore sequencing with an in silico host-depletion approach. Complete genome reconstruction and antimicrobial resistance gene annotation were successfully performed. RESULTS: Within an hour of sequencing, EPI2ME classified nanopore reads, identifying Corynebacterium striatum in IE patient 1 and Granulicatella adiacens in IE patient 2. After 18 h, long-read sequencing successfully reconstructed a single circular genome of C. striatum in IE patient 1, whereas short-read sequencing was used to compare but produced fragmented assemblies. Based on these results, long-read sequencing was exclusively used for IE patient 2, allowing for the complete and accurate assembly of G. adiacens, confirming the presence of these bacteria in the clinical samples. In addition to pathogen identification, antimicrobial resistance (AMR) genes were detected in both genomes. Notably, in C. striatum, regions containing a class 1 integron and multiple novel mobile genetic elements (ISCost1, ISCost2, Tn7838 and Tn7839) were identified, collectively harbouring six AMR genes. This is the first report of such elements in C. striatum, highlighting the potential of nanopore long-read sequencing for comprehensive pathogen characterization in IE cases. CONCLUSIONS: This study highlights the effectiveness of host-depleted, long-read nanopore metagenomics for direct pathogen identification and accurate genome reconstruction, including antimicrobial resistance gene detection. The approach enables same-day diagnostic reporting within a matter of hours. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12879-025-11741-5. | 2025 | 41087996 |
| 7697 | 16 | 0.9942 | Impact of sample multiplexing on detection of bacteria and antimicrobial resistance genes in pig microbiomes using long-read sequencing. The effects of sample multiplexing on the detection sensitivity of antimicrobial resistance genes (ARGs) and pathogenic bacteria in metagenomic sequencing remain underexplored in newer sequencing technologies such as Oxford Nanopore Technologies (ONT), despite its critical importance for surveillance applications. Here, we evaluate how different multiplexing levels (four and eight samples per flowcell) on two ONT platforms, GridION and PromethION, influence the detection of ARGs, bacterial taxa and pathogens. While overall resistome and bacterial community profiles remained comparable across multiplexing levels, ARG detection was more comprehensive in the four-plex setting with low-abundance genes. Similarly, pathogen detection was more sensitive in the four-plex, identifying a broader range of low abundant bacterial taxa compared to the eight-plex. However, triplicate sequencing of the same microbiomes revealed that these differences were primarily due to sequencing variability rather than multiplexing itself, as similar inconsistencies were observed across replicates. Given that eight-plex sequencing is more cost-effective while still capturing the overall resistome and bacterial community composition, it may be the preferred option for general surveillance. Lower multiplexing levels may be advantageous for applications requiring enhanced sensitivity, such as detailed pathogen research. These findings highlight the trade-off between multiplexing efficiency, sequencing depth, and cost in metagenomic studies. | 2025 | 40611965 |
| 4943 | 17 | 0.9942 | Targeted sequencing of Enterobacterales bacteria using CRISPR-Cas9 enrichment and Oxford Nanopore Technologies. Sequencing DNA directly from patient samples enables faster pathogen characterization compared to traditional culture-based approaches, but often yields insufficient sequence data for effective downstream analysis. CRISPR-Cas9 enrichment is designed to improve the yield of low abundance sequences but has not been thoroughly explored with Oxford Nanopore Technologies (ONT) for use in clinical bacterial epidemiology. We designed CRISPR-Cas9 guide RNAs to enrich the human pathogen Klebsiella pneumoniae, by targeting multi-locus sequence type (MLST) and transfer RNA (tRNA) genes, as well as common antimicrobial resistance (AMR) genes and the resistance-associated integron gene intI1. We validated enrichment performance in 20 K. pneumoniae isolates, finding that guides generated successful enrichment across all conserved sites except for one AMR gene in two isolates. Enrichment of MLST genes led to a correct allele call in all seven loci for 8 out of 10 isolates that had depth of 30× or more in these regions. We then compared enriched and unenriched sequencing of three human fecal samples spiked with K. pneumoniae at varying abundance. Enriched sequencing generated 56× and 11.3× the number of AMR and MLST reads, respectively, compared to unenriched sequencing, and required approximately one-third of the computational storage space. Targeting the intI1 gene often led to detection of 10-20 proximal resistance genes due to the long reads produced by ONT sequencing. We demonstrated that CRISPR-Cas9 enrichment combined with ONT sequencing enabled improved genomic characterization outcomes over unenriched sequencing of patient samples. This method could be used to inform infection control strategies by identifying patients colonized with high-risk strains. IMPORTANCE: Understanding bacteria in complex samples can be challenging due to their low abundance, which often results in insufficient data for analysis. To improve the detection of harmful bacteria, we implemented a technique aimed at increasing the amount of data from target pathogens when combined with modern DNA sequencing technologies. Our technique uses CRISPR-Cas9 to target specific gene sequences in the bacterial pathogen Klebsiella pneumoniae and improve recovery from human stool samples. We found our enrichment method to significantly outperform traditional methods, generating far more data originating from our target genes. Additionally, we developed new computational techniques to further enhance the analysis, providing a thorough method for characterizing pathogens from complex biological samples. | 2025 | 39772804 |
| 5119 | 18 | 0.9942 | ROCker models for reliable detection and typing of short-read sequences carrying mcr, erm, mph, and lnu antibiotic resistance genes. Quantitative monitoring of emerging antimicrobial resistance genes (ARGs) using short-read sequences remains challenging due to the high frequency of amino acid functional domains and motifs shared with related but functionally distinct (non-target) proteins. To facilitate ARG monitoring efforts using unassembled short reads, we present novel ROCker models for mcr, mph, erm, and lnu ARG families, as well as models for variants of special public health concern within these families, including mcr-1, mphA, ermB, lnuF, lnuB, and lnuG genes. For this, we curated target gene sequence sets for model training and built these models using the recently updated ROCker V2 pipeline (Gerhardt et al., in review). To validate our models, we simulated reads from the whole genome of ARG-carrying isolates spanning a range of common read lengths and used them to challenge the filtering efficacy of ROCker versus common static filtering approaches, such as similarity searches using BLASTx with various e-value thresholds or hidden Markov models. ROCker models consistently showed F1 scores up to 10× higher (31% higher on average) and lower false-positive (by 30%, on average) and false-negative (by 16%, on average) rates based on 250 bp reads compared to alternative methods. The ROCker models and all related reference materials and data are freely available through http://enve-omics.ce.gatech.edu/rocker/models, further expanding the available model collection previously developed for other genes. Their application to short-read metagenomes, metatranscriptomes, and PCR amplicon data should facilitate more accurate classification and quantification of unassembled short-read sequences for these ARG families and specific genes.IMPORTANCEAntimicrobial resistance gene families encoding erm and mph genes confer resistance to the macrolide class of antimicrobials, which are used to treat a wide range of infections. Similarly, the mcr gene family confers resistance to polymyxin E (colistin), a drug of last resort for many serious drug-resistant bacterial infections, and the lnu gene family confers resistance to lincomycin, which is reserved for patients allergic to penicillin or where bacteria have developed resistance to other antimicrobials. Assessing the prevalence of these genes in clinical or environmental samples and monitoring their spread to new pathogens are thus important for quantifying the associated public health risk. However, detecting these and other resistance genes in short-read sequence data is technically challenging. Our ROCker bioinformatic pipeline achieves reliable detection and typing of broad-range target gene sequences in complex data sets, thus contributing toward solving an important problem in ongoing surveillance efforts of antimicrobial resistance. | 2025 | 41143534 |
| 3770 | 19 | 0.9942 | Detection of mobile genetic elements associated with antibiotic resistance in Salmonella enterica using a newly developed web tool: MobileElementFinder. OBJECTIVES: Antimicrobial resistance (AMR) in clinically relevant bacteria is a growing threat to public health globally. In these bacteria, antimicrobial resistance genes are often associated with mobile genetic elements (MGEs), which promote their mobility, enabling them to rapidly spread throughout a bacterial community. METHODS: The tool MobileElementFinder was developed to enable rapid detection of MGEs and their genetic context in assembled sequence data. MGEs are detected based on sequence similarity to a database of 4452 known elements augmented with annotation of resistance genes, virulence factors and detection of plasmids. RESULTS: MobileElementFinder was applied to analyse the mobilome of 1725 sequenced Salmonella enterica isolates of animal origin from Denmark, Germany and the USA. We found that the MGEs were seemingly conserved according to multilocus ST and not restricted to either the host or the country of origin. Moreover, we identified putative translocatable units for specific aminoglycoside, sulphonamide and tetracycline genes. Several putative composite transposons were predicted that could mobilize, among others, AMR, metal resistance and phosphodiesterase genes associated with macrophage survivability. This is, to our knowledge, the first time the phosphodiesterase-like pdeL has been found to be potentially mobilized into S. enterica. CONCLUSIONS: MobileElementFinder is a powerful tool to study the epidemiology of MGEs in a large number of genome sequences and to determine the potential for genomic plasticity of bacteria. This web service provides a convenient method of detecting MGEs in assembled sequence data. MobileElementFinder can be accessed at https://cge.cbs.dtu.dk/services/MobileElementFinder/. | 2021 | 33009809 |