# | Rank | Similarity | Title + Abs. | Year | PMID |
|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 |
| 9076 | 0 | 0.9940 | ResiDB: An automated database manager for sequence data. The amount of publicly available DNA sequence data is drastically increasing, making it a tedious task to create sequence databases necessary for the design of diagnostic assays. The selection of appropriate sequences is especially challenging in genes affected by frequent point mutations such as antibiotic resistance genes. To overcome this issue, we have designed the webtool resiDB, a rapid and user-friendly sequence database manager for bacteria, fungi, viruses, protozoa, invertebrates, plants, archaea, environmental and whole genome shotgun sequence data. It automatically identifies and curates sequence clusters to create custom sequence databases based on user-defined input sequences. A collection of helpful visualization tools gives the user the opportunity to easily access, evaluate, edit, and download the newly created database. Consequently, researchers do no longer have to manually manage sequence data retrieval, deal with hardware limitations, and run multiple independent software tools, each having its own requirements, input and output formats. Our tool was developed within the H2020 project FAPIC aiming to develop a single diagnostic assay targeting all sepsis-relevant pathogens and antibiotic resistance mechanisms. ResiDB is freely accessible to all users through https://residb.ait.ac.at/. | 2021 | 33495705 |
| 5068 | 1 | 0.9938 | Ultrasensitive Label-Free Detection of Unamplified Multidrug-Resistance Bacteria Genes with a Bimodal Waveguide Interferometric Biosensor. Infections by multidrug-resistant bacteria are becoming a major healthcare emergence with millions of reported cases every year and an increasing incidence of deaths. An advanced diagnostic platform able to directly detect and identify antimicrobial resistance in a faster way than conventional techniques could help in the adoption of early and accurate therapeutic interventions, limiting the actual negative impact on patient outcomes. With this objective, we have developed a new biosensor methodology using an ultrasensitive nanophotonic bimodal waveguide interferometer (BiMW), which allows a rapid and direct detection, without amplification, of two prevalent and clinically relevant Gram-negative antimicrobial resistance encoding sequences: the extended-spectrum betalactamase-encoding gene blaCTX-M-15 and the carbapenemase-encoding gene blaNDM-5 We demonstrate the extreme sensitivity and specificity of our biosensor methodology for the detection of both gene sequences. Our results show that the BiMW biosensor can be employed as an ultrasensitive (attomolar level) and specific diagnostic tool for rapidly (less than 30 min) identifying drug resistance. The BiMW nanobiosensor holds great promise as a powerful tool for the control and management of healthcare-associated infections by multidrug-resistant bacteria. | 2020 | 33086716 |
| 5097 | 2 | 0.9937 | Comparing Graph Sample and Aggregation (SAGE) and Graph Attention Networks in the Prediction of Drug-Gene Associations of Extended-Spectrum Beta-Lactamases in Periodontal Infections and Resistance. INTRODUCTION: Gram-negative bacteria exhibit more antibiotic resistance than gram-positive bacteria due to their cell wall structure and composition differences. Porins, or protein channels in these bacteria, can allow small, hydrophilic antibiotics to diffuse, affecting their susceptibility. Mutations in porin protein genes can also impair antibiotic entry. Predicting drug-gene associations of extended-spectrum beta-lactamases (ESBLs) is crucial as they confer resistance to beta-lactam antibiotics, challenging the treatment of infections. This aids clinicians in selecting suitable treatments, optimizing drug usage, enhancing patient outcomes, and controlling antibiotic resistance in healthcare settings. Graph-based neural networks can predict drug-gene associations in periodontal infections and resistance. The aim of the study was to predict drug-gene associations of ESBLs in periodontal infections and resistance. METHODS: The study focuses on analyzing drug-gene associations using probes and drugs. The data was converted into graph language, assigning nodes and edges for drugs and genes. Graph neural networks (GNNs) and similar algorithms were implemented using Google Colab and Python. Cytoscape and CytoHubba are open-source software platforms used for network analysis and visualization. GNNs were used for tasks like node classification, link prediction, and graph-level prediction. Three graph-based models were used: graph convolutional network (GCN), Graph SAGE, and graph attention network (GAT). Each model was trained for 200 epochs using the Adam optimizer with a learning rate of 0.01 and a weight decay of 5e-4. RESULTS: The drug-gene association network has 57 nodes, 79 edges, and a 2.730 characteristic path length. Its structure, organization, and connectivity are analyzed using the GCN and Graph SAGE, which show high accuracy, precision, recall, and an F1-score of 0.94. GAT's performance metrics are lower, with an accuracy of 0.68, precision of 0.47, recall of 0.68, and F1-score of 0.56, suggesting that it may not be as effective in capturing drug-gene relationships. CONCLUSION: Compared to ESBLs, both GCN and Graph SAGE demonstrate excellent performance with accuracy, precision, recall, and an F1-score of 0.94. These results indicate that GCN and Graph SAGE are highly effective in predicting drug-gene associations related to ESBLs. GCN and Graph SAGE outperform GAT in predicting drug-gene associations for ESBLs. Improvements include data augmentation, regularization, and cross-validation. Ethical considerations, fairness, and open-source implementations are crucial for future research in precision periodontal treatment. | 2024 | 39347119 |
| 5826 | 3 | 0.9936 | Rapid and accurate sepsis diagnostics via a novel probe-based multiplex real-time PCR system. Sepsis is a critical clinical emergency that requires prompt diagnosis and intervention. Its prevalence has increased due to the aging population and increased antibiotic resistance. Early identification and the use of innovative technologies are crucial for improving patient outcomes. Modern methodologies are needed to minimize the turnaround time for diagnosis and improve outcomes. Rapid diagnostic tests and multiplex PCR are effective but have limitations in identifying a range of pathogens and target genes. Our study evaluated two novel probe-based multiplex real-time PCR systems: the SEPSI ID and SEPSI DR panels. These systems can quickly identify bacterial and fungal pathogens, alongside antibiotic resistance genes. The assays cover 29 microorganisms (gram-negative bacteria, gram-positive bacteria, yeast, and mold species), alongside 23 resistance genes and four virulence factors. A streamlined workflow uses 2 µL of broth from positive blood cultures (BCs) without nucleic acid extraction and provides results in approximately 1 h. We present the results from an evaluation of 228 BCs and 22 isolates previously characterized by whole-genome sequencing. In comparison to the reference methods, the SEPSI ID panel demonstrated a sensitivity of 96.88%, a specificity of 100%, and a PPV of 100%, whereas the SEPSI DR panel showed a sensitivity of 97.8%, a PPV of 89.7%, and a specificity of 96.7%. Both panels also identified additional pathogens and resistance-related targets not detected by conventional methods. This assay shows promise for rapidly and accurately diagnosing sepsis. Future studies should validate its performance in various clinical settings to enhance sepsis management and improve patient outcomes.IMPORTANCEWe present a new diagnostic method that enables the quick and precise identification of pathogens and resistance genes from positive blood cultures, eliminating the need for nucleic acid extraction. This technique can also be used on fresh pathogen cultures. It has the potential to greatly improve treatment protocols, leading to better patient outcomes, more responsible antibiotic use, and more efficient management of healthcare resources. | 2025 | 41025980 |
| 9742 | 4 | 0.9931 | BOCS: DNA k-mer content and scoring for rapid genetic biomarker identification at low coverage. A single, inexpensive diagnostic test capable of rapidly identifying a wide range of genetic biomarkers would prove invaluable in precision medicine. Previous work has demonstrated the potential for high-throughput, label-free detection of A-G-C-T content in DNA k-mers, providing an alternative to single-letter sequencing while also having inherent lossy data compression and massively parallel data acquisition. Here, we apply a new bioinformatics algorithm - block optical content scoring (BOCS) - capable of using the high-throughput content k-mers for rapid, broad-spectrum identification of genetic biomarkers. BOCS uses content-based sequence alignment for probabilistic mapping of k-mer contents to gene sequences within a biomarker database, resulting in a probability ranking of genes on a content score. Simulations of the BOCS algorithm reveal high accuracy for identification of single antibiotic resistance genes, even in the presence of significant sequencing errors (100% accuracy for no sequencing errors, and >90% accuracy for sequencing errors at 20%), and at well below full coverage of the genes. Simulations for detecting multiple resistance genes within a methicillin-resistant Staphylococcus aureus (MRSA) strain showed 100% accuracy at an average gene coverage of merely 0.515, when the k-mer lengths were variable and with 4% sequencing error within the k-mer blocks. Extension of BOCS to cancer and other genetic diseases met or exceeded the results for resistance genes. Combined with a high-throughput content-based sequencing technique, the BOCS algorithm potentiates a test capable of rapid diagnosis and profiling of genetic biomarkers ranging from antibiotic resistance to cancer and other genetic diseases. | 2019 | 31173943 |
| 9741 | 5 | 0.9930 | ARGai 1.0: A GAN augmented in silico approach for identifying resistant genes and strains in E. coli using vision transformer. The emergence of infectious disease and antibiotic resistance in bacteria like Escherichia coli (E. coli) shows the necessity for novel computational techniques for identifying essential genes that contribute to resistance. The task of identifying resistant strains and multi-drug patterns in E. coli is a major challenge with whole genome sequencing (WGS) and next-generation sequencing (NGS) data. To address this issue, we suggest ARGai 1.0 a deep learning architecture enhanced with generative adversarial networks (GANs). We mitigate data scarcity difficulties by augmenting limited experimental datasets with synthetic data generated by GANs. Our in-silico method (augmentation with feature selection) improves the identification of resistance genes in E. coli by using feature extraction techniques to identify valuable features from actual and GAN-generated data. Employing comprehensive validation, we exhibit the effectiveness of our ARGai 1.0 in precisely identifying the informative and resistant genes. In addition, our ARGai 1.0 identifies the resistant strains with a classification accuracy of 98.96 % on Deep Convolutional Generative Adversarial Network (DCGAN) augmented data. Additionally, ARGai 1.0 achieves more than 98 % of sensitivity and specificity. We also benchmark our ARGai 1.0 with several state-of-the-art AI models for resistant strain classification. In the fight against antibiotic resistance, ARGai 1.0 offers a promising avenue for computational genomics. With implications for research and clinical practice, this work shows the potential of deep networks with GAN augmentation as a practical and successful method for gene identification in E. coli. | 2025 | 39813877 |
| 5119 | 6 | 0.9929 | ROCker models for reliable detection and typing of short-read sequences carrying mcr, erm, mph, and lnu antibiotic resistance genes. Quantitative monitoring of emerging antimicrobial resistance genes (ARGs) using short-read sequences remains challenging due to the high frequency of amino acid functional domains and motifs shared with related but functionally distinct (non-target) proteins. To facilitate ARG monitoring efforts using unassembled short reads, we present novel ROCker models for mcr, mph, erm, and lnu ARG families, as well as models for variants of special public health concern within these families, including mcr-1, mphA, ermB, lnuF, lnuB, and lnuG genes. For this, we curated target gene sequence sets for model training and built these models using the recently updated ROCker V2 pipeline (Gerhardt et al., in review). To validate our models, we simulated reads from the whole genome of ARG-carrying isolates spanning a range of common read lengths and used them to challenge the filtering efficacy of ROCker versus common static filtering approaches, such as similarity searches using BLASTx with various e-value thresholds or hidden Markov models. ROCker models consistently showed F1 scores up to 10× higher (31% higher on average) and lower false-positive (by 30%, on average) and false-negative (by 16%, on average) rates based on 250 bp reads compared to alternative methods. The ROCker models and all related reference materials and data are freely available through http://enve-omics.ce.gatech.edu/rocker/models, further expanding the available model collection previously developed for other genes. Their application to short-read metagenomes, metatranscriptomes, and PCR amplicon data should facilitate more accurate classification and quantification of unassembled short-read sequences for these ARG families and specific genes.IMPORTANCEAntimicrobial resistance gene families encoding erm and mph genes confer resistance to the macrolide class of antimicrobials, which are used to treat a wide range of infections. Similarly, the mcr gene family confers resistance to polymyxin E (colistin), a drug of last resort for many serious drug-resistant bacterial infections, and the lnu gene family confers resistance to lincomycin, which is reserved for patients allergic to penicillin or where bacteria have developed resistance to other antimicrobials. Assessing the prevalence of these genes in clinical or environmental samples and monitoring their spread to new pathogens are thus important for quantifying the associated public health risk. However, detecting these and other resistance genes in short-read sequence data is technically challenging. Our ROCker bioinformatic pipeline achieves reliable detection and typing of broad-range target gene sequences in complex data sets, thus contributing toward solving an important problem in ongoing surveillance efforts of antimicrobial resistance. | 2025 | 41143534 |
| 9744 | 7 | 0.9929 | PARGT: a software tool for predicting antimicrobial resistance in bacteria. With the ever-increasing availability of whole-genome sequences, machine-learning approaches can be used as an alternative to traditional alignment-based methods for identifying new antimicrobial-resistance genes. Such approaches are especially helpful when pathogens cannot be cultured in the lab. In previous work, we proposed a game-theory-based feature evaluation algorithm. When using the protein characteristics identified by this algorithm, called 'features' in machine learning, our model accurately identified antimicrobial resistance (AMR) genes in Gram-negative bacteria. Here we extend our study to Gram-positive bacteria showing that coupling game-theory-identified features with machine learning achieved classification accuracies between 87% and 90% for genes encoding resistance to the antibiotics bacitracin and vancomycin. Importantly, we present a standalone software tool that implements the game-theory algorithm and machine-learning model used in these studies. | 2020 | 32620856 |
| 8405 | 8 | 0.9929 | Mapping Major Disease Resistance Genes in Soybean by Genome-Wide Association Studies. Soybean is one of the most valuable agricultural crops in the world. Besides, this legume is constantly attacked by a wide range of pathogens (fungi, bacteria, viruses, and nematodes) compromising yield and increasing production costs. One of the major disease management strategies is the genetic resistance provided by single genes and quantitative trait loci (QTL). Identifying the genomic regions underlying the resistance against these pathogens on soybean is one of the first steps performed by molecular breeders. In the past, genetic mapping studies have been widely used to discover these genomic regions. However, over the last decade, advances in next-generation sequencing technologies and their subsequent cost decreasing led to the development of cost-effective approaches to high-throughput genotyping. Thus, genome-wide association studies applying thousands of SNPs in large sets composed of diverse soybean accessions have been successfully done. In this chapter, a comprehensive review of the majority of GWAS for soybean diseases published since this approach was developed is provided. Important diseases caused by Heterodera glycines, Phytophthora sojae, and Sclerotinia sclerotiorum have been the focus of the several GWAS. However, other bacterial and fungi diseases also have been targets of GWAS. As such, this GWAS summary can serve as a guide for future studies of these diseases. The protocol begins by describing several considerations about the pathogens and bringing different procedures of molecular characterization of them. Advice to choose the best isolate/race to maximize the discovery of multiple R genes or to directly map an effective R gene is provided. A summary of protocols, methods, and tools to phenotyping the soybean panel is given to several diseases. We also give details of options of DNA extraction protocols and genotyping methods, and we describe parameters of SNP quality to soybean data. Websites and their online tools to obtain genotypic and phenotypic data for thousands of soybean accessions are highlighted. Finally, we report several tricks and tips in Subheading 4, especially related to composing the soybean panel as well as generating and analyzing the phenotype data. We hope this protocol will be helpful to achieve GWAS success in identifying resistance genes on soybean. | 2022 | 35641772 |
| 5828 | 9 | 0.9929 | Target-enriched sequencing enables accurate identification of bloodstream infections in whole blood. Bloodstream infections are within the top ten causes of death globally, with a mortality rate of up to 70%. Gold standard blood culture testing is time-consuming, resulting in delayed, but accurate, treatment. Molecular methods, such as RT-qPCR, have limited targets in one run. We present a new Ampliseq detection system (ADS) combining target amplification and next-generation sequencing for accurate identification of bacteria, fungi, and antimicrobial resistance determinants directly from blood samples. In this study, we included removal of human genomic DNA during nucleic acid extraction, optimized the target sequence set and drug resistance genes, performed antimicrobial resistance profiling of clinical isolates, and evaluated mock specimens and clinical samples by ADS. ADS successfully identified pathogens at the species-level in 36 h, from nucleic acid extraction to results. Besides pathogen identification, ADS can also present drug resistance profiles. ADS enabled detection of all bacteria and accurate identification of 47 pathogens. In 20 spiked samples and 8 clinical specimens, ADS detected at least 92.81% of reads mapped to pathogens. ADS also showed consistency with the three culture-negative samples, and correctly identified pathogens in four of five culture-positive clinical blood specimens. This Ampliseq-based technology promises broad coverage and accurate pathogen identification, helping clinicians to accurately diagnose and treat bloodstream infections. | 2022 | 34915067 |
| 5116 | 10 | 0.9928 | Prediction of Antimicrobial Resistance in Gram-Negative Bacteria From Whole-Genome Sequencing Data. BACKGROUND: Early detection of antimicrobial resistance in pathogens and prescription of more effective antibiotics is a fast-emerging need in clinical practice. High-throughput sequencing technology, such as whole genome sequencing (WGS), may have the capacity to rapidly guide the clinical decision-making process. The prediction of antimicrobial resistance in Gram-negative bacteria, often the cause of serious systemic infections, is more challenging as genotype-to-phenotype (drug resistance) relationship is more complex than for most Gram-positive organisms. METHODS AND FINDINGS: We have used NCBI BioSample database to train and cross-validate eight XGBoost-based machine learning models to predict drug resistance to cefepime, cefotaxime, ceftriaxone, ciprofloxacin, gentamicin, levofloxacin, meropenem, and tobramycin tested in Acinetobacter baumannii, Escherichia coli, Enterobacter cloacae, Klebsiella aerogenes, and Klebsiella pneumoniae. The input is the WGS data in terms of the coverage of known antibiotic resistance genes by shotgun sequencing reads. Models demonstrate high performance and robustness to class imbalanced datasets. CONCLUSION: Whole genome sequencing enables the prediction of antimicrobial resistance in Gram-negative bacteria. We present a tool that provides an in silico antibiogram for eight drugs. Predictions are accompanied with a reliability index that may further facilitate the decision making process. The demo version of the tool with pre-processed samples is available at https://vancampn.shinyapps.io/wgs2amr/. The stand-alone version of the predictor is available at https://github.com/pieterjanvc/wgs2amr/. | 2020 | 32528441 |
| 9083 | 11 | 0.9928 | ARGNet: using deep neural networks for robust identification and classification of antibiotic resistance genes from sequences. BACKGROUND: Emergence of antibiotic resistance in bacteria is an important threat to global health. Antibiotic resistance genes (ARGs) are some of the key components to define bacterial resistance and their spread in different environments. Identification of ARGs, particularly from high-throughput sequencing data of the specimens, is the state-of-the-art method for comprehensively monitoring their spread and evolution. Current computational methods to identify ARGs mainly rely on alignment-based sequence similarities with known ARGs. Such approaches are limited by choice of reference databases and may potentially miss novel ARGs. The similarity thresholds are usually simple and could not accommodate variations across different gene families and regions. It is also difficult to scale up when sequence data are increasing. RESULTS: In this study, we developed ARGNet, a deep neural network that incorporates an unsupervised learning autoencoder model to identify ARGs and a multiclass classification convolutional neural network to classify ARGs that do not depend on sequence alignment. This approach enables a more efficient discovery of both known and novel ARGs. ARGNet accepts both amino acid and nucleotide sequences of variable lengths, from partial (30-50 aa; 100-150 nt) sequences to full-length protein or genes, allowing its application in both target sequencing and metagenomic sequencing. Our performance evaluation showed that ARGNet outperformed other deep learning models including DeepARG and HMD-ARG in most of the application scenarios especially quasi-negative test and the analysis of prediction consistency with phylogenetic tree. ARGNet has a reduced inference runtime by up to 57% relative to DeepARG. CONCLUSIONS: ARGNet is flexible, efficient, and accurate at predicting a broad range of ARGs from the sequencing data. ARGNet is freely available at https://github.com/id-bioinfo/ARGNet , with an online service provided at https://ARGNet.hku.hk . Video Abstract. | 2024 | 38725076 |
| 5827 | 12 | 0.9928 | Duplex dPCR System for Rapid Identification of Gram-Negative Pathogens in the Blood of Patients with Bloodstream Infection: A Culture-Independent Approach. Early and accurate detection of pathogens is important to improve clinical outcomes of bloodstream infections (BSI), especially in the case of drug-resistant pathogens. In this study, we aimed to develop a culture-independent digital PCR (dPCR) system for multiplex detection of major sepsiscausing gram-negative pathogens and antimicrobial resistance genes using plasma DNA from BSI patients. Our duplex dPCR system successfully detected nine targets (five bacteria-specific targets and four antimicrobial resistance genes) through five reactions within 3 hours. The minimum detection limit was 50 ag of bacterial DNA, suggesting that 1 CFU/ml of bacteria in the blood can be detected. To validate the clinical applicability, cell-free DNA samples from febrile patients were tested with our system and confirmed high consistency with conventional blood culture. This system can support early identification of some drug-resistant gram-negative pathogens, which can help improving treatment outcomes of BSI. | 2021 | 34528911 |
| 2497 | 13 | 0.9927 | Rapid Simultaneous Detection of the Clinically Relevant Carbapenemase Resistance Genes blaKPC, blaOXA48, blaVIM and blaNDM with the Newly Developed Ready-to-Use qPCR CarbaScan LyoBead. Antibiotic resistance, in particular the dissemination of carbapenemase-producing organisms, poses a significant threat to global healthcare. This study introduces the qPCR CarbaScan LyoBead assay, a robust, accurate, and efficient tool for detecting key carbapenemase genes, including blaKPC, blaNDM, blaOXA-48, and blaVIM. The assay utilizes lyophilized beads, a technological advancement that enhances stability, simplifies handling, and eliminates the need for refrigeration. This feature renders it particularly well-suited for point-of-care diagnostics and resource-limited settings. The assay's capacity to detect carbapenemase genes directly from bacterial colonies without the need for extensive sample preparation has been demonstrated to streamline workflows and enable rapid diagnostic results. The assay demonstrated 100% specificity and sensitivity across a diverse range of bacterial strains, including multiple allelic variants of target genes, facilitating precise identification of resistance mechanisms. Bacterial strains of the species Acinetobacter baumannii, Citrobacter freundii, Escherichia coli, Enterobacter cloacae, Klebsiella pneumoniae and Pseudomonas aeruginosa were utilized as reference material for assay development (n = 9) and validation (n = 28). It is notable that the assay's long shelf life and minimal operational complexity further enhance its utility for large-scale implementation in healthcare, food safety, and environmental monitoring. The findings emphasize the necessity of continuous surveillance and the implementation of rapid diagnostic methods for the effective detection of resistance genes. Furthermore, the assay's potential applications in other fields, such as toxin-antitoxin system research and monitoring of resistant bacteria in the community, highlight its versatility. In conclusion, the qPCR CarbaScan LyoBead assay is a valuable tool that can contribute to the urgent need to combat antibiotic resistance and improve global public health outcomes. | 2025 | 39940986 |
| 5118 | 14 | 0.9927 | Automated extraction of genes associated with antibiotic resistance from the biomedical literature. The detection of bacterial antibiotic resistance phenotypes is important when carrying out clinical decisions for patient treatment. Conventional phenotypic testing involves culturing bacteria which requires a significant amount of time and work. Whole-genome sequencing is emerging as a fast alternative to resistance prediction, by considering the presence/absence of certain genes. A lot of research has focused on determining which bacterial genes cause antibiotic resistance and efforts are being made to consolidate these facts in knowledge bases (KBs). KBs are usually manually curated by domain experts to be of the highest quality. However, this limits the pace at which new facts are added. Automated relation extraction of gene-antibiotic resistance relations from the biomedical literature is one solution that can simplify the curation process. This paper reports on the development of a text mining pipeline that takes in English biomedical abstracts and outputs genes that are predicted to cause resistance to antibiotics. To test the generalisability of this pipeline it was then applied to predict genes associated with Helicobacter pylori antibiotic resistance, that are not present in common antibiotic resistance KBs or publications studying H. pylori. These genes would be candidates for further lab-based antibiotic research and inclusion in these KBs. For relation extraction, state-of-the-art deep learning models were used. These models were trained on a newly developed silver corpus which was generated by distant supervision of abstracts using the facts obtained from KBs. The top performing model was superior to a co-occurrence model, achieving a recall of 95%, a precision of 60% and F1-score of 74% on a manually annotated holdout dataset. To our knowledge, this project was the first attempt at developing a complete text mining pipeline that incorporates deep learning models to extract gene-antibiotic resistance relations from the literature. Additional related data can be found at https://github.com/AndreBrincat/Gene-Antibiotic-Resistance-Relation-Extraction. | 2022 | 35134132 |
| 9075 | 15 | 0.9927 | CamPype: an open-source workflow for automated bacterial whole-genome sequencing analysis focused on Campylobacter. BACKGROUND: The rapid expansion of Whole-Genome Sequencing has revolutionized the fields of clinical and food microbiology. However, its implementation as a routine laboratory technique remains challenging due to the growth of data at a faster rate than can be effectively analyzed and critical gaps in bioinformatics knowledge. RESULTS: To address both issues, CamPype was developed as a new bioinformatics workflow for the genomics analysis of sequencing data of bacteria, especially Campylobacter, which is the main cause of gastroenteritis worldwide making a negative impact on the economy of the public health systems. CamPype allows fully customization of stages to run and tools to use, including read quality control filtering, read contamination, reads extension and assembly, bacterial typing, genome annotation, searching for antibiotic resistance genes, virulence genes and plasmids, pangenome construction and identification of nucleotide variants. All results are processed and resumed in an interactive HTML report for best data visualization and interpretation. CONCLUSIONS: The minimal user intervention of CamPype makes of this workflow an attractive resource for microbiology laboratories with no expertise in bioinformatics as a first line method for bacterial typing and epidemiological analyses, that would help to reduce the costs of disease outbreaks, or for comparative genomic analyses. CamPype is publicly available at https://github.com/JoseBarbero/CamPype . | 2023 | 37474912 |
| 5100 | 16 | 0.9927 | DeepPBI-KG: a deep learning method for the prediction of phage-bacteria interactions based on key genes. Phages, the natural predators of bacteria, were discovered more than 100 years ago. However, increasing antimicrobial resistance rates have revitalized phage research. Methods that are more time-consuming and efficient than wet-laboratory experiments are needed to help screen phages quickly for therapeutic use. Traditional computational methods usually ignore the fact that phage-bacteria interactions are achieved by key genes and proteins. Methods for intraspecific prediction are rare since almost all existing methods consider only interactions at the species and genus levels. Moreover, most strains in existing databases contain only partial genome information because whole-genome information for species is difficult to obtain. Here, we propose a new approach for interaction prediction by constructing new features from key genes and proteins via the application of K-means sampling to select high-quality negative samples for prediction. Finally, we develop DeepPBI-KG, a corresponding prediction tool based on feature selection and a deep neural network. The results show that the average area under the curve for prediction reached 0.93 for each strain, and the overall AUC and area under the precision-recall curve reached 0.89 and 0.92, respectively, on the independent test set; these values are greater than those of other existing prediction tools. The forward and reverse validation results indicate that key genes and key proteins regulate and influence the interaction, which supports the reliability of the model. In addition, intraspecific prediction experiments based on Klebsiella pneumoniae data demonstrate the potential applicability of DeepPBI-KG for intraspecific prediction. In summary, the feature engineering and interaction prediction approaches proposed in this study can effectively improve the robustness and stability of interaction prediction, can achieve high generalizability, and may provide new directions and insights for rapid phage screening for therapy. | 2024 | 39344712 |
| 8448 | 17 | 0.9926 | Genome-Wide Association Analysis for Resistance to Coniothyrium glycines Causing Red Leaf Blotch Disease in Soybean. Soybean is a high oil and protein-rich legume with several production constraints. Globally, several fungi, viruses, nematodes, and bacteria cause significant yield losses in soybean. Coniothyrium glycines (CG), the causal pathogen for red leaf blotch disease, is the least researched and causes severe damage to soybean. The identification of resistant soybean genotypes and mapping of genomic regions associated with resistance to CG is critical for developing improved cultivars for sustainable soybean production. This study used single nucleotide polymorphism (SNP) markers generated from a Diversity Arrays Technology (DArT) platform to conduct a genome-wide association (GWAS) analysis of resistance to CG using 279 soybean genotypes grown in three environments. A total of 6395 SNPs was used to perform the GWAS applying a multilocus model Fixed and random model Circulating Probability Unification (FarmCPU) with correction of the population structure and a statistical test p-value threshold of 5%. A total of 19 significant marker-trait associations for resistance to CG were identified on chromosomes 1, 5, 6, 9, 10, 12, 13, 15, 16, 17, 19, and 20. Approximately 113 putative genes associated with significant markers for resistance to red leaf blotch disease were identified across soybean genome. Positional candidate genes associated with significant SNP loci-encoding proteins involved in plant defense responses and that could be associated with soybean defenses against CG infection were identified. The results of this study provide valuable insight for further dissection of the genetic architecture of resistance to CG in soybean. They also highlight SNP variants and genes useful for genomics-informed selection decisions in the breeding process for improving resistance traits in soybean. | 2023 | 37372451 |
| 5829 | 18 | 0.9926 | Diagnosing Antibiotic Resistance Using Nucleic Acid Enzymes and Gold Nanoparticles. The rapid and accurate detection of antimicrobial resistance is critical to limiting the spread of infections and delivering effective treatments. Here, we developed a rapid, sensitive, and simple colorimetric nanodiagnostic platform to identify disease-causing pathogens and their associated antibiotic resistance genes within 2 h. The platform can detect bacteria from different biological samples (i.e., blood, wound swabs) with or without culturing. We validated the multicomponent nucleic acid enzyme-gold nanoparticle (MNAzyme-GNP) platform by screening patients with central line associated bloodstream infections and achieved a clinical sensitivity and specificity of 86% and 100%, respectively. We detected antibiotic resistance in methicillin-resistant Staphylococcus aureus (MRSA) in patient swabs with 90% clinical sensitivity and 95% clinical specificity. Finally, we identified mecA resistance genes in uncultured nasal, groin, axilla, and wound swabs from patients with 90% clinical sensitivity and 95% clinical specificity. The simplicity and versatility for detecting bacteria and antibiotic resistance markers make our platform attractive for the broad screening of microbial pathogens. | 2021 | 33970612 |
| 5194 | 19 | 0.9926 | Evaluation of the CosmosID Bioinformatics Platform for Prosthetic Joint-Associated Sonicate Fluid Shotgun Metagenomic Data Analysis. We previously demonstrated that shotgun metagenomic sequencing can detect bacteria in sonicate fluid, providing a diagnosis of prosthetic joint infection (PJI). A limitation of the approach that we used is that data analysis was time-consuming and specialized bioinformatics expertise was required, both of which are barriers to routine clinical use. Fortunately, automated commercial analytic platforms that can interpret shotgun metagenomic data are emerging. In this study, we evaluated the CosmosID bioinformatics platform using shotgun metagenomic sequencing data derived from 408 sonicate fluid samples from our prior study with the goal of evaluating the platform vis-à-vis bacterial detection and antibiotic resistance gene detection for predicting staphylococcal antibacterial susceptibility. Samples were divided into a derivation set and a validation set, each consisting of 204 samples; results from the derivation set were used to establish cutoffs, which were then tested in the validation set for identifying pathogens and predicting staphylococcal antibacterial resistance. Metagenomic analysis detected bacteria in 94.8% (109/115) of sonicate fluid culture-positive PJIs and 37.8% (37/98) of sonicate fluid culture-negative PJIs. Metagenomic analysis showed sensitivities ranging from 65.7 to 85.0% for predicting staphylococcal antibacterial resistance. In conclusion, the CosmosID platform has the potential to provide fast, reliable bacterial detection and identification from metagenomic shotgun sequencing data derived from sonicate fluid for the diagnosis of PJI. Strategies for metagenomic detection of antibiotic resistance genes for predicting staphylococcal antibacterial resistance need further development. | 2019 | 30429253 |