# | Rank | Similarity | Title + Abs. | Year | PMID |
|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 |
| 3771 | 0 | 0.8963 | RFPlasmid: predicting plasmid sequences from short-read assembly data using machine learning. Antimicrobial-resistance (AMR) genes in bacteria are often carried on plasmids and these plasmids can transfer AMR genes between bacteria. For molecular epidemiology purposes and risk assessment, it is important to know whether the genes are located on highly transferable plasmids or in the more stable chromosomes. However, draft whole-genome sequences are fragmented, making it difficult to discriminate plasmid and chromosomal contigs. Current methods that predict plasmid sequences from draft genome sequences rely on single features, like k-mer composition, circularity of the DNA molecule, copy number or sequence identity to plasmid replication genes, all of which have their drawbacks, especially when faced with large single-copy plasmids, which often carry resistance genes. With our newly developed prediction tool RFPlasmid, we use a combination of multiple features, including k-mer composition and databases with plasmid and chromosomal marker proteins, to predict whether the likely source of a contig is plasmid or chromosomal. The tool RFPlasmid supports models for 17 different bacterial taxa, including Campylobacter, Escherichia coli and Salmonella, and has a taxon agnostic model for metagenomic assemblies or unsupported organisms. RFPlasmid is available both as a standalone tool and via a web interface. | 2021 | 34846288 |
| 9074 | 1 | 0.8926 | BacAnt: A Combination Annotation Server for Bacterial DNA Sequences to Identify Antibiotic Resistance Genes, Integrons, and Transposable Elements. Whole genome sequencing (WGS) of bacteria has become a routine method in diagnostic laboratories. One of the clinically most useful advantages of WGS is the ability to predict antimicrobial resistance genes (ARGs) and mobile genetic elements (MGEs) in bacterial sequences. This allows comprehensive investigations of such genetic features but can also be used for epidemiological studies. A plethora of software programs have been developed for the detailed annotation of bacterial DNA sequences, such as rapid annotation using subsystem technology (RAST), Resfinder, ISfinder, INTEGRALL and The Transposon Registry. Unfortunately, to this day, a reliable annotation tool of the combination of ARGs and MGEs is not available, and the generation of genbank files requires much manual input. Here, we present a new webserver which allows the annotation of ARGs, integrons and transposable elements at the same time. The pipeline generates genbank files automatically, which are compatible with Easyfig for comparative genomic analysis. Our BacAnt code and standalone software package are available at https://github.com/xthua/bacant with an accompanying web application at http://bacant.net. | 2021 | 34367079 |
| 9076 | 2 | 0.8925 | ResiDB: An automated database manager for sequence data. The amount of publicly available DNA sequence data is drastically increasing, making it a tedious task to create sequence databases necessary for the design of diagnostic assays. The selection of appropriate sequences is especially challenging in genes affected by frequent point mutations such as antibiotic resistance genes. To overcome this issue, we have designed the webtool resiDB, a rapid and user-friendly sequence database manager for bacteria, fungi, viruses, protozoa, invertebrates, plants, archaea, environmental and whole genome shotgun sequence data. It automatically identifies and curates sequence clusters to create custom sequence databases based on user-defined input sequences. A collection of helpful visualization tools gives the user the opportunity to easily access, evaluate, edit, and download the newly created database. Consequently, researchers do no longer have to manually manage sequence data retrieval, deal with hardware limitations, and run multiple independent software tools, each having its own requirements, input and output formats. Our tool was developed within the H2020 project FAPIC aiming to develop a single diagnostic assay targeting all sepsis-relevant pathogens and antibiotic resistance mechanisms. ResiDB is freely accessible to all users through https://residb.ait.ac.at/. | 2021 | 33495705 |
| 9985 | 3 | 0.8923 | Identification of the First Gene Transfer Agent (GTA) Small Terminase in Rhodobacter capsulatus and Its Role in GTA Production and Packaging of DNA. Genetic exchange mediated by viruses of bacteria (bacteriophages) is the primary driver of rapid bacterial evolution. The priority of viruses is usually to propagate themselves. Most bacteriophages use the small terminase protein to identify their own genome and direct its inclusion into phage capsids. Gene transfer agents (GTAs) are descended from bacteriophages, but they instead package fragments of the entire bacterial genome without preference for their own genes. GTAs do not selectively target specific DNA, and no GTA small terminases are known. Here, we identified the small terminase from the model Rhodobacter capsulatus GTA, which then allowed prediction of analogues in other species. We examined the role of the small terminase in GTA production and propose a structural basis for random DNA packaging.IMPORTANCE Random transfer of any and all genes between bacteria could be influential in the spread of virulence or antimicrobial resistance genes. Discovery of the true prevalence of GTAs in sequenced genomes is hampered by their apparent similarity to bacteriophages. Our data allowed the prediction of small terminases in diverse GTA producer species, and defining the characteristics of a "GTA-type" terminase could be an important step toward novel GTA identification. Importantly, the GTA small terminase shares many features with its phage counterpart. We propose that the GTA terminase complex could become a streamlined model system to answer fundamental questions about double-stranded DNA (dsDNA) packaging by viruses that have not been forthcoming to date. | 2019 | 31534034 |
| 9075 | 4 | 0.8909 | CamPype: an open-source workflow for automated bacterial whole-genome sequencing analysis focused on Campylobacter. BACKGROUND: The rapid expansion of Whole-Genome Sequencing has revolutionized the fields of clinical and food microbiology. However, its implementation as a routine laboratory technique remains challenging due to the growth of data at a faster rate than can be effectively analyzed and critical gaps in bioinformatics knowledge. RESULTS: To address both issues, CamPype was developed as a new bioinformatics workflow for the genomics analysis of sequencing data of bacteria, especially Campylobacter, which is the main cause of gastroenteritis worldwide making a negative impact on the economy of the public health systems. CamPype allows fully customization of stages to run and tools to use, including read quality control filtering, read contamination, reads extension and assembly, bacterial typing, genome annotation, searching for antibiotic resistance genes, virulence genes and plasmids, pangenome construction and identification of nucleotide variants. All results are processed and resumed in an interactive HTML report for best data visualization and interpretation. CONCLUSIONS: The minimal user intervention of CamPype makes of this workflow an attractive resource for microbiology laboratories with no expertise in bioinformatics as a first line method for bacterial typing and epidemiological analyses, that would help to reduce the costs of disease outbreaks, or for comparative genomic analyses. CamPype is publicly available at https://github.com/JoseBarbero/CamPype . | 2023 | 37474912 |
| 9078 | 5 | 0.8899 | MetaCherchant: analyzing genomic context of antibiotic resistance genes in gut microbiota. MOTIVATION: Antibiotic resistance is an important global public health problem. Human gut microbiota is an accumulator of resistance genes potentially providing them to pathogens. It is important to develop tools for identifying the mechanisms of how resistance is transmitted between gut microbial species and pathogens. RESULTS: We developed MetaCherchant-an algorithm for extracting the genomic environment of antibiotic resistance genes from metagenomic data in the form of a graph. The algorithm was validated on a number of simulated and published datasets, as well as applied to new 'shotgun' metagenomes of gut microbiota from patients with Helicobacter pylori who underwent antibiotic therapy. Genomic context was reconstructed for several major resistance genes. Taxonomic annotation of the context suggests that within a single metagenome, the resistance genes can be contained in genomes of multiple species. MetaCherchant allows reconstruction of mobile elements with resistance genes within the genomes of bacteria using metagenomic data. Application of MetaCherchant in differential mode produced specific graph structures suggesting the evidence of possible resistance gene transmission within a mobile element that occurred as a result of the antibiotic therapy. MetaCherchant is a promising tool giving researchers an opportunity to get an insight into dynamics of resistance transmission in vivo basing on metagenomic data. AVAILABILITY AND IMPLEMENTATION: Source code and binaries are freely available for download at https://github.com/ctlab/metacherchant. The code is written in Java and is platform-independent. COTANCT: ulyantsev@rain.ifmo.ru. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. | 2018 | 29092015 |
| 5125 | 6 | 0.8895 | Do we still need Illumina sequencing data? Evaluating Oxford Nanopore Technologies R10.4.1 flow cells and the Rapid v14 library prep kit for Gram negative bacteria whole genome assemblies. The best whole genome assemblies are currently built from a combination of highly accurate short-read sequencing data and long-read sequencing data that can bridge repetitive and problematic regions. Oxford Nanopore Technologies (ONT) produce long-read sequencing platforms and they are continually improving their technology to obtain higher quality read data that is approaching the quality obtained from short-read platforms such as Illumina. As these innovations continue, we evaluated how much ONT read coverage produced by the Rapid Barcoding Kit v14 (SQK-RBK114) is necessary to generate high-quality hybrid and long-read-only genome assemblies for a panel of carbapenemase-producing Enterobacterales bacterial isolates. We found that 30× long-read coverage is sufficient if Illumina data are available, and that more (at least 100× long-read coverage is recommended for long-read-only assemblies. Illumina polishing is still improving single nucleotide variants (SNVs) and INDELs in long-read-only assemblies. We also examined if antimicrobial resistance genes could be accurately identified in long-read-only data, and found that Flye assemblies regardless of ONT coverage detected >96% of resistance genes at 100% identity and length. Overall, the Rapid Barcoding Kit v14 and long-read-only assemblies can be an optimal sequencing strategy (i.e., plasmid characterization and AMR detection) but finer-scale analyses (i.e., SNV) still benefit from short-read data. | 2024 | 38354391 |
| 9392 | 7 | 0.8892 | CNproScan: Hybrid CNV detection for bacterial genomes. Discovering copy number variation (CNV) in bacteria is not in the spotlight compared to the attention focused on CNV detection in eukaryotes. However, challenges arising from bacterial drug resistance bring further interest to the topic of CNV and its role in drug resistance. General CNV detection methods do not consider bacteria's features and there is space to improve detection accuracy. Here, we present a CNV detection method called CNproScan focused on bacterial genomes. CNproScan implements a hybrid approach and other bacteria-focused features and depends only on NGS data. We benchmarked our method and compared it to the previously published methods and we can resolve to achieve a higher detection rate together with providing other beneficial features, such as CNV classification. Compared with other methods, CNproScan can detect much shorter CNV events. | 2021 | 34224809 |
| 9070 | 8 | 0.8892 | Automated annotation of mobile antibiotic resistance in Gram-negative bacteria: the Multiple Antibiotic Resistance Annotator (MARA) and database. BACKGROUND: Multiresistance in Gram-negative bacteria is often due to acquisition of several different antibiotic resistance genes, each associated with a different mobile genetic element, that tend to cluster together in complex conglomerations. Accurate, consistent annotation of resistance genes, the boundaries and fragments of mobile elements, and signatures of insertion, such as DR, facilitates comparative analysis of complex multiresistance regions and plasmids to better understand their evolution and how resistance genes spread. OBJECTIVES: To extend the Repository of Antibiotic resistance Cassettes (RAC) web site, which includes a database of 'features', and the Attacca automatic DNA annotation system, to encompass additional resistance genes and all types of associated mobile elements. METHODS: Antibiotic resistance genes and mobile elements were added to RAC, from existing registries where possible. Attacca grammars were extended to accommodate the expanded database, to allow overlapping features to be annotated and to identify and annotate features such as composite transposons and DR. RESULTS: The Multiple Antibiotic Resistance Annotator (MARA) database includes antibiotic resistance genes and selected mobile elements from Gram-negative bacteria, distinguishing important variants. Sequences can be submitted to the MARA web site for annotation. A list of positions and orientations of annotated features, indicating those that are truncated, DR and potential composite transposons is provided for each sequence, as well as a diagram showing annotated features approximately to scale. CONCLUSIONS: The MARA web site (http://mara.spokade.com) provides a comprehensive database for mobile antibiotic resistance in Gram-negative bacteria and accurately annotates resistance genes and associated mobile elements in submitted sequences to facilitate comparative analysis. | 2018 | 29373760 |
| 3057 | 9 | 0.8890 | An Enterobacter plasmid as a new genetic background for the transposon Tn1331. BACKGROUND: Genus Enterobacter includes important opportunistic nosocomial pathogens that could infect complex wounds. The presence of antibiotic resistance genes in these microorganisms represents a challenging clinical problem in the treatment of these wounds. In the authors' screening of antibiotic-resistant bacteria from complex wounds, an Enterobacter species was isolated that harbors antibiotic-resistant plasmids conferring resistance to Escherichia coli. The aim of this study was to identify the resistance genes carried by one of these plasmids. METHODS: The plasmids from the Enterobacter isolate were propagated in E. coli and one of the plasmids, designated as pR23, was sequenced by the Sanger method using fluorescent dyeterminator chemistry on a genetic analyzer. The assembled sequence was annotated by search of the GenBank database. RESULTS: Plasmid pR23 is composed of the transposon Tn1331 and a backbone plasmid that is identical to the plasmid pPIGDM1 from Enterobacter agglomerans. The multidrug-resistance transposon Tn1331, which confers resistance to aminoglycoside and beta lactam antibiotics, has been previously isolated only from Klebsiella. The Enterobacter plasmid pPIGDM1, which carries a ColE1-like origin of replication and has no apparent selective marker, appears to provide a backbone for propagation of Tn1331 in Enterobacter. The recognition sequence of Tn1331 transposase for insertion into pPIGDM1 is the pentanucleotide TATTA, which occurs only once throughout the length of this plasmid. CONCLUSION: Transposition of Tn1331 into the Enterobacter plasmid pPIGDM1 enables this transposon to propagate in this Enterobacter. Since Tn1331 was previously isolated only from Klebsiella, this report suggests horizontal transfer of this transposon between the two bacterial genera. | 2011 | 22259249 |
| 9066 | 10 | 0.8890 | VRprofile: gene-cluster-detection-based profiling of virulence and antibiotic resistance traits encoded within genome sequences of pathogenic bacteria. VRprofile is a Web server that facilitates rapid investigation of virulence and antibiotic resistance genes, as well as extends these trait transfer-related genetic contexts, in newly sequenced pathogenic bacterial genomes. The used backend database MobilomeDB was firstly built on sets of known gene cluster loci of bacterial type III/IV/VI/VII secretion systems and mobile genetic elements, including integrative and conjugative elements, prophages, class I integrons, IS elements and pathogenicity/antibiotic resistance islands. VRprofile is thus able to co-localize the homologs of these conserved gene clusters using HMMer or BLASTp searches. With the integration of the homologous gene cluster search module with a sequence composition module, VRprofile has exhibited better performance for island-like region predictions than the other widely used methods. In addition, VRprofile also provides an integrated Web interface for aligning and visualizing identified gene clusters with MobilomeDB-archived gene clusters, or a variety set of bacterial genomes. VRprofile might contribute to meet the increasing demands of re-annotations of bacterial variable regions, and aid in the real-time definitions of disease-relevant gene clusters in pathogenic bacteria of interest. VRprofile is freely available at http://bioinfo-mml.sjtu.edu.cn/VRprofile. | 2018 | 28077405 |
| 5068 | 11 | 0.8890 | Ultrasensitive Label-Free Detection of Unamplified Multidrug-Resistance Bacteria Genes with a Bimodal Waveguide Interferometric Biosensor. Infections by multidrug-resistant bacteria are becoming a major healthcare emergence with millions of reported cases every year and an increasing incidence of deaths. An advanced diagnostic platform able to directly detect and identify antimicrobial resistance in a faster way than conventional techniques could help in the adoption of early and accurate therapeutic interventions, limiting the actual negative impact on patient outcomes. With this objective, we have developed a new biosensor methodology using an ultrasensitive nanophotonic bimodal waveguide interferometer (BiMW), which allows a rapid and direct detection, without amplification, of two prevalent and clinically relevant Gram-negative antimicrobial resistance encoding sequences: the extended-spectrum betalactamase-encoding gene blaCTX-M-15 and the carbapenemase-encoding gene blaNDM-5 We demonstrate the extreme sensitivity and specificity of our biosensor methodology for the detection of both gene sequences. Our results show that the BiMW biosensor can be employed as an ultrasensitive (attomolar level) and specific diagnostic tool for rapidly (less than 30 min) identifying drug resistance. The BiMW nanobiosensor holds great promise as a powerful tool for the control and management of healthcare-associated infections by multidrug-resistant bacteria. | 2020 | 33086716 |
| 3770 | 12 | 0.8888 | Detection of mobile genetic elements associated with antibiotic resistance in Salmonella enterica using a newly developed web tool: MobileElementFinder. OBJECTIVES: Antimicrobial resistance (AMR) in clinically relevant bacteria is a growing threat to public health globally. In these bacteria, antimicrobial resistance genes are often associated with mobile genetic elements (MGEs), which promote their mobility, enabling them to rapidly spread throughout a bacterial community. METHODS: The tool MobileElementFinder was developed to enable rapid detection of MGEs and their genetic context in assembled sequence data. MGEs are detected based on sequence similarity to a database of 4452 known elements augmented with annotation of resistance genes, virulence factors and detection of plasmids. RESULTS: MobileElementFinder was applied to analyse the mobilome of 1725 sequenced Salmonella enterica isolates of animal origin from Denmark, Germany and the USA. We found that the MGEs were seemingly conserved according to multilocus ST and not restricted to either the host or the country of origin. Moreover, we identified putative translocatable units for specific aminoglycoside, sulphonamide and tetracycline genes. Several putative composite transposons were predicted that could mobilize, among others, AMR, metal resistance and phosphodiesterase genes associated with macrophage survivability. This is, to our knowledge, the first time the phosphodiesterase-like pdeL has been found to be potentially mobilized into S. enterica. CONCLUSIONS: MobileElementFinder is a powerful tool to study the epidemiology of MGEs in a large number of genome sequences and to determine the potential for genomic plasticity of bacteria. This web service provides a convenient method of detecting MGEs in assembled sequence data. MobileElementFinder can be accessed at https://cge.cbs.dtu.dk/services/MobileElementFinder/. | 2021 | 33009809 |
| 9081 | 13 | 0.8888 | Identification and reconstruction of novel antibiotic resistance genes from metagenomes. BACKGROUND: Environmental and commensal bacteria maintain a diverse and largely unknown collection of antibiotic resistance genes (ARGs) that, over time, may be mobilized and transferred to pathogens. Metagenomics enables cultivation-independent characterization of bacterial communities but the resulting data is noisy and highly fragmented, severely hampering the identification of previously undescribed ARGs. We have therefore developed fARGene, a method for identification and reconstruction of ARGs directly from shotgun metagenomic data. RESULTS: fARGene uses optimized gene models and can therefore with high accuracy identify previously uncharacterized resistance genes, even if their sequence similarity to known ARGs is low. By performing the analysis directly on the metagenomic fragments, fARGene also circumvents the need for a high-quality assembly. To demonstrate the applicability of fARGene, we reconstructed β-lactamases from five billion metagenomic reads, resulting in 221 ARGs, of which 58 were previously not reported. Based on 38 ARGs reconstructed by fARGene, experimental verification showed that 81% provided a resistance phenotype in Escherichia coli. Compared to other methods for detecting ARGs in metagenomic data, fARGene has superior sensitivity and the ability to reconstruct previously unknown genes directly from the sequence reads. CONCLUSIONS: We conclude that fARGene provides an efficient and reliable way to explore the unknown resistome in bacterial communities. The method is applicable to any type of ARGs and is freely available via GitHub under the MIT license. | 2019 | 30935407 |
| 532 | 14 | 0.8887 | Three new dominant drug resistance cassettes for gene disruption in Saccharomyces cerevisiae. Disruption-deletion cassettes are powerful tools used to study gene function in many organisms, including Saccharomyces cerevisiae. Perhaps the most widely useful of these are the heterologous dominant drug resistance cassettes, which use antibiotic resistance genes from bacteria and fungi as selectable markers. We have created three new dominant drug resistance cassettes by replacing the kanamycin resistance (kan(r)) open reading frame from the kanMX3 and kanMX4 disruption-deletion cassettes (Wach et al., 1994) with open reading frames conferring resistance to the antibiotics hygromycin B (hph), nourseothricin (nat) and bialaphos (pat). The new cassettes, pAG25 (natMX4), pAG29 (patMX4), pAG31 (patMX3), pAG32 (hphMX4), pAG34 (hphMX3) and pAG35 (natMX3), are cloned into pFA6, and so are in all other respects identical to pFA6-kanMX3 and pFA6-kanMX4. Most tools and techniques used with the kanMX plasmids can also be used with the hph, nat and patMX containing plasmids. These new heterologous dominant drug resistance cassettes have unique antibiotic resistance phenotypes and do not affect growth when inserted into the ho locus. These attributes make the cassettes ideally suited for creating S. cerevisiae strains with multiple mutations within a single strain. | 1999 | 10514571 |
| 9083 | 15 | 0.8885 | ARGNet: using deep neural networks for robust identification and classification of antibiotic resistance genes from sequences. BACKGROUND: Emergence of antibiotic resistance in bacteria is an important threat to global health. Antibiotic resistance genes (ARGs) are some of the key components to define bacterial resistance and their spread in different environments. Identification of ARGs, particularly from high-throughput sequencing data of the specimens, is the state-of-the-art method for comprehensively monitoring their spread and evolution. Current computational methods to identify ARGs mainly rely on alignment-based sequence similarities with known ARGs. Such approaches are limited by choice of reference databases and may potentially miss novel ARGs. The similarity thresholds are usually simple and could not accommodate variations across different gene families and regions. It is also difficult to scale up when sequence data are increasing. RESULTS: In this study, we developed ARGNet, a deep neural network that incorporates an unsupervised learning autoencoder model to identify ARGs and a multiclass classification convolutional neural network to classify ARGs that do not depend on sequence alignment. This approach enables a more efficient discovery of both known and novel ARGs. ARGNet accepts both amino acid and nucleotide sequences of variable lengths, from partial (30-50 aa; 100-150 nt) sequences to full-length protein or genes, allowing its application in both target sequencing and metagenomic sequencing. Our performance evaluation showed that ARGNet outperformed other deep learning models including DeepARG and HMD-ARG in most of the application scenarios especially quasi-negative test and the analysis of prediction consistency with phylogenetic tree. ARGNet has a reduced inference runtime by up to 57% relative to DeepARG. CONCLUSIONS: ARGNet is flexible, efficient, and accurate at predicting a broad range of ARGs from the sequencing data. ARGNet is freely available at https://github.com/id-bioinfo/ARGNet , with an online service provided at https://ARGNet.hku.hk . Video Abstract. | 2024 | 38725076 |
| 9073 | 16 | 0.8885 | EpitoCore: Mining Conserved Epitope Vaccine Candidates in the Core Proteome of Multiple Bacteria Strains. In reverse vaccinology approaches, complete proteomes of bacteria are submitted to multiple computational prediction steps in order to filter proteins that are possible vaccine candidates. Most available tools perform such analysis only in a single strain, or a very limited number of strains. But the vast amount of genomic data had shown that most bacteria contain pangenomes, i.e., their genomic information contains core, conserved genes, and random accessory genes specific to each strain. Therefore, in reverse vaccinology methods it is of the utmost importance to define core proteins and core epitopes. EpitoCore is a decision-tree pipeline developed to fulfill that need. It provides surfaceome prediction of proteins from related strains, defines core proteins within those, calculate their immunogenicity, predicts epitopes for a given set of MHC alleles defined by the user, and then reports if epitopes are located extracellularly and if they are conserved among the core homologs. Pipeline performance is illustrated by mining peptide vaccine candidates in Mycobacterium avium hominissuis strains. From a total proteome of ~4,800 proteins per strain, EpitoCore predicted 103 highly immunogenic core homologs located at cell surface, many of those related to virulence and drug resistance. Conserved epitopes identified among these homologs allows the users to define sets of peptides with potential to immunize the largest coverage of tested HLA alleles using peptide-based vaccines. Therefore, EpitoCore is able to provide automated identification of conserved epitopes in bacterial pangenomic datasets. | 2020 | 32431712 |
| 8427 | 17 | 0.8885 | Basal DNA repair machinery is subject to positive selection in ionizing-radiation-resistant bacteria. BACKGROUND: Ionizing-radiation-resistant bacteria (IRRB) show a surprising capacity for adaptation to ionizing radiation and desiccation. Positive Darwinian selection is expected to play an important role in this trait, but no data are currently available regarding the role of positive adaptive selection in resistance to ionizing-radiation and tolerance of desiccation. We analyzed the four known genome sequences of IRRB (Deinococcus geothermalis, Deinococcus radiodurans, Kineococcus radiotolerans, and Rubrobacter xylanophilus) to determine the role of positive Darwinian selection in the evolution of resistance to ionizing radiation and tolerance of desiccation. RESULTS: We used the programs MultiParanoid and DnaSP to deduce the sets of orthologs that potentially evolved due to positive Darwinian selection in IRRB. We find that positive selection targets 689 ortholog sets of IRRB. Among these, 58 ortholog sets are absent in ionizing-radiation-sensitive bacteria (IRSB: Escherichia coli and Thermus thermophilus). The most striking finding is that all basal DNA repair genes in IRRB, unlike many of their orthologs in IRSB, are subject to positive selection. CONCLUSION: Our results provide the first in silico prediction of positively selected genes with potential roles in the molecular basis of resistance to gamma-radiation and tolerance of desiccation in IRRB. Identification of these genes provides a basis for future experimental work aimed at understanding the metabolic networks in which they participate. | 2008 | 18570673 |
| 5116 | 18 | 0.8882 | Prediction of Antimicrobial Resistance in Gram-Negative Bacteria From Whole-Genome Sequencing Data. BACKGROUND: Early detection of antimicrobial resistance in pathogens and prescription of more effective antibiotics is a fast-emerging need in clinical practice. High-throughput sequencing technology, such as whole genome sequencing (WGS), may have the capacity to rapidly guide the clinical decision-making process. The prediction of antimicrobial resistance in Gram-negative bacteria, often the cause of serious systemic infections, is more challenging as genotype-to-phenotype (drug resistance) relationship is more complex than for most Gram-positive organisms. METHODS AND FINDINGS: We have used NCBI BioSample database to train and cross-validate eight XGBoost-based machine learning models to predict drug resistance to cefepime, cefotaxime, ceftriaxone, ciprofloxacin, gentamicin, levofloxacin, meropenem, and tobramycin tested in Acinetobacter baumannii, Escherichia coli, Enterobacter cloacae, Klebsiella aerogenes, and Klebsiella pneumoniae. The input is the WGS data in terms of the coverage of known antibiotic resistance genes by shotgun sequencing reads. Models demonstrate high performance and robustness to class imbalanced datasets. CONCLUSION: Whole genome sequencing enables the prediction of antimicrobial resistance in Gram-negative bacteria. We present a tool that provides an in silico antibiogram for eight drugs. Predictions are accompanied with a reliability index that may further facilitate the decision making process. The demo version of the tool with pre-processed samples is available at https://vancampn.shinyapps.io/wgs2amr/. The stand-alone version of the predictor is available at https://github.com/pieterjanvc/wgs2amr/. | 2020 | 32528441 |
| 9080 | 19 | 0.8880 | Comparison of de-novo assembly tools for plasmid metagenome analysis. BACKGROUND: With the advent of next-generation sequencing techniques, culture-independent metagenome approaches have now made it possible to predict possible presence of genes in the environmental bacteria most of which may be non-cultivable. Short reads obtained from the deep sequencing can be assembled into long contigs some of which include plasmids. Plasmids are the circular double stranded DNA in bacteria and known as one of the major carriers of antibiotic resistance genes. OBJECTIVE: Metagenomic analyses, especially focused on plasmids, could help us predict dissemination mechanisms of antibiotic resistance genes in the environment. However, with the availability of a myriad of metagenomic assemblers, the selection of the most appropriate metagenome assembler for the plasmid metagenome study might be challenging. Therefore, in this study, we compared five open source assemblers to suggest most effective way of plasmid metagenome analysis. METHODS: IDBA-UD, MEGAHIT, SPAdes, SOAPdenovo2, and Velvet are compared for conducting plasmid metagenome analyses using two water samples. RESULTS: Our results clearly showed that abundance and types of antibiotic resistance genes on plasmids varied depending on the selection of assembly tools. IDBA-UD and MEGAHIT demonstrated the overall best assembly statistics with high N50 values with higher portion of longer contigs. CONCLUSION: These two assemblers also detected more diverse plasmids. Among the two, MEGAHIT showed more memory efficient assembly, therefore we suggest that the use of MEGAHIT for plasmid metagenome analysis may offer more diverse plasmids with less computer resource required. Here, we also summarized a fundamental plasmid metagenome work flow, especially for antibiotic resistance gene investigation. | 2019 | 31187446 |