# | Rank | Similarity | Title + Abs. | Year | PMID |
|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 |
| 9071 | 0 | 0.9746 | RAC: Repository of Antibiotic resistance Cassettes. Antibiotic resistance in bacteria is often due to acquisition of resistance genes associated with different mobile genetic elements. In Gram-negative bacteria, many resistance genes are found as part of small mobile genetic elements called gene cassettes, generally found integrated into larger elements called integrons. Integrons carrying antibiotic resistance gene cassettes are often associated with mobile elements and here are designated 'mobile resistance integrons' (MRIs). More than one cassette can be inserted in the same integron to create arrays that contribute to the spread of multi-resistance. In many sequences in databases such as GenBank, only the genes within cassettes, rather than whole cassettes, are annotated and the same gene/cassette may be given different names in different entries, hampering analysis. We have developed the Repository of Antibiotic resistance Cassettes (RAC) website to provide an archive of gene cassettes that includes alternative gene names from multiple nomenclature systems and allows the community to contribute new cassettes. RAC also offers an additional function that allows users to submit sequences containing cassettes or arrays for annotation using the automatic annotation system Attacca. Attacca recognizes features (gene cassettes, integron regions) and identifies cassette arrays as patterns of features and can also distinguish minor cassette variants that may encode different resistance phenotypes (aacA4 cassettes and bla cassettes-encoding β-lactamases). Gaps in annotations are manually reviewed and those found to correspond to novel cassettes are assigned unique names. While there are other websites dedicated to integrons or antibiotic resistance genes, none includes a complete list of antibiotic resistance gene cassettes in MRI or offers consistent annotation and appropriate naming of all of these cassettes in submitted sequences. RAC thus provides a unique resource for researchers, which should reduce confusion and improve the quality of annotations of gene cassettes in integrons associated with antibiotic resistance. DATABASE URL: http://www2.chi.unsw.edu.au/rac. | 2011 | 22140215 |
| 9074 | 1 | 0.9733 | BacAnt: A Combination Annotation Server for Bacterial DNA Sequences to Identify Antibiotic Resistance Genes, Integrons, and Transposable Elements. Whole genome sequencing (WGS) of bacteria has become a routine method in diagnostic laboratories. One of the clinically most useful advantages of WGS is the ability to predict antimicrobial resistance genes (ARGs) and mobile genetic elements (MGEs) in bacterial sequences. This allows comprehensive investigations of such genetic features but can also be used for epidemiological studies. A plethora of software programs have been developed for the detailed annotation of bacterial DNA sequences, such as rapid annotation using subsystem technology (RAST), Resfinder, ISfinder, INTEGRALL and The Transposon Registry. Unfortunately, to this day, a reliable annotation tool of the combination of ARGs and MGEs is not available, and the generation of genbank files requires much manual input. Here, we present a new webserver which allows the annotation of ARGs, integrons and transposable elements at the same time. The pipeline generates genbank files automatically, which are compatible with Easyfig for comparative genomic analysis. Our BacAnt code and standalone software package are available at https://github.com/xthua/bacant with an accompanying web application at http://bacant.net. | 2021 | 34367079 |
| 9077 | 2 | 0.9732 | The PLSDB 2025 update: enhanced annotations and improved functionality for comprehensive plasmid research. Plasmids are extrachromosomal DNA molecules in bacteria and archaea, playing critical roles in horizontal gene transfer, antibiotic resistance, and pathogenicity. Since its first release in 2018, our database on plasmids, PLSDB, has significantly grown and enhanced its content and scope. From 34 513 records contained in the 2021 version, PLSDB now hosts 72 360 entries. Designed to provide life scientists with convenient access to extensive plasmid data and to support computer scientists by offering curated datasets for artificial intelligence (AI) development, this latest update brings more comprehensive and accurate information for plasmid research, with interactive visualization options. We enriched PLSDB by refining the identification and classification of plasmid host ecosystems and host diseases. Additionally, we incorporated annotations for new functional structures, including protein-coding genes and biosynthetic gene clusters. Further, we enhanced existing annotations, such as antimicrobial resistance genes and mobility typing. To accommodate these improvements and to host the increase plasmid sets, the webserver architecture and underlying data structures of PLSDB have been re-reconstructed, resulting in decreased response times and enhanced visualization of features while ensuring that users have access to a more efficient and user-friendly interface. The latest release of PLSDB is freely accessible at https://www.ccb.uni-saarland.de/plsdb2025. | 2025 | 39565221 |
| 9070 | 3 | 0.9732 | Automated annotation of mobile antibiotic resistance in Gram-negative bacteria: the Multiple Antibiotic Resistance Annotator (MARA) and database. BACKGROUND: Multiresistance in Gram-negative bacteria is often due to acquisition of several different antibiotic resistance genes, each associated with a different mobile genetic element, that tend to cluster together in complex conglomerations. Accurate, consistent annotation of resistance genes, the boundaries and fragments of mobile elements, and signatures of insertion, such as DR, facilitates comparative analysis of complex multiresistance regions and plasmids to better understand their evolution and how resistance genes spread. OBJECTIVES: To extend the Repository of Antibiotic resistance Cassettes (RAC) web site, which includes a database of 'features', and the Attacca automatic DNA annotation system, to encompass additional resistance genes and all types of associated mobile elements. METHODS: Antibiotic resistance genes and mobile elements were added to RAC, from existing registries where possible. Attacca grammars were extended to accommodate the expanded database, to allow overlapping features to be annotated and to identify and annotate features such as composite transposons and DR. RESULTS: The Multiple Antibiotic Resistance Annotator (MARA) database includes antibiotic resistance genes and selected mobile elements from Gram-negative bacteria, distinguishing important variants. Sequences can be submitted to the MARA web site for annotation. A list of positions and orientations of annotated features, indicating those that are truncated, DR and potential composite transposons is provided for each sequence, as well as a diagram showing annotated features approximately to scale. CONCLUSIONS: The MARA web site (http://mara.spokade.com) provides a comprehensive database for mobile antibiotic resistance in Gram-negative bacteria and accurately annotates resistance genes and associated mobile elements in submitted sequences to facilitate comparative analysis. | 2018 | 29373760 |
| 9069 | 4 | 0.9726 | Pdif-mediated antibiotic resistance genes transfer in bacteria identified by pdifFinder. Modules consisting of antibiotic resistance genes (ARGs) flanked by inverted repeat Xer-specific recombination sites were thought to be mobile genetic elements that promote horizontal transmission. Less frequently, the presence of mobile modules in plasmids, which facilitate a pdif-mediated ARGs transfer, has been reported. Here, numerous ARGs and toxin-antitoxin genes have been found in pdif site pairs. However, the mechanisms underlying this apparent genetic mobility is currently not understood, and the studies relating to pdif-mediated ARGs transfer onto most bacterial genera are lacking. We developed the web server pdifFinder based on an algorithm called PdifSM that allows the prediction of diverse pdif-ARGs modules in bacterial genomes. Using test set consisting of almost 32 thousand plasmids from 717 species, PdifSM identified 481 plasmids from various bacteria containing pdif sites with ARGs. We found 28-bp-long elements from different genera with clear base preferences. The data we obtained indicate that XerCD-dif site-specific recombination mechanism may have evolutionary adapted to facilitate the pdif-mediated ARGs transfer. Through multiple sequence alignment and evolutionary analyses of duplicated pdif-ARGs modules, we discovered that pdif sites allow an interspecies transfer of ARGs but also across different genera. Mutations in pdif sites generate diverse arrays of modules which mediate multidrug-resistance, as these contain variable numbers of diverse ARGs, insertion sequences and other functional genes. The identification of pdif-ARGs modules and studies focused on the mechanism of ARGs co-transfer will help us to understand and possibly allow controlling the spread of MDR bacteria in clinical settings. The pdifFinder code, standalone software package and description with tutorials are available at https://github.com/mjshao06/pdifFinder. | 2023 | 36470841 |
| 3771 | 5 | 0.9725 | RFPlasmid: predicting plasmid sequences from short-read assembly data using machine learning. Antimicrobial-resistance (AMR) genes in bacteria are often carried on plasmids and these plasmids can transfer AMR genes between bacteria. For molecular epidemiology purposes and risk assessment, it is important to know whether the genes are located on highly transferable plasmids or in the more stable chromosomes. However, draft whole-genome sequences are fragmented, making it difficult to discriminate plasmid and chromosomal contigs. Current methods that predict plasmid sequences from draft genome sequences rely on single features, like k-mer composition, circularity of the DNA molecule, copy number or sequence identity to plasmid replication genes, all of which have their drawbacks, especially when faced with large single-copy plasmids, which often carry resistance genes. With our newly developed prediction tool RFPlasmid, we use a combination of multiple features, including k-mer composition and databases with plasmid and chromosomal marker proteins, to predict whether the likely source of a contig is plasmid or chromosomal. The tool RFPlasmid supports models for 17 different bacterial taxa, including Campylobacter, Escherichia coli and Salmonella, and has a taxon agnostic model for metagenomic assemblies or unsupported organisms. RFPlasmid is available both as a standalone tool and via a web interface. | 2021 | 34846288 |
| 4453 | 6 | 0.9724 | dfrA trimethoprim resistance genes found in Gram-negative bacteria: compilation and unambiguous numbering. To track the spread of antibiotic resistance genes, accurate identification of individual genes is essential. Acquired trimethoprim resistance genes encoding trimethoprim-insensitive homologues of the sensitive dihydrofolate reductases encoded by the folA genes of bacteria are increasingly found in genome sequences. However, naming and numbering in publicly available records (journal publications or entries in the GenBank non-redundant DNA database) has not always been unambiguous. In addition, the nomenclature has evolved over time. Here, the changes in nomenclature and the most commonly encountered problems and pitfalls affecting dfrA gene identification arising from historically incorrect or inaccurate numbering are explained. The complete set of dfrA genes/DfrA proteins found in Gram-negative bacteria for which readily searchable sequence information is currently available has been compiled using less than 98% identity for both the gene and the derived protein sequence as the criteria for assignment of a new number. In most cases, trimethoprim resistance has been demonstrated. The gene context, predominantly in a gene cassette or near the ori end of CR1 or CR2, is also covered. The RefSeq database that underpins the programs used to automatically identify resistance genes in genome data sets has been curated to assign all sequences listed to the correct number. This led to the assignment of corrected or new gene numbers to several mis-assigned sequences. The unique numbers assigned for the dfrA/DfrA set are now listed in the RefSeq database, which we propose provides a way forward that should end future duplication of numbers and the confusion that causes. | 2021 | 34180526 |
| 9818 | 7 | 0.9721 | ISCR elements: novel gene-capturing systems of the 21st century? "Common regions" (CRs), such as Orf513, are being increasingly linked to mega-antibiotic-resistant regions. While their overall nucleotide sequences show little identity to other mobile elements, amino acid alignments indicate that they possess the key motifs of IS91-like elements, which have been linked to the mobility ent plasmids in pathogenic Escherichia coli. Further inspection reveals that they possess an IS91-like origin of replication and termination sites (terIS), and therefore CRs probably transpose via a rolling-circle replication mechanism. Accordingly, in this review we have renamed CRs as ISCRs to give a more accurate reflection of their functional properties. The genetic context surrounding ISCRs indicates that they can procure 5' sequences via misreading of the cognate terIS, i.e., "unchecked transposition." Clinically, the most worrying aspect of ISCRs is that they are increasingly being linked with more potent examples of resistance, i.e., metallo-beta-lactamases in Pseudomonas aeruginosa and co-trimoxazole resistance in Stenotrophomonas maltophilia. Furthermore, if ISCR elements do move via "unchecked RC transposition," as has been speculated for ISCR1, then this mechanism provides antibiotic resistance genes with a highly mobile genetic vehicle that could greatly exceed the effects of previously reported mobile genetic mechanisms. It has been hypothesized that bacteria will surprise us by extending their "genetic construction kit" to procure and evince additional DNA and, therefore, antibiotic resistance genes. It appears that ISCR elements have now firmly established themselves within that regimen. | 2006 | 16760305 |
| 9067 | 8 | 0.9721 | PIPdb: a comprehensive plasmid sequence resource for tracking the horizontal transfer of pathogenic factors and antimicrobial resistance genes. Plasmids, as independent genetic elements, carrying resistance or virulence genes and transfer them among different pathogens, posing a significant threat to human health. Under the 'One Health' approach, it is crucial to control the spread of plasmids carrying such genes. To achieve this, a comprehensive characterization of plasmids in pathogens is essential. Here we present the Plasmids in Pathogens Database (PIPdb), a pioneering resource that includes 792 964 plasmid segment clusters (PSCs) derived from 1 009 571 assembled genomes across 450 pathogenic species from 110 genera. To our knowledge, PIPdb is the first database specifically dedicated to plasmids in pathogenic bacteria, offering detailed multi-dimensional metadata such as collection date, geographical origin, ecosystem, host taxonomy, and habitat. PIPdb also provides extensive functional annotations, including plasmid type, insertion sequences, integron, oriT, relaxase, T4CP, virulence factors genes, heavy metal resistance genes and antibiotic resistance genes. The database features a user-friendly interface that facilitates studies on plasmids across diverse host taxa, habitats, and ecosystems, with a focus on those carrying antimicrobial resistance genes (ARGs). We have integrated online tools for plasmid identification and annotation from assembled genomes. Additionally, PIPdb includes a risk-scoring system for identifying potentially high-risk plasmids. The PIPdb web interface is accessible at https://nmdc.cn/pipdb. | 2025 | 39460620 |
| 9076 | 9 | 0.9720 | ResiDB: An automated database manager for sequence data. The amount of publicly available DNA sequence data is drastically increasing, making it a tedious task to create sequence databases necessary for the design of diagnostic assays. The selection of appropriate sequences is especially challenging in genes affected by frequent point mutations such as antibiotic resistance genes. To overcome this issue, we have designed the webtool resiDB, a rapid and user-friendly sequence database manager for bacteria, fungi, viruses, protozoa, invertebrates, plants, archaea, environmental and whole genome shotgun sequence data. It automatically identifies and curates sequence clusters to create custom sequence databases based on user-defined input sequences. A collection of helpful visualization tools gives the user the opportunity to easily access, evaluate, edit, and download the newly created database. Consequently, researchers do no longer have to manually manage sequence data retrieval, deal with hardware limitations, and run multiple independent software tools, each having its own requirements, input and output formats. Our tool was developed within the H2020 project FAPIC aiming to develop a single diagnostic assay targeting all sepsis-relevant pathogens and antibiotic resistance mechanisms. ResiDB is freely accessible to all users through https://residb.ait.ac.at/. | 2021 | 33495705 |
| 3764 | 10 | 0.9720 | Evidence for diversifying selection in a set of Mycobacterium tuberculosis genes in response to antibiotic- and nonantibiotic-related pressure. Tuberculosis (TB) is a global health problem estimated to kill 1.4 million people per year. Recent advances in the genomics of the causative agents of TB, bacteria known as the Mycobacterium tuberculosis complex (MTBC), have allowed a better comprehension of its population structure and provided the foundation for molecular evolution analyses. These studies are crucial for a better understanding of TB, including the variation of vaccine efficacy and disease outcome, together with the emergence of drug resistance. Starting from the analysis of 73 publicly available genomes from all the main MTBC lineages, we have screened for evidences of positive selection, a set of 576 genes previously associated with drug resistance or encoding membrane proteins. As expected, because antibiotics constitute strong selective pressure, some of the codons identified correspond to the position of confirmed drug-resistance-associated substitutions in the genes embB, rpoB, and katG. Furthermore, we identified diversifying selection in specific codons of the genes Rv0176 and Rv1872c coding for MCE1-associated transmembrane protein and a putative l-lactate dehydrogenase, respectively. Amino acid sequence analyses showed that in Rv0176, sites undergoing diversifying selection were in a predicted antigen region that varies between "modern" lineages and "ancient" MTBC/BCG strains. In Rv1872c, some of the sites under selection are predicted to impact protein function and thus might result from metabolic adaptation. These results illustrate that diversifying selection in MTBC is happening as a consequence of both antibiotic treatment and other evolutionary pressures. | 2013 | 23449927 |
| 5126 | 11 | 0.9719 | Blanket antimicrobial resistance gene database with structural information, BOARDS, provides insights on historical landscape of resistance prevalence and effects of mutations in enzyme structure. Antimicrobial resistance (AMR) in pathogenic bacteria poses a significant threat to public health, yet there is still a need for development in the tools to deeply understand AMR genes based on genetic or structural information. In this study, we present an interactive web database named Blanket Overarching Antimicrobial-Resistance gene Database with Structural information (BOARDS, sbml.unist.ac.kr), a database that comprehensively includes 3,943 reported AMR gene information for 1,997 extended spectrum beta-lactamase (ESBL) and 1,946 other genes as well as a total of 27,395 predicted protein structures. These structures, which include both wild-type AMR genes and their mutants, were derived from 80,094 publicly available whole-genome sequences. In addition, we developed the rapid analysis and detection tool of antimicrobial-resistance (RADAR), a one-stop analysis pipeline to detect AMR genes across whole-genome sequencing (WGSs). By integrating BOARDS and RADAR, the AMR prevalence landscape for eight multi-drug resistant pathogens was reconstructed, leading to unexpected findings such as the pre-existence of the MCR genes before their official reports. Enzymatic structure prediction-based analysis revealed that the occurrence of mutations found in some ESBL genes was found to be closely related to the binding affinities with their antibiotic substrates. Overall, BOARDS can play a significant role in performing in-depth analysis on AMR.IMPORTANCEWhile the increasing antibiotic resistance (AMR) in pathogen has been a burden on public health, effective tools for deep understanding of AMR based on genetic or structural information remain limited. In this study, a blanket overarching antimicrobial-resistance gene database with structure information (BOARDS)-a web-based database that comprehensively collected AMR gene data with predictive protein structural information was constructed. Additionally, we report the development of a RADAR pipeline that can analyze whole-genome sequences as well. BOARDS, which includes sequence and structural information, has shown the historical landscape and prevalence of the AMR genes and can provide insight into single-nucleotide polymorphism effects on antibiotic degrading enzymes within protein structures. | 2024 | 38085058 |
| 9817 | 12 | 0.9718 | Common regions e.g. orf513 and antibiotic resistance: IS91-like elements evolving complex class 1 integrons. The ability of bacteria to procure, sometimes rearrange, and evince acquired DNA continues to impress us-even more so if this genetic plasticity involves the sequestering of antibiotic resistance genes. The acquisition of genes in bacteria is often facilitated by transposons, integrons and archetype insertion elements. Recently however, a new element, 'orf513', has been increasingly associated with class 1 integrons. Moreover, these 'complex' class 1 integrons can potentially mediate resistance to chloramphenicol, trimethoprim, aminoglycosides and tetracycline and may carry a range of beta-lactamase genes as well as the qnrA gene. Elements such as 'orf513' demonstrate IS91-like characteristics and will mobilize adjacent DNA via a process called rolling circle replication, and thus we have renamed them 'insertion sequence CRs' (ISCRs) to appropriately reflect their structure-function properties. In this article, we provide a brief description of these new and clinically important mobile elements, and how they are able to mobilize antibiotic resistance genes. | 2006 | 16751201 |
| 8398 | 13 | 0.9718 | ARTS 2.0: feature updates and expansion of the Antibiotic Resistant Target Seeker for comparative genome mining. Multi-drug resistant pathogens have become a major threat to human health and new antibiotics are urgently needed. Most antibiotics are derived from secondary metabolites produced by bacteria. In order to avoid suicide, these bacteria usually encode resistance genes, in some cases within the biosynthetic gene cluster (BGC) of the respective antibiotic compound. Modern genome mining tools enable researchers to computationally detect and predict BGCs that encode the biosynthesis of secondary metabolites. The major challenge now is the prioritization of the most promising BGCs encoding antibiotics with novel modes of action. A recently developed target-directed genome mining approach allows researchers to predict the mode of action of the encoded compound of an uncharacterized BGC based on the presence of resistant target genes. In 2017, we introduced the 'Antibiotic Resistant Target Seeker' (ARTS). ARTS allows for specific and efficient genome mining for antibiotics with interesting and novel targets by rapidly linking housekeeping and known resistance genes to BGC proximity, duplication and horizontal gene transfer (HGT) events. Here, we present ARTS 2.0 available at http://arts.ziemertlab.com. ARTS 2.0 now includes options for automated target directed genome mining in all bacterial taxa as well as metagenomic data. Furthermore, it enables comparison of similar BGCs from different genomes and their putative resistance genes. | 2020 | 32427317 |
| 9068 | 14 | 0.9717 | TnCentral: a Prokaryotic Transposable Element Database and Web Portal for Transposon Analysis. We describe here the structure and organization of TnCentral (https://tncentral.proteininformationresource.org/ [or the mirror link at https://tncentral.ncc.unesp.br/]), a web resource for prokaryotic transposable elements (TE). TnCentral currently contains ∼400 carefully annotated TE, including transposons from the Tn3, Tn7, Tn402, and Tn554 families; compound transposons; integrons; and associated insertion sequences (IS). These TE carry passenger genes, including genes conferring resistance to over 25 classes of antibiotics and nine types of heavy metal, as well as genes responsible for pathogenesis in plants, toxin/antitoxin gene pairs, transcription factors, and genes involved in metabolism. Each TE has its own entry page, providing details about its transposition genes, passenger genes, and other sequence features required for transposition, as well as a graphical map of all features. TnCentral content can be browsed and queried through text- and sequence-based searches with a graphic output. We describe three use cases, which illustrate how the search interface, results tables, and entry pages can be used to explore and compare TE. TnCentral also includes downloadable software to facilitate user-driven identification, with manual annotation, of certain types of TE in genomic sequences. Through the TnCentral homepage, users can also access TnPedia, which provides comprehensive reviews of the major TE families, including an extensive general section and specialized sections with descriptions of insertion sequence and transposon families. TnCentral and TnPedia are intuitive resources that can be used by clinicians and scientists to assess TE diversity in clinical, veterinary, and environmental samples. IMPORTANCE The ability of bacteria to undergo rapid evolution and adapt to changing environmental circumstances drives the public health crisis of multiple antibiotic resistance, as well as outbreaks of disease in economically important agricultural crops and animal husbandry. Prokaryotic transposable elements (TE) play a critical role in this. Many carry "passenger genes" (not required for the transposition process) conferring resistance to antibiotics or heavy metals or causing disease in plants and animals. Passenger genes are spread by normal TE transposition activities and by insertion into plasmids, which then spread via conjugation within and across bacterial populations. Thus, an understanding of TE composition and transposition mechanisms is key to developing strategies to combat bacterial pathogenesis. Toward this end, we have developed TnCentral, a bioinformatics resource dedicated to describing and exploring the structural and functional features of prokaryotic TE whose use is intuitive and accessible to users with or without bioinformatics expertise. | 2021 | 34517763 |
| 4452 | 15 | 0.9716 | Whole-Genome Analysis of Acinetobacter baumannii Strain AB43 Containing a Type I-Fb CRISPR-Cas System: Insights into the Relationship with Drug Resistance. The CRISPR-Cas system is a bacterial and archaea adaptive immune system and is a newly recognized mechanism for controlling antibiotic resistance gene transfer. Acinetobacter baumannii (A. baumannii) is an important organism responsible for a variety of nosocomial infections. A. baumannii infections have become problematic worldwide because of the resistance of A. baumannii to multiple antibiotics. Thus, it is clinically significant to explore the relationship between the CRISPR-Cas system and drug resistance in A. baumannii. This study aimed to analyze the genomic characteristics of the A. baumannii strain AB3 containing the type I-Fb CRISPR-Cas system, which was isolated from a tertiary care hospital in China, and to investigate the relationship between the CRISPR-Cas system and antibiotic resistance in this strain. The whole-genome sequencing (WGS) of the AB43 strain was performed using Illumina and PacBio sequencing. The complete genome of AB43 consisted of a 3,854,806 bp chromosome and a 104,309 bp plasmid. The specific characteristics of the CRISPR-Cas system in AB43 are described as follows: (1) The strain AB43 carries a complete type I-Fb CRISPR-Cas system; (2) Homology analysis confirmed that the cas genes in AB43 share high sequence similarity with the same subtype cas genes; (3) A total of 28 of 105 A. baumannii AB43 CRISPR spacers matched genes in the bacteriophage genome database and the plasmid database, implying that the CRISPR-Cas system in AB43 provides immunity against invasive bacteriophage and plasmids; (4) None of the CRISPR spacers in A. baumannii AB43 were matched with antimicrobial resistance genes in the NCBI database. In addition, we analyzed the presence of antibiotic resistance genes and insertion sequences in the AB43 strain and found that the number of antibiotic resistance genes was not lower than in the "no CRISPR-Cas system" strain. This study supports the idea that the CRISPR-Cas system may inhibit drug-resistance gene expression via endogenous gene regulation, except to the published mechanism that the CRISPR-Cas system efficiently limits the acquisition of antibiotic resistance genes that make bacteria sensitive to antibiotics. | 2022 | 36080431 |
| 3020 | 16 | 0.9715 | Combining sequencing approaches to fully resolve a carbapenemase-encoding megaplasmid in a Pseudomonas shirazica clinical strain. Horizontal transfer of plasmids plays a pivotal role in dissemination of antibiotic resistance genes and emergence of multidrug-resistant bacteria. Plasmid sequencing is thus paramount for accurate epidemiological tracking in hospitals and routine surveillance. Combining Nanopore and Illumina sequencing allowed full assembly of a carbapenemase-encoding megaplasmid carried by multidrug-resistant clinical isolate FFUP_PS_41. Average nucleotide identity analyses revealed that FFUP_PS_41 belongs to the recently proposed new species Pseudomonas shirazica, related to the P. putida phylogenetic group. FFUP_PS_41 harbours a 498,516-bp megaplasmid (pJBCL41) with limited similarity to publicly-available plasmids. pJBCL41 contains genes predicted to encode replication, conjugation, partitioning and maintenance functions and heavy metal resistance. The |aacA7|blaVIM-2|aacA4| cassette array (resistance to carbapenems and aminoglycosides) is located within a class 1 integron that is a defective Tn402 derivative. This transposon lies within a 50,273-bp region bound by Tn3-family 38-bp inverted repeats and flanked by 5-bp direct repeats (DR) that composes additional transposon fragments, five insertion sequences and a Tn3-Derived Inverted-Repeat Miniature Element. The hybrid Nanopore/Illumina approach allowed full resolution of a carbapenemase-encoding megaplasmid from P. shirazica. Identification of novel megaplasmids sheds new light on the evolutionary effects of gene transfer and the selective forces driving antibiotic resistance. | 2019 | 31381486 |
| 5119 | 17 | 0.9714 | ROCker models for reliable detection and typing of short-read sequences carrying mcr, erm, mph, and lnu antibiotic resistance genes. Quantitative monitoring of emerging antimicrobial resistance genes (ARGs) using short-read sequences remains challenging due to the high frequency of amino acid functional domains and motifs shared with related but functionally distinct (non-target) proteins. To facilitate ARG monitoring efforts using unassembled short reads, we present novel ROCker models for mcr, mph, erm, and lnu ARG families, as well as models for variants of special public health concern within these families, including mcr-1, mphA, ermB, lnuF, lnuB, and lnuG genes. For this, we curated target gene sequence sets for model training and built these models using the recently updated ROCker V2 pipeline (Gerhardt et al., in review). To validate our models, we simulated reads from the whole genome of ARG-carrying isolates spanning a range of common read lengths and used them to challenge the filtering efficacy of ROCker versus common static filtering approaches, such as similarity searches using BLASTx with various e-value thresholds or hidden Markov models. ROCker models consistently showed F1 scores up to 10× higher (31% higher on average) and lower false-positive (by 30%, on average) and false-negative (by 16%, on average) rates based on 250 bp reads compared to alternative methods. The ROCker models and all related reference materials and data are freely available through http://enve-omics.ce.gatech.edu/rocker/models, further expanding the available model collection previously developed for other genes. Their application to short-read metagenomes, metatranscriptomes, and PCR amplicon data should facilitate more accurate classification and quantification of unassembled short-read sequences for these ARG families and specific genes.IMPORTANCEAntimicrobial resistance gene families encoding erm and mph genes confer resistance to the macrolide class of antimicrobials, which are used to treat a wide range of infections. Similarly, the mcr gene family confers resistance to polymyxin E (colistin), a drug of last resort for many serious drug-resistant bacterial infections, and the lnu gene family confers resistance to lincomycin, which is reserved for patients allergic to penicillin or where bacteria have developed resistance to other antimicrobials. Assessing the prevalence of these genes in clinical or environmental samples and monitoring their spread to new pathogens are thus important for quantifying the associated public health risk. However, detecting these and other resistance genes in short-read sequence data is technically challenging. Our ROCker bioinformatic pipeline achieves reliable detection and typing of broad-range target gene sequences in complex data sets, thus contributing toward solving an important problem in ongoing surveillance efforts of antimicrobial resistance. | 2025 | 41143534 |
| 9879 | 18 | 0.9714 | IntegronFinder 2.0: Identification and Analysis of Integrons across Bacteria, with a Focus on Antibiotic Resistance in Klebsiella. Integrons are flexible gene-exchanging platforms that contain multiple cassettes encoding accessory genes whose order is shuffled by a specific integrase. Integrons embedded within mobile genetic elements often contain multiple antibiotic resistance genes that they spread among nosocomial pathogens and contribute to the current antibiotic resistance crisis. However, most integrons are presumably sedentary and encode a much broader diversity of functions. IntegronFinder is a widely used software to identify novel integrons in bacterial genomes, but has aged and lacks some useful functionalities to handle very large datasets of draft genomes or metagenomes. Here, we present IntegronFinder version 2. We have updated the code, improved its efficiency and usability, adapted the output to incomplete genome data, and added a few novel functions. We describe these changes and illustrate the relevance of the program by analyzing the distribution of integrons across more than 20,000 fully sequenced genomes. We also take full advantage of its novel capabilities to analyze close to 4000 Klebsiella pneumoniae genomes for the presence of integrons and antibiotic resistance genes within them. Our data show that K. pneumoniae has a large diversity of integrons and the largest mobile integron in our database of plasmids. The pangenome of these integrons contains a total of 165 different gene families with most of the largest families being related with resistance to numerous types of antibiotics. IntegronFinder is a free and open-source software available on multiple public platforms. | 2022 | 35456751 |
| 5125 | 19 | 0.9714 | Do we still need Illumina sequencing data? Evaluating Oxford Nanopore Technologies R10.4.1 flow cells and the Rapid v14 library prep kit for Gram negative bacteria whole genome assemblies. The best whole genome assemblies are currently built from a combination of highly accurate short-read sequencing data and long-read sequencing data that can bridge repetitive and problematic regions. Oxford Nanopore Technologies (ONT) produce long-read sequencing platforms and they are continually improving their technology to obtain higher quality read data that is approaching the quality obtained from short-read platforms such as Illumina. As these innovations continue, we evaluated how much ONT read coverage produced by the Rapid Barcoding Kit v14 (SQK-RBK114) is necessary to generate high-quality hybrid and long-read-only genome assemblies for a panel of carbapenemase-producing Enterobacterales bacterial isolates. We found that 30× long-read coverage is sufficient if Illumina data are available, and that more (at least 100× long-read coverage is recommended for long-read-only assemblies. Illumina polishing is still improving single nucleotide variants (SNVs) and INDELs in long-read-only assemblies. We also examined if antimicrobial resistance genes could be accurately identified in long-read-only data, and found that Flye assemblies regardless of ONT coverage detected >96% of resistance genes at 100% identity and length. Overall, the Rapid Barcoding Kit v14 and long-read-only assemblies can be an optimal sequencing strategy (i.e., plasmid characterization and AMR detection) but finer-scale analyses (i.e., SNV) still benefit from short-read data. | 2024 | 38354391 |