ALIGNMENT - Word Related Documents

#	Rank	Similarity	Title + Abs.	Year	PMID
0	1	2	3	4	5
5201	0	0.9909	Complete genome of Enterobacter sichuanensis strain SGAir0282 isolated from air in Singapore. BACKGROUND: Enterobacter cloacae complex (ECC) bacteria, such as E. cloacae, E. sichuanensis, E. kobei, and E. roggenkampii, have been emerging as nosocomial pathogens. Many strains isolated from medical clinics were found to be resistant to antibiotics, and in the worst cases, acquired multidrug resistance. We present the whole genome sequence of SGAir0282, isolated from the outdoor air in Singapore, and its relevance to other ECC bacteria by in silico genomic analysis. RESULTS: Complete genome assembly of E. sichuanensis strain SGAir0282 was generated using PacBio RSII and Illumina MiSeq platforms, and the datasets were used for de novo assembly using Hierarchical Genome Assembly Process (HGAP) and error corrected with Pilon. The genome assembly consisted of a single contig of 4.71 Mb and with a G+C content of 55.5%. No plasmid was detected in the assembly. The genome contained 4371 coding genes, 83 tRNA and 25 rRNA genes, as predicted by NCBI's Prokaryotic Genome Annotation Pipeline (PGAP). Among the genes, the antibiotic resistance related genes were included: Streptothricin acetdyltransferase (SatA), fosfomycin resistance protein (FosA) and metal-dependent hydrolases of the beta-lactamase superfamily I (BLI). CONCLUSION: Based on whole genome alignment and phylogenetic analysis, the strain SGAir0282 was identified to be Enterobacter sichuanensis. The strain possesses gene clusters for virulence, disease and defence, that can also be found in other multidrug resistant ECC type strains.	2020	32127921
5194	1	0.9906	Evaluation of the CosmosID Bioinformatics Platform for Prosthetic Joint-Associated Sonicate Fluid Shotgun Metagenomic Data Analysis. We previously demonstrated that shotgun metagenomic sequencing can detect bacteria in sonicate fluid, providing a diagnosis of prosthetic joint infection (PJI). A limitation of the approach that we used is that data analysis was time-consuming and specialized bioinformatics expertise was required, both of which are barriers to routine clinical use. Fortunately, automated commercial analytic platforms that can interpret shotgun metagenomic data are emerging. In this study, we evaluated the CosmosID bioinformatics platform using shotgun metagenomic sequencing data derived from 408 sonicate fluid samples from our prior study with the goal of evaluating the platform vis-à-vis bacterial detection and antibiotic resistance gene detection for predicting staphylococcal antibacterial susceptibility. Samples were divided into a derivation set and a validation set, each consisting of 204 samples; results from the derivation set were used to establish cutoffs, which were then tested in the validation set for identifying pathogens and predicting staphylococcal antibacterial resistance. Metagenomic analysis detected bacteria in 94.8% (109/115) of sonicate fluid culture-positive PJIs and 37.8% (37/98) of sonicate fluid culture-negative PJIs. Metagenomic analysis showed sensitivities ranging from 65.7 to 85.0% for predicting staphylococcal antibacterial resistance. In conclusion, the CosmosID platform has the potential to provide fast, reliable bacterial detection and identification from metagenomic shotgun sequencing data derived from sonicate fluid for the diagnosis of PJI. Strategies for metagenomic detection of antibiotic resistance genes for predicting staphylococcal antibacterial resistance need further development.	2019	30429253
9076	2	0.9904	ResiDB: An automated database manager for sequence data. The amount of publicly available DNA sequence data is drastically increasing, making it a tedious task to create sequence databases necessary for the design of diagnostic assays. The selection of appropriate sequences is especially challenging in genes affected by frequent point mutations such as antibiotic resistance genes. To overcome this issue, we have designed the webtool resiDB, a rapid and user-friendly sequence database manager for bacteria, fungi, viruses, protozoa, invertebrates, plants, archaea, environmental and whole genome shotgun sequence data. It automatically identifies and curates sequence clusters to create custom sequence databases based on user-defined input sequences. A collection of helpful visualization tools gives the user the opportunity to easily access, evaluate, edit, and download the newly created database. Consequently, researchers do no longer have to manually manage sequence data retrieval, deal with hardware limitations, and run multiple independent software tools, each having its own requirements, input and output formats. Our tool was developed within the H2020 project FAPIC aiming to develop a single diagnostic assay targeting all sepsis-relevant pathogens and antibiotic resistance mechanisms. ResiDB is freely accessible to all users through https://residb.ait.ac.at/.	2021	33495705
5193	3	0.9903	Antibiotic resistance genes prediction via whole genome sequence analysis of Stenotrophomonas maltophilia. BACKGROUND: Stenotrophomonas maltophilia (S. maltophilia) is the first dominant ubiquitous bacterial species identified from the genus Stenotrophomonas in 1943 from a human source. S. maltophilia clinical strains are resistance to several therapies, this study is designed to investigate the whole genome sequence and antimicrobial resistance genes prediction in Stenotrophomonas maltophilia (S. maltophilia) SARC-5 and SARC-6 strains, isolated from the nasopharyngeal samples of an immunocompromised patient. METHODS: These bacterial strains were obtained from Pakistan Institute of Medical Sciences (PIMS) Hospital, Pakistan. The bacterial genome was sequenced using a whole-genome shotgun via a commercial service that used an NGS (Next Generation Sequencing) technology called as Illumina Hiseq 2000 system for genomic sequencing. Moreover, detailed in-silico analyses were done to predict the presence of antibiotic resistance genes in S. maltophilia. RESULTS: Results showed that S. maltophilia is a rare gram negative, rod-shaped, non sporulating bacteria. The genome assembly results in 24 contigs (>500 bp) having a size of 4668,850 bp with 65.8% GC contents. Phylogenetic analysis showed that SARC-5 and SARC-6 were closely related to S. maltophilia B111, S. maltophilia BAB-5317, S. maltophilia AHL, S. maltophilia BAB-5307, S. maltophilia RD-AZPVI_04, S. maltophilia JFZ2, S. maltophilia RD_MAAMIB_06 and lastly with S. maltophilia sp ROi7. Moreover, the whole genome sequence analysis of both SARC-5 and SARC-6 revealed the presence of four resistance genes adeF, qacG, adeF, and smeR. CONCLUSION: Our study confirmed that S. maltophilia SARC-5 and SARC-6 are one of the leading causes of nosocomial infection which carry multiple antibiotic resistance genes.	2024	38128408
9068	4	0.9903	TnCentral: a Prokaryotic Transposable Element Database and Web Portal for Transposon Analysis. We describe here the structure and organization of TnCentral (https://tncentral.proteininformationresource.org/ [or the mirror link at https://tncentral.ncc.unesp.br/]), a web resource for prokaryotic transposable elements (TE). TnCentral currently contains ∼400 carefully annotated TE, including transposons from the Tn3, Tn7, Tn402, and Tn554 families; compound transposons; integrons; and associated insertion sequences (IS). These TE carry passenger genes, including genes conferring resistance to over 25 classes of antibiotics and nine types of heavy metal, as well as genes responsible for pathogenesis in plants, toxin/antitoxin gene pairs, transcription factors, and genes involved in metabolism. Each TE has its own entry page, providing details about its transposition genes, passenger genes, and other sequence features required for transposition, as well as a graphical map of all features. TnCentral content can be browsed and queried through text- and sequence-based searches with a graphic output. We describe three use cases, which illustrate how the search interface, results tables, and entry pages can be used to explore and compare TE. TnCentral also includes downloadable software to facilitate user-driven identification, with manual annotation, of certain types of TE in genomic sequences. Through the TnCentral homepage, users can also access TnPedia, which provides comprehensive reviews of the major TE families, including an extensive general section and specialized sections with descriptions of insertion sequence and transposon families. TnCentral and TnPedia are intuitive resources that can be used by clinicians and scientists to assess TE diversity in clinical, veterinary, and environmental samples. IMPORTANCE The ability of bacteria to undergo rapid evolution and adapt to changing environmental circumstances drives the public health crisis of multiple antibiotic resistance, as well as outbreaks of disease in economically important agricultural crops and animal husbandry. Prokaryotic transposable elements (TE) play a critical role in this. Many carry "passenger genes" (not required for the transposition process) conferring resistance to antibiotics or heavy metals or causing disease in plants and animals. Passenger genes are spread by normal TE transposition activities and by insertion into plasmids, which then spread via conjugation within and across bacterial populations. Thus, an understanding of TE composition and transposition mechanisms is key to developing strategies to combat bacterial pathogenesis. Toward this end, we have developed TnCentral, a bioinformatics resource dedicated to describing and exploring the structural and functional features of prokaryotic TE whose use is intuitive and accessible to users with or without bioinformatics expertise.	2021	34517763
5464	5	0.9902	Genomic and resistome analysis of Alcaligenes faecalis strain PGB1 by Nanopore MinION and Illumina Technologies. BACKGROUND: Drug-resistant bacteria are important carriers of antibiotic-resistant genes (ARGs). This fact is crucial for the development of precise clinical drug treatment strategies. Long-read sequencing platforms such as the Oxford Nanopore sequencer can improve genome assembly efficiency particularly when they are combined with short-read sequencing data. RESULTS: Alcaligenes faecalis PGB1 was isolated and identified with resistance to penicillin and three other antibiotics. After being sequenced by Nanopore MinION and Illumina sequencer, its entire genome was hybrid-assembled. One chromosome and one plasmid was assembled and annotated with 4,433 genes (including 91 RNA genes). Function annotation and comparison between strains were performed. A phylogenetic analysis revealed that it was closest to A. faecalis ZD02. Resistome related sequences was explored, including ARGs, Insert sequence, phage. Two plasmid aminoglycoside genes were determined to be acquired ARGs. The main ARG category was antibiotic efflux resistance and β-lactamase (EC 3.5.2.6) of PGB1 was assigned to Class A, Subclass A1b, and Cluster LSBL3. CONCLUSIONS: The present study identified the newly isolated bacterium A. faecalis PGB1 and systematically annotated its genome sequence and ARGs.	2022	35443609
5119	6	0.9901	ROCker models for reliable detection and typing of short-read sequences carrying mcr, erm, mph, and lnu antibiotic resistance genes. Quantitative monitoring of emerging antimicrobial resistance genes (ARGs) using short-read sequences remains challenging due to the high frequency of amino acid functional domains and motifs shared with related but functionally distinct (non-target) proteins. To facilitate ARG monitoring efforts using unassembled short reads, we present novel ROCker models for mcr, mph, erm, and lnu ARG families, as well as models for variants of special public health concern within these families, including mcr-1, mphA, ermB, lnuF, lnuB, and lnuG genes. For this, we curated target gene sequence sets for model training and built these models using the recently updated ROCker V2 pipeline (Gerhardt et al., in review). To validate our models, we simulated reads from the whole genome of ARG-carrying isolates spanning a range of common read lengths and used them to challenge the filtering efficacy of ROCker versus common static filtering approaches, such as similarity searches using BLASTx with various e-value thresholds or hidden Markov models. ROCker models consistently showed F1 scores up to 10× higher (31% higher on average) and lower false-positive (by 30%, on average) and false-negative (by 16%, on average) rates based on 250 bp reads compared to alternative methods. The ROCker models and all related reference materials and data are freely available through http://enve-omics.ce.gatech.edu/rocker/models, further expanding the available model collection previously developed for other genes. Their application to short-read metagenomes, metatranscriptomes, and PCR amplicon data should facilitate more accurate classification and quantification of unassembled short-read sequences for these ARG families and specific genes.IMPORTANCEAntimicrobial resistance gene families encoding erm and mph genes confer resistance to the macrolide class of antimicrobials, which are used to treat a wide range of infections. Similarly, the mcr gene family confers resistance to polymyxin E (colistin), a drug of last resort for many serious drug-resistant bacterial infections, and the lnu gene family confers resistance to lincomycin, which is reserved for patients allergic to penicillin or where bacteria have developed resistance to other antimicrobials. Assessing the prevalence of these genes in clinical or environmental samples and monitoring their spread to new pathogens are thus important for quantifying the associated public health risk. However, detecting these and other resistance genes in short-read sequence data is technically challenging. Our ROCker bioinformatic pipeline achieves reliable detection and typing of broad-range target gene sequences in complex data sets, thus contributing toward solving an important problem in ongoing surveillance efforts of antimicrobial resistance.	2025	41143534
5125	7	0.9901	Do we still need Illumina sequencing data? Evaluating Oxford Nanopore Technologies R10.4.1 flow cells and the Rapid v14 library prep kit for Gram negative bacteria whole genome assemblies. The best whole genome assemblies are currently built from a combination of highly accurate short-read sequencing data and long-read sequencing data that can bridge repetitive and problematic regions. Oxford Nanopore Technologies (ONT) produce long-read sequencing platforms and they are continually improving their technology to obtain higher quality read data that is approaching the quality obtained from short-read platforms such as Illumina. As these innovations continue, we evaluated how much ONT read coverage produced by the Rapid Barcoding Kit v14 (SQK-RBK114) is necessary to generate high-quality hybrid and long-read-only genome assemblies for a panel of carbapenemase-producing Enterobacterales bacterial isolates. We found that 30× long-read coverage is sufficient if Illumina data are available, and that more (at least 100× long-read coverage is recommended for long-read-only assemblies. Illumina polishing is still improving single nucleotide variants (SNVs) and INDELs in long-read-only assemblies. We also examined if antimicrobial resistance genes could be accurately identified in long-read-only data, and found that Flye assemblies regardless of ONT coverage detected >96% of resistance genes at 100% identity and length. Overall, the Rapid Barcoding Kit v14 and long-read-only assemblies can be an optimal sequencing strategy (i.e., plasmid characterization and AMR detection) but finer-scale analyses (i.e., SNV) still benefit from short-read data.	2024	38354391
9742	8	0.9901	BOCS: DNA k-mer content and scoring for rapid genetic biomarker identification at low coverage. A single, inexpensive diagnostic test capable of rapidly identifying a wide range of genetic biomarkers would prove invaluable in precision medicine. Previous work has demonstrated the potential for high-throughput, label-free detection of A-G-C-T content in DNA k-mers, providing an alternative to single-letter sequencing while also having inherent lossy data compression and massively parallel data acquisition. Here, we apply a new bioinformatics algorithm - block optical content scoring (BOCS) - capable of using the high-throughput content k-mers for rapid, broad-spectrum identification of genetic biomarkers. BOCS uses content-based sequence alignment for probabilistic mapping of k-mer contents to gene sequences within a biomarker database, resulting in a probability ranking of genes on a content score. Simulations of the BOCS algorithm reveal high accuracy for identification of single antibiotic resistance genes, even in the presence of significant sequencing errors (100% accuracy for no sequencing errors, and >90% accuracy for sequencing errors at 20%), and at well below full coverage of the genes. Simulations for detecting multiple resistance genes within a methicillin-resistant Staphylococcus aureus (MRSA) strain showed 100% accuracy at an average gene coverage of merely 0.515, when the k-mer lengths were variable and with 4% sequencing error within the k-mer blocks. Extension of BOCS to cancer and other genetic diseases met or exceeded the results for resistance genes. Combined with a high-throughput content-based sequencing technique, the BOCS algorithm potentiates a test capable of rapid diagnosis and profiling of genetic biomarkers ranging from antibiotic resistance to cancer and other genetic diseases.	2019	31173943
5122	9	0.9900	Clinical long-read metagenomic sequencing of culture-negative infective endocarditis reveals genomic features and antimicrobial resistance. BACKGROUND: Infective endocarditis (IE) poses significant diagnostic challenges, particularly in blood culture-negative cases where fastidious bacteria evade detection. Metagenomic-based nanopore sequencing enables rapid pathogen detection and provides a new approach for the diagnosis of IE. METHOD: Two cases of blood culture-negative infective endocarditis (IE) were analyzed using nanopore sequencing with an in silico host-depletion approach. Complete genome reconstruction and antimicrobial resistance gene annotation were successfully performed. RESULTS: Within an hour of sequencing, EPI2ME classified nanopore reads, identifying Corynebacterium striatum in IE patient 1 and Granulicatella adiacens in IE patient 2. After 18 h, long-read sequencing successfully reconstructed a single circular genome of C. striatum in IE patient 1, whereas short-read sequencing was used to compare but produced fragmented assemblies. Based on these results, long-read sequencing was exclusively used for IE patient 2, allowing for the complete and accurate assembly of G. adiacens, confirming the presence of these bacteria in the clinical samples. In addition to pathogen identification, antimicrobial resistance (AMR) genes were detected in both genomes. Notably, in C. striatum, regions containing a class 1 integron and multiple novel mobile genetic elements (ISCost1, ISCost2, Tn7838 and Tn7839) were identified, collectively harbouring six AMR genes. This is the first report of such elements in C. striatum, highlighting the potential of nanopore long-read sequencing for comprehensive pathogen characterization in IE cases. CONCLUSIONS: This study highlights the effectiveness of host-depleted, long-read nanopore metagenomics for direct pathogen identification and accurate genome reconstruction, including antimicrobial resistance gene detection. The approach enables same-day diagnostic reporting within a matter of hours. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12879-025-11741-5.	2025	41087996
9071	10	0.9900	RAC: Repository of Antibiotic resistance Cassettes. Antibiotic resistance in bacteria is often due to acquisition of resistance genes associated with different mobile genetic elements. In Gram-negative bacteria, many resistance genes are found as part of small mobile genetic elements called gene cassettes, generally found integrated into larger elements called integrons. Integrons carrying antibiotic resistance gene cassettes are often associated with mobile elements and here are designated 'mobile resistance integrons' (MRIs). More than one cassette can be inserted in the same integron to create arrays that contribute to the spread of multi-resistance. In many sequences in databases such as GenBank, only the genes within cassettes, rather than whole cassettes, are annotated and the same gene/cassette may be given different names in different entries, hampering analysis. We have developed the Repository of Antibiotic resistance Cassettes (RAC) website to provide an archive of gene cassettes that includes alternative gene names from multiple nomenclature systems and allows the community to contribute new cassettes. RAC also offers an additional function that allows users to submit sequences containing cassettes or arrays for annotation using the automatic annotation system Attacca. Attacca recognizes features (gene cassettes, integron regions) and identifies cassette arrays as patterns of features and can also distinguish minor cassette variants that may encode different resistance phenotypes (aacA4 cassettes and bla cassettes-encoding β-lactamases). Gaps in annotations are manually reviewed and those found to correspond to novel cassettes are assigned unique names. While there are other websites dedicated to integrons or antibiotic resistance genes, none includes a complete list of antibiotic resistance gene cassettes in MRI or offers consistent annotation and appropriate naming of all of these cassettes in submitted sequences. RAC thus provides a unique resource for researchers, which should reduce confusion and improve the quality of annotations of gene cassettes in integrons associated with antibiotic resistance. DATABASE URL: http://www2.chi.unsw.edu.au/rac.	2011	22140215
9074	11	0.9900	BacAnt: A Combination Annotation Server for Bacterial DNA Sequences to Identify Antibiotic Resistance Genes, Integrons, and Transposable Elements. Whole genome sequencing (WGS) of bacteria has become a routine method in diagnostic laboratories. One of the clinically most useful advantages of WGS is the ability to predict antimicrobial resistance genes (ARGs) and mobile genetic elements (MGEs) in bacterial sequences. This allows comprehensive investigations of such genetic features but can also be used for epidemiological studies. A plethora of software programs have been developed for the detailed annotation of bacterial DNA sequences, such as rapid annotation using subsystem technology (RAST), Resfinder, ISfinder, INTEGRALL and The Transposon Registry. Unfortunately, to this day, a reliable annotation tool of the combination of ARGs and MGEs is not available, and the generation of genbank files requires much manual input. Here, we present a new webserver which allows the annotation of ARGs, integrons and transposable elements at the same time. The pipeline generates genbank files automatically, which are compatible with Easyfig for comparative genomic analysis. Our BacAnt code and standalone software package are available at https://github.com/xthua/bacant with an accompanying web application at http://bacant.net.	2021	34367079
5124	12	0.9899	Oxford nanopore long-read sequencing enables the generation of complete bacterial and plasmid genomes without short-read sequencing. INTRODUCTION: Genome-based analysis is crucial in monitoring antibiotic-resistant bacteria (ARB)and antibiotic-resistance genes (ARGs). Short-read sequencing is typically used to obtain incomplete draft genomes, while long-read sequencing can obtain genomes of multidrug resistance (MDR) plasmids and track the transmission of plasmid-borne antimicrobial resistance genes in bacteria. However, long-read sequencing suffers from low-accuracy base calling, and short-read sequencing is often required to improve genome accuracy. This increases costs and turnaround time. METHODS: In this study, a novel ONT sequencing method is described, which uses the latest ONT chemistry with improved accuracy to assemble genomes of MDR strains and plasmids from long-read sequencing data only. Three strains of Salmonella carrying MDR plasmids were sequenced using the ONT SQK-LSK114 kit with flow cell R10.4.1, and de novo genome assembly was performed with average read accuracy (Q > 10) of 98.9%. RESULTS AND DISCUSSION: For a 5-Mb-long bacterial genome, finished genome sequences with accuracy of >99.99% could be obtained at 75× sequencing coverage depth using Flye and Medaka software. Thus, this new ONT method greatly improves base-calling accuracy, allowing for the de novo assembly of high-quality finished bacterial or plasmid genomes without the need for short-read sequencing. This saves both money and time and supports the application of ONT data in critical genome-based epidemiological analyses. The novel ONT approach described in this study can take the place of traditional combination genome assembly based on short- and long-read sequencing, enabling pangenomic analyses based on high-quality complete bacterial and plasmid genomes to monitor the spread of antibiotic-resistant bacteria and antibiotic resistance genes.	2023	37256057
9083	13	0.9899	ARGNet: using deep neural networks for robust identification and classification of antibiotic resistance genes from sequences. BACKGROUND: Emergence of antibiotic resistance in bacteria is an important threat to global health. Antibiotic resistance genes (ARGs) are some of the key components to define bacterial resistance and their spread in different environments. Identification of ARGs, particularly from high-throughput sequencing data of the specimens, is the state-of-the-art method for comprehensively monitoring their spread and evolution. Current computational methods to identify ARGs mainly rely on alignment-based sequence similarities with known ARGs. Such approaches are limited by choice of reference databases and may potentially miss novel ARGs. The similarity thresholds are usually simple and could not accommodate variations across different gene families and regions. It is also difficult to scale up when sequence data are increasing. RESULTS: In this study, we developed ARGNet, a deep neural network that incorporates an unsupervised learning autoencoder model to identify ARGs and a multiclass classification convolutional neural network to classify ARGs that do not depend on sequence alignment. This approach enables a more efficient discovery of both known and novel ARGs. ARGNet accepts both amino acid and nucleotide sequences of variable lengths, from partial (30-50 aa; 100-150 nt) sequences to full-length protein or genes, allowing its application in both target sequencing and metagenomic sequencing. Our performance evaluation showed that ARGNet outperformed other deep learning models including DeepARG and HMD-ARG in most of the application scenarios especially quasi-negative test and the analysis of prediction consistency with phylogenetic tree. ARGNet has a reduced inference runtime by up to 57% relative to DeepARG. CONCLUSIONS: ARGNet is flexible, efficient, and accurate at predicting a broad range of ARGs from the sequencing data. ARGNet is freely available at https://github.com/id-bioinfo/ARGNet , with an online service provided at https://ARGNet.hku.hk . Video Abstract.	2024	38725076
1795	14	0.9898	Accessory genome of the multi-drug resistant ocular isolate of Pseudomonas aeruginosa PA34. Bacteria can acquire an accessory genome through the horizontal transfer of genetic elements from non-parental lineages. This leads to rapid genetic evolution allowing traits such as antibiotic resistance and virulence to spread through bacterial communities. The study of complete genomes of bacterial strains helps to understand the genomic traits associated with virulence and antibiotic resistance. We aimed to investigate the complete accessory genome of an ocular isolate of Pseudomonas aeruginosa strain PA34. We obtained the complete genome of PA34 utilising genome sequence reads from Illumina and Oxford Nanopore Technology followed by PCR to close any identified gaps. In-depth genomic analysis was performed using various bioinformatics tools. The susceptibility to heavy metals and cytotoxicity was determined to confirm expression of certain traits. The complete genome of PA34 includes a chromosome of 6.8 Mbp and two plasmids of 95.4 Kbp (pMKPA34-1) and 26.8 Kbp (pMKPA34-2). PA34 had a large accessory genome of 1,213 genes and had 543 unique genes not present in other strains. These exclusive genes encoded features related to metal and antibiotic resistance, phage integrase and transposons. At least 24 genomic islands (GIs) were predicated in the complete chromosome, of which two were integrated into novel sites. Eleven GIs carried virulence factors or replaced pathogenic genes. A bacteriophage carried the aminoglycoside resistance gene (AAC(3)-IId). The two plasmids carried other six antibiotic resistance genes. The large accessory genome of this ocular isolate plays a large role in shaping its virulence and antibiotic resistance.	2019	30986237
5184	15	0.9898	In silico evaluation of genomic characteristics of Streptococcus infantarius subsp. infantarius for application in fermentations. This study aims to evaluate the in silico genomic characteristics of Streptococcus infantarius subsp. infantarius, isolated from Coalho cheese from Paraíba, Brazil, with a view to application in lactic fermentations. rRNA sequences from the 16S ribosomal region were used as input to GenBank, in the search for patterns that could reveal a non-pathogenic behavior of S. infantarius subsp. infantarius, comparing mobile genetic elements, antibiotic resistance genes, pan-genome analysis and multi-genome alignment among related species. S. infantarius subsp. infantarius CJ18 was the only complete genome reported by BLAST/NCBI with high similarity and after comparative genetics with complete genomes of Streptococcus agalactiae (SAG153, NJ1606) and Streptococcus thermophilus (ST106, CS18, IDCC2201, APC151) revealed that CJ18 showed a low number of transposases and integrases, infection by phage bacteria of the Streptococcus genus, absence of antibiotic resistance genes and presence of bacteriocin, folate and riboflavin producing genes. The genome alignment revealed that the collinear blocks of S. thermophilus ST106 and S. agalactiae SAG153 have inverted blocks when compared to the CJ18 genome due to gene positioning, insertions and deletions. Therefore, the strains of S. infantarius subsp. infantarius isolated from Coalho cheese from Paraíba showed genomic similarity with CJ18 and the mobility of genes analyzed in silico showed absence of pathogenicity throughout the genome of CJ18, indicating the potential of these strains for the dairy industry.	2022	36417612
4458	16	0.9898	Insight into the plasmid metagenome of wastewater treatment plant bacteria showing reduced susceptibility to antimicrobial drugs analysed by the 454-pyrosequencing technology. Wastewater treatment plants (WWTPs) are a reservoir for bacteria harbouring antibiotic resistance plasmids. To get a comprehensive overview on the plasmid metagenome of WWTP bacteria showing reduced susceptibility to certain antimicrobial drugs an ultrafast sequencing approach applying the 454-technology was carried out. One run on the GS 20 System yielded 346,427 reads with an average read length of 104 bases resulting in a total of 36,071,493 bases sequence data. The obtained plasmid metagenome was analysed and functionally annotated by means of the Sequence Analysis and Management System (SAMS) software package. Known plasmid genes could be identified within the WWTP plasmid metagenome data set by BLAST searches using the NCBI Plasmid Database. Most abundant hits represent genes involved in plasmid replication, stability, mobility and transposition. Mapping of plasmid metagenome reads to completely sequenced plasmids revealed that many sequences could be assigned to the cryptic pAsa plasmids previously identified in Aeromonas salmonicida subsp. salmonicida and to the accessory modules of the conjugative IncU resistance plasmid pFBAOT6 of Aeromonas punctata. Matches of sequence reads to antibiotic resistance genes indicate that plasmids from WWTP bacteria encode resistances to all major classes of antimicrobial drugs. Plasmid metagenome sequence reads could be assembled into 605 contigs with a minimum length of 500 bases. Contigs predominantly encode plasmid survival functions and transposition enzymes.	2008	18586057
9070	17	0.9897	Automated annotation of mobile antibiotic resistance in Gram-negative bacteria: the Multiple Antibiotic Resistance Annotator (MARA) and database. BACKGROUND: Multiresistance in Gram-negative bacteria is often due to acquisition of several different antibiotic resistance genes, each associated with a different mobile genetic element, that tend to cluster together in complex conglomerations. Accurate, consistent annotation of resistance genes, the boundaries and fragments of mobile elements, and signatures of insertion, such as DR, facilitates comparative analysis of complex multiresistance regions and plasmids to better understand their evolution and how resistance genes spread. OBJECTIVES: To extend the Repository of Antibiotic resistance Cassettes (RAC) web site, which includes a database of 'features', and the Attacca automatic DNA annotation system, to encompass additional resistance genes and all types of associated mobile elements. METHODS: Antibiotic resistance genes and mobile elements were added to RAC, from existing registries where possible. Attacca grammars were extended to accommodate the expanded database, to allow overlapping features to be annotated and to identify and annotate features such as composite transposons and DR. RESULTS: The Multiple Antibiotic Resistance Annotator (MARA) database includes antibiotic resistance genes and selected mobile elements from Gram-negative bacteria, distinguishing important variants. Sequences can be submitted to the MARA web site for annotation. A list of positions and orientations of annotated features, indicating those that are truncated, DR and potential composite transposons is provided for each sequence, as well as a diagram showing annotated features approximately to scale. CONCLUSIONS: The MARA web site (http://mara.spokade.com) provides a comprehensive database for mobile antibiotic resistance in Gram-negative bacteria and accurately annotates resistance genes and associated mobile elements in submitted sequences to facilitate comparative analysis.	2018	29373760
9067	18	0.9897	PIPdb: a comprehensive plasmid sequence resource for tracking the horizontal transfer of pathogenic factors and antimicrobial resistance genes. Plasmids, as independent genetic elements, carrying resistance or virulence genes and transfer them among different pathogens, posing a significant threat to human health. Under the 'One Health' approach, it is crucial to control the spread of plasmids carrying such genes. To achieve this, a comprehensive characterization of plasmids in pathogens is essential. Here we present the Plasmids in Pathogens Database (PIPdb), a pioneering resource that includes 792 964 plasmid segment clusters (PSCs) derived from 1 009 571 assembled genomes across 450 pathogenic species from 110 genera. To our knowledge, PIPdb is the first database specifically dedicated to plasmids in pathogenic bacteria, offering detailed multi-dimensional metadata such as collection date, geographical origin, ecosystem, host taxonomy, and habitat. PIPdb also provides extensive functional annotations, including plasmid type, insertion sequences, integron, oriT, relaxase, T4CP, virulence factors genes, heavy metal resistance genes and antibiotic resistance genes. The database features a user-friendly interface that facilitates studies on plasmids across diverse host taxa, habitats, and ecosystems, with a focus on those carrying antimicrobial resistance genes (ARGs). We have integrated online tools for plasmid identification and annotation from assembled genomes. Additionally, PIPdb includes a risk-scoring system for identifying potentially high-risk plasmids. The PIPdb web interface is accessible at https://nmdc.cn/pipdb.	2025	39460620
9072	19	0.9897	PanGeT: Pan-genomics tool. A decade after the concept of Pan-genome was first introduced; research in this field has spread its tentacles to areas such as pathogenesis of diseases, bacterial evolutionary studies and drug resistance. Gene content-based differentiation of virulent and a virulent strains of bacteria and identification of pathogen specific genes is imperative to understand their physiology and gain insights into the mechanism of genome evolution. Subsequently, this will aid in identifying diagnostic targets and in developing and selecting vaccines. The root of pan-genomic studies, however, is to identify the core genes, dispensable genes and strain specific genes across the genomes belonging to a clade. To this end, we have developed a tool, "PanGeT - Pan-genomics Tool" to compute the 'pan-genome' based on comparisons at the genome as well as the proteome levels. This automated tool is implemented using LaTeX libraries for effective visualization of overall pan-genome through graphical plots. Links to retrieve sequence information and functional annotations have also been provided. PanGeT can be downloaded from http://pranag.physics.iisc.ernet.in/PanGeT/ or https://github.com/PanGeTv1/PanGeT.	2017	27851981