# | Rank | Similarity | Title + Abs. | Year | PMID |
|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 |
| 8711 | 0 | 0.9981 | Novel soil bacteria possess diverse genes for secondary metabolite biosynthesis. In soil ecosystems, microorganisms produce diverse secondary metabolites such as antibiotics, antifungals and siderophores that mediate communication, competition and interactions with other organisms and the environment(1,2). Most known antibiotics are derived from a few culturable microbial taxa (3) , and the biosynthetic potential of the vast majority of bacteria in soil has rarely been investigated (4) . Here we reconstruct hundreds of near-complete genomes from grassland soil metagenomes and identify microorganisms from previously understudied phyla that encode diverse polyketide and nonribosomal peptide biosynthetic gene clusters that are divergent from well-studied clusters. These biosynthetic loci are encoded by newly identified members of the Acidobacteria, Verrucomicobia and Gemmatimonadetes, and the candidate phylum Rokubacteria. Bacteria from these groups are highly abundant in soils(5-7), but have not previously been genomically linked to secondary metabolite production with confidence. In particular, large numbers of biosynthetic genes were characterized in newly identified members of the Acidobacteria, which is the most abundant bacterial phylum across soil biomes (5) . We identify two acidobacterial genomes from divergent lineages, each of which encodes an unusually large repertoire of biosynthetic genes with up to fifteen large polyketide and nonribosomal peptide biosynthetic loci per genome. To track gene expression of genes encoding polyketide synthases and nonribosomal peptide synthetases in the soil ecosystem that we studied, we sampled 120 time points in a microcosm manipulation experiment and, using metatranscriptomics, found that gene clusters were differentially co-expressed in response to environmental perturbations. Transcriptional co-expression networks for specific organisms associated biosynthetic genes with two-component systems, transcriptional activation, putative antimicrobial resistance and iron regulation, linking metabolite biosynthesis to processes of environmental sensing and ecological competition. We conclude that the biosynthetic potential of abundant and phylogenetically diverse soil microorganisms has previously been underestimated. These organisms may represent a source of natural products that can address needs for new antibiotics and other pharmaceutical compounds. | 2018 | 29899444 |
| 9079 | 1 | 0.9980 | Review, Evaluation, and Directions for Gene-Targeted Assembly for Ecological Analyses of Metagenomes. Shotgun metagenomics has greatly advanced our understanding of microbial communities over the last decade. Metagenomic analyses often include assembly and genome binning, computationally daunting tasks especially for big data from complex environments such as soil and sediments. In many studies, however, only a subset of genes and pathways involved in specific functions are of interest; thus, it is not necessary to attempt global assembly. In addition, methods that target genes can be computationally more efficient and produce more accurate assembly by leveraging rich databases, especially for those genes that are of broad interest such as those involved in biogeochemical cycles, biodegradation, and antibiotic resistance or used as phylogenetic markers. Here, we review six gene-targeted assemblers with unique algorithms for extracting and/or assembling targeted genes: Xander, MegaGTA, SAT-Assembler, HMM-GRASPx, GenSeed-HMM, and MEGAN. We tested these tools using two datasets with known genomes, a synthetic community of artificial reads derived from the genomes of 17 bacteria, shotgun sequence data from a mock community with 48 bacteria and 16 archaea genomes, and a large soil shotgun metagenomic dataset. We compared assemblies of a universal single copy gene (rplB) and two N cycle genes (nifH and nirK). We measured their computational efficiency, sensitivity, specificity, and chimera rate and found Xander and MegaGTA, which both use a probabilistic graph structure to model the genes, have the best overall performance with all three datasets, although MEGAN, a reference matching assembler, had better sensitivity with synthetic and mock community members chosen from its reference collection. Also, Xander and MegaGTA are the only tools that include post-assembly scripts tuned for common molecular ecology and diversity analyses. Additionally, we provide a mathematical model for estimating the probability of assembling targeted genes in a metagenome for estimating required sequencing depth. | 2019 | 31749830 |
| 9345 | 2 | 0.9979 | Replacement of the arginine biosynthesis operon in Xanthomonadales by lateral gene transfer. The role of lateral gene transfer (LGT) in prokaryotes has been shown to rapidly change the genome content, providing new gene tools for environmental adaptation. Features related to pathogenesis and resistance to strong selective conditions have been widely shown to be products of gene transfer between bacteria. The genomes of the gamma-proteobacteria from the genus Xanthomonas, composed mainly of phytopathogens, have potential genomic islands that may represent imprints of such evolutionary processes. In this work, the evolution of genes involved in the pathway responsible for arginine biosynthesis in Xanthomonadales was investigated, and several lines of evidence point to the foreign origin of the arg genes clustered within a potential operon. Their presence inside a potential genomic island, bordered by a tRNA gene, the unusual ranking of sequence similarity, and the atypical phylogenies indicate that the metabolic pathway for arginine biosynthesis was acquired through LGT in the Xanthomonadales group. Moreover, although homologues were also found in Bacteroidetes (Flavobacteria group), for many of the genes analyzed close homologues are detected in different life domains (Eukarya and Archaea), indicating that the source of these arg genes may have been outside the Bacteria clade. The possibility of replacement of a complete primary metabolic pathway by LGT events supports the selfish operon hypothesis and may occur only under very special environmental conditions. Such rare events reveal part of the history of these interesting mosaic Xanthomonadales genomes, disclosing the importance of gene transfer modifying primary metabolism pathways and extending the scenario for bacterial genome evolution. | 2008 | 18305979 |
| 8367 | 3 | 0.9979 | A hybrid NRPS-PKS gene cluster related to the bleomycin family of antitumor antibiotics in Alteromonas macleodii strains. Although numerous marine bacteria are known to produce antibiotics via hybrid NRPS-PKS gene clusters, none have been previously described in an Alteromonas species. In this study, we describe in detail a novel hybrid NRPS-PKS cluster identified in the plasmid of the Alteromonasmacleodii strain AltDE1 and analyze its relatedness to other similar gene clusters in a sequence-based characterization. This is a mobile cluster, flanked by transposase-like genes, that has even been found inserted into the chromosome of some Alteromonasmacleodii strains. The cluster contains separate genes for NRPS and PKS activity. The sole PKS gene appears to carry a novel acyltransferase domain, quite divergent from those currently characterized. The predicted specificities of the adenylation domains of the NRPS genes suggest that the final compound has a backbone very similar to bleomycin related compounds. However, the lack of genes involved in sugar biosynthesis indicates that the final product is not a glycopeptide. Even in the absence of these genes, the presence of the cluster appears to confer complete or partial resistance to phleomycin, which may be attributed to a bleomycin-resistance-like protein identified within the cluster. This also suggests that the compound still shares significant structural similarity to bleomycin. Moreover, transcriptomic evidence indicates that the NRPS-PKS cluster is expressed. Such sequence-based approaches will be crucial to fully explore and analyze the diversity and potential of secondary metabolite production, especially from increasingly important sources like marine microbes. | 2013 | 24069455 |
| 9848 | 4 | 0.9978 | Cargo Genes of Tn7-Like Transposons Comprise an Enormous Diversity of Defense Systems, Mobile Genetic Elements, and Antibiotic Resistance Genes. Transposition is a major mechanism of horizontal gene mobility in prokaryotes. However, exploration of the genes mobilized by transposons (cargo) is hampered by the difficulty in delineating integrated transposons from their surrounding genetic context. Here, we present a computational approach that allowed us to identify the boundaries of 6,549 Tn7-like transposons. We found that 96% of these transposons carry at least one cargo gene. Delineation of distinct communities in a gene-sharing network demonstrates how transposons function as a conduit of genes between phylogenetically distant hosts. Comparative analysis of the cargo genes reveals significant enrichment of mobile genetic elements (MGEs) nested within Tn7-like transposons, such as insertion sequences and toxin-antitoxin modules, and of genes involved in recombination, anti-MGE defense, and antibiotic resistance. More unexpectedly, cargo also includes genes encoding central carbon metabolism enzymes. Twenty-two Tn7-like transposons carry both an anti-MGE defense system and antibiotic resistance genes, illustrating how bacteria can overcome these combined pressures upon acquisition of a single transposon. This work substantially expands the distribution of Tn7-like transposons, defines their evolutionary relationships, and provides a large-scale functional classification of prokaryotic genes mobilized by transposition. IMPORTANCE Transposons are major vehicles of horizontal gene transfer that, in addition to genes directly involved in transposition, carry cargo genes. However, characterization of these genes is hampered by the difficulty of identification of transposon boundaries. We developed a computational approach for detecting transposon ends and applied it to perform a comprehensive census of the cargo genes of Tn7-like transposons, a large class of bacterial mobile genetic elements (MGE), many of which employ a unique, CRISPR-mediated mechanism of site-specific transposition. The cargo genes encompass a striking diversity of MGE, defense, and antibiotic resistance systems. Unexpectedly, we also identified cargo genes encoding metabolic enzymes. Thus, Tn7-like transposons mobilize a vast repertoire of genes that can have multiple effects on the host bacteria. | 2021 | 34872347 |
| 8462 | 5 | 0.9978 | Comparative Genomics of Lactiplantibacillus plantarum: Insights Into Probiotic Markers in Strains Isolated From the Human Gastrointestinal Tract and Fermented Foods. Lactiplantibacillus (Lpb.) plantarum is a versatile species commonly found in a wide variety of ecological niches including dairy products and vegetables, while it may also occur as a natural inhabitant of the human gastrointestinal tract. Although Lpb. plantarum strains have been suggested to exert beneficial properties on their host, the precise mechanisms underlying these microbe-host interactions are still obscure. In this context, the genome-scale in silico analysis of putative probiotic bacteria represents a bottom-up approach to identify probiotic biomarkers, predict desirable functional properties, and identify potentially detrimental antibiotic resistance genes. In this study, we characterized the bacterial genomes of three Lpb. plantarum strains isolated from three distinct environments [strain IMC513 (from the human GIT), C904 (from table olives), and LT52 (from raw-milk cheese)]. A whole-genome sequencing was performed combining Illumina short reads with Oxford Nanopore long reads. The phylogenomic analyses suggested the highest relatedness between IMC513 and C904 strains which were both clade 4 strains, with LT52 positioned within clade 5 within the Lpb. plantarum species. The comparative genome analysis performed across several Lpb. plantarum representatives highlighted the genes involved in the key metabolic pathways as well as those encoding potential probiotic features in these new isolates. In particular, our strains varied significantly in genes encoding exopolysaccharide biosynthesis and in contrast to strains IMC513 and C904, the LT52 strain does not encode a Mannose-binding adhesion protein. The LT52 strain is also deficient in genes encoding complete pentose phosphate and the Embden-Meyerhof pathways. Finally, analyses using the CARD and ResFinder databases revealed that none of the strains encode known antibiotic resistance loci. Ultimately, the results provide better insights into the probiotic potential and safety of these three strains and indicate avenues for further mechanistic studies using these isolates. | 2022 | 35663852 |
| 9343 | 6 | 0.9978 | Origin of the bacterial SET domain genes: vertical or horizontal? The presence of Supressor of variegation-Enhanser of zeste-Trithorax (SET) domain genes in bacteria is a current paradigm for lateral genetic exchange between eukaryotes and prokaryotes. Because a major function of SET domain proteins is the chemical modification of chromatin and bacteria do not have chromatin, there is no apparent functional requirement for the existence of bacterial SET domain genes. Consequently, their finding in only a small fraction of pathogenic and symbiotic bacteria was taken as evidence that bacteria have obtained the SET domain genes from their hosts. Furthermore, it was proposed that the products of the genes would, most likely, be involved in bacteria-host interactions. The broadened scope of sequenced bacterial genomes to include also free-living and environmental species provided a larger sample to analyze the bacterial SET domain genes. By phylogenetic analysis, examination of individual chromosomal regions for signs of insertion, and evaluating the chromosomal versus SET domain genes' GC contents, we provide evidence that SET domain genes have existed in the bacterial domain of life independently of eukaryotes. The bacterial genes have undergone an evolution of their own unconnected to the evolution of the eukaryotic SET domain genes. Initial finding of SET domain genes in predominantly pathogenic and symbiotic bacteria resulted, most probably, from a biased sample. However, a lateral transfer of SET domain genes may have occurred between some bacteria and a family of Archaea. A model for the evolution and distribution of SET domain genes in bacteria is proposed. | 2007 | 17148507 |
| 4347 | 7 | 0.9978 | Going through phages: a computational approach to revealing the role of prophage in Staphylococcus aureus. Prophages have important roles in virulence, antibiotic resistance, and genome evolution in Staphylococcus aureus . Rapid growth in the number of sequenced S. aureus genomes allows for an investigation of prophage sequences at an unprecedented scale. We developed a novel computational pipeline for phage discovery and annotation. We combined PhiSpy, a phage discovery tool, with VGAS and PROKKA, genome annotation tools to detect and analyse prophage sequences in nearly 10 011 S . aureus genomes, discovering thousands of putative prophage sequences with genes encoding virulence factors and antibiotic resistance. To our knowledge, this is the first large-scale application of PhiSpy on a large-scale set of genomes (10 011 S . aureus ). Determining the presence of virulence and resistance encoding genes in prophage has implications for the potential transfer of these genes/functions to other bacteria via transduction and thus can provide insight into the evolution and spread of these genes/functions between bacterial strains. While the phage we have identified may be known, these phages were not necessarily known or characterized in S. aureus and the clustering and comparison we did for phage based on their gene content is novel. Moreover, the reporting of these genes with the S. aureus genomes is novel. | 2023 | 37424556 |
| 9073 | 8 | 0.9978 | EpitoCore: Mining Conserved Epitope Vaccine Candidates in the Core Proteome of Multiple Bacteria Strains. In reverse vaccinology approaches, complete proteomes of bacteria are submitted to multiple computational prediction steps in order to filter proteins that are possible vaccine candidates. Most available tools perform such analysis only in a single strain, or a very limited number of strains. But the vast amount of genomic data had shown that most bacteria contain pangenomes, i.e., their genomic information contains core, conserved genes, and random accessory genes specific to each strain. Therefore, in reverse vaccinology methods it is of the utmost importance to define core proteins and core epitopes. EpitoCore is a decision-tree pipeline developed to fulfill that need. It provides surfaceome prediction of proteins from related strains, defines core proteins within those, calculate their immunogenicity, predicts epitopes for a given set of MHC alleles defined by the user, and then reports if epitopes are located extracellularly and if they are conserved among the core homologs. Pipeline performance is illustrated by mining peptide vaccine candidates in Mycobacterium avium hominissuis strains. From a total proteome of ~4,800 proteins per strain, EpitoCore predicted 103 highly immunogenic core homologs located at cell surface, many of those related to virulence and drug resistance. Conserved epitopes identified among these homologs allows the users to define sets of peptides with potential to immunize the largest coverage of tested HLA alleles using peptide-based vaccines. Therefore, EpitoCore is able to provide automated identification of conserved epitopes in bacterial pangenomic datasets. | 2020 | 32431712 |
| 8378 | 9 | 0.9978 | Genome mining reveals unlocked bioactive potential of marine Gram-negative bacteria. BACKGROUND: Antibiotic resistance in bacteria spreads quickly, overtaking the pace at which new compounds are discovered and this emphasizes the immediate need to discover new compounds for control of infectious diseases. Terrestrial bacteria have for decades been investigated as a source of bioactive compounds leading to successful applications in pharmaceutical and biotech industries. Marine bacteria have so far not been exploited to the same extent; however, they are believed to harbor a multitude of novel bioactive chemistry. To explore this potential, genomes of 21 marine Alpha- and Gammaproteobacteria collected during the Galathea 3 expedition were sequenced and mined for natural product encoding gene clusters. RESULTS: Independently of genome size, bacteria of all tested genera carried a large number of clusters encoding different potential bioactivities, especially within the Vibrionaceae and Pseudoalteromonadaceae families. A very high potential was identified in pigmented pseudoalteromonads with up to 20 clusters in a single strain, mostly NRPSs and NRPS-PKS hybrids. Furthermore, regulatory elements in bioactivity-related pathways including chitin metabolism, quorum sensing and iron scavenging systems were investigated both in silico and in vitro. Genes with siderophore function were identified in 50% of the strains, however, all but one harboured the ferric-uptake-regulator gene. Genes encoding the syntethase of acylated homoserine lactones were found in Roseobacter-clade bacteria, but not in the Vibrionaceae strains and only in one Pseudoalteromonas strains. The understanding and manipulation of these elements can help in the discovery and production of new compounds never identified under regular laboratory cultivation conditions. High chitinolytic potential was demonstrated and verified for Vibrio and Pseudoalteromonas species that commonly live in close association with eukaryotic organisms in the environment. Chitin regulation by the ChiS histidine-kinase seems to be a general trait of the Vibrionaceae family, however it is absent in the Pseudomonadaceae. Hence, the degree to which chitin influences secondary metabolism in marine bacteria is not known. CONCLUSIONS: Utilizing the rapidly developing sequencing technologies and software tools in combination with phenotypic in vitro assays, we demonstrated the high bioactive potential of marine bacteria in an efficient, straightforward manner - an approach that will facilitate natural product discovery in the future. | 2015 | 25879706 |
| 9350 | 10 | 0.9978 | Genome DNA Sequence Variation, Evolution, and Function in Bacteria and Archaea. Comparative genomics has revealed that variations in bacterial and archaeal genome DNA sequences cannot be explained by only neutral mutations. Virus resistance and plasmid distribution systems have resulted in changes in bacterial and archaeal genome sequences during evolution. The restriction-modification system, a virus resistance system, leads to avoidance of palindromic DNA sequences in genomes. Clustered, regularly interspaced, short palindromic repeats (CRISPRs) found in genomes represent yet another virus resistance system. Comparative genomics has shown that bacteria and archaea have failed to gain any DNA with GC content higher than the GC content of their chromosomes. Thus, horizontally transferred DNA regions have lower GC content than the host chromosomal DNA does. Some nucleoid-associated proteins bind DNA regions with low GC content and inhibit the expression of genes contained in those regions. This form of gene repression is another type of virus resistance system. On the other hand, bacteria and archaea have used plasmids to gain additional genes. Virus resistance systems influence plasmid distribution. Interestingly, the restriction-modification system and nucleoid-associated protein genes have been distributed via plasmids. Thus, GC content and genomic signatures do not reflect bacterial and archaeal evolutionary relationships. | 2013 | 22772895 |
| 9850 | 11 | 0.9978 | Annotation and Comparative Genomics of Prokaryotic Transposable Elements. The data generated in nearly 30 years of bacterial genome sequencing has revealed the abundance of transposable elements (TE) and their importance in genome and transcript remodeling through the mediation of DNA insertions and deletions, structural rearrangements, and regulation of gene expression. Furthermore, what we have learned from studying transposition mechanisms and their regulation in bacterial TE is fundamental to our current understanding of TE in other organisms because much of what has been observed in bacteria is conserved in all domains of life. However, unlike eukaryotic TE, prokaryotic TE sequester and transmit important classes of genes that impact host fitness, such as resistance to antibiotics and heavy metals and virulence factors affecting animals and plants, among other acquired traits. This provides dynamism and plasticity to bacteria, which would otherwise be propagated clonally. The insertion sequences (IS), the simplest form of prokaryotic TE, are autonomous and compact mobile genetic elements. These can be organized into compound transposons, in which two similar IS can flank any DNA segment and render it transposable. Other more complex structures, called unit transposons, can be grouped into four major families (Tn3, Tn7, Tn402, Tn554) with specific genetic characteristics. This chapter will revisit the prominent structural features of these elements, focusing on a genomic annotation framework and comparative analysis. Relevant aspects of TE will also be presented, stressing their key position in genome impact and evolution, especially in the emergence of antimicrobial resistance and other adaptive traits. | 2024 | 38819561 |
| 3785 | 12 | 0.9978 | A network approach to decipher the dynamics of Lysobacteraceae plasmid gene sharing. Plasmids provide an efficient vehicle for gene sharing among bacterial populations, playing a key role in bacterial evolution. Network approaches are particularly suitable to represent multipartite relationships and are useful tools to characterize plasmid-mediated gene sharing events. The bacterial family Lysobacteraceae includes plant commensal, plant pathogenic and opportunistic human pathogens for which plasmid-mediated adaptation has been reported. We searched for homologues of plasmid gene sequences from this family in the entire diversity of available bacterial genome sequences and built a network of plasmid gene sharing from the results. While plasmid genes are openly shared between the bacteria of the family Lysobacteraceae, taxonomy strongly defined the boundaries of these exchanges, which only barely reached other families. Most inferred plasmid gene sharing events involved a few genes only, and evidence of full plasmid transfers were restricted to taxonomically closely related taxa. We detected multiple plasmid-chromosome gene transfers, including the known sharing of a heavy metal resistance transposon. In the network, bacterial lifestyles shaped substructures of isolates colonizing specific ecological niches and harbouring specific types of resistance genes. Genes associated with pathogenicity or antibiotic and metal resistance were among those that most importantly structured the network, highlighting the imprints of human-mediated selective pressure on pathogenic populations. A massive sequencing effort on environmental Lysobacteraceae is therefore required to refine our understanding of how this reservoir fuels the emergence and the spread of genes among this family and its potential impact on plant, animal and human health. | 2023 | 35593155 |
| 797 | 13 | 0.9978 | Increasing the PACE of characterising novel transporters by functional genomics. Since the late 1990's the genome sequences for thousands of species of bacteria have been released into public databases. The release of each new genome sequence typically revealed the presence of tens to hundreds of uncharacterised genes encoding putative membrane proteins and more recently, microbial metagenomics has revealed countless more of these uncharacterised genes. Given the importance of small molecule efflux in bacteria, it is likely that a significant proportion of these genes encode for novel efflux proteins, but the elucidation of these functions is challenging. We used transcriptomics to predict that the function of a gene encoding a hypothetical membrane protein is in efflux-mediated antimicrobial resistance. We subsequently confirmed this function and the likely native substrates of the pump by using detailed biochemical and biophysical analyses. Functional studies of homologs of the protein from other bacterial species determined that the protein is a prototype for a family of multidrug efflux pumps - the Proteobacterial Antimicrobial Compound Efflux (PACE) family. The general functional genomics approach used here, and its expansion to functional metagenomics, will very likely reveal the identities of more efflux pumps and other transport proteins of scientific, clinical and commercial interest in the future. | 2021 | 34492595 |
| 9068 | 14 | 0.9978 | TnCentral: a Prokaryotic Transposable Element Database and Web Portal for Transposon Analysis. We describe here the structure and organization of TnCentral (https://tncentral.proteininformationresource.org/ [or the mirror link at https://tncentral.ncc.unesp.br/]), a web resource for prokaryotic transposable elements (TE). TnCentral currently contains ∼400 carefully annotated TE, including transposons from the Tn3, Tn7, Tn402, and Tn554 families; compound transposons; integrons; and associated insertion sequences (IS). These TE carry passenger genes, including genes conferring resistance to over 25 classes of antibiotics and nine types of heavy metal, as well as genes responsible for pathogenesis in plants, toxin/antitoxin gene pairs, transcription factors, and genes involved in metabolism. Each TE has its own entry page, providing details about its transposition genes, passenger genes, and other sequence features required for transposition, as well as a graphical map of all features. TnCentral content can be browsed and queried through text- and sequence-based searches with a graphic output. We describe three use cases, which illustrate how the search interface, results tables, and entry pages can be used to explore and compare TE. TnCentral also includes downloadable software to facilitate user-driven identification, with manual annotation, of certain types of TE in genomic sequences. Through the TnCentral homepage, users can also access TnPedia, which provides comprehensive reviews of the major TE families, including an extensive general section and specialized sections with descriptions of insertion sequence and transposon families. TnCentral and TnPedia are intuitive resources that can be used by clinicians and scientists to assess TE diversity in clinical, veterinary, and environmental samples. IMPORTANCE The ability of bacteria to undergo rapid evolution and adapt to changing environmental circumstances drives the public health crisis of multiple antibiotic resistance, as well as outbreaks of disease in economically important agricultural crops and animal husbandry. Prokaryotic transposable elements (TE) play a critical role in this. Many carry "passenger genes" (not required for the transposition process) conferring resistance to antibiotics or heavy metals or causing disease in plants and animals. Passenger genes are spread by normal TE transposition activities and by insertion into plasmids, which then spread via conjugation within and across bacterial populations. Thus, an understanding of TE composition and transposition mechanisms is key to developing strategies to combat bacterial pathogenesis. Toward this end, we have developed TnCentral, a bioinformatics resource dedicated to describing and exploring the structural and functional features of prokaryotic TE whose use is intuitive and accessible to users with or without bioinformatics expertise. | 2021 | 34517763 |
| 4371 | 15 | 0.9978 | Independent origins and evolution of the secondary replicons of the class Gammaproteobacteria. Multipartite genomes, consisting of more than one replicon, have been found in approximately 10 % of bacteria, many of which belong to the phylum Proteobacteria. Many aspects of their origin and evolution, and the possible advantages related to this type of genome structure, remain to be elucidated. Here, we performed a systematic analysis of the presence and distribution of multipartite genomes in the class Gammaproteobacteria, which includes several genera with diverse lifestyles. Within this class, multipartite genomes are mainly found in the order Alteromonadales (mostly in the genus Pseudoalteromonas) and in the family Vibrionaceae. Our data suggest that the emergence of secondary replicons in Gammaproteobacteria is rare and that they derive from plasmids. Despite their multiple origins, we highlighted the presence of evolutionary trends such as the inverse proportionality of the genome to chromosome size ratio, which appears to be a general feature of bacteria with multipartite genomes irrespective of taxonomic group. We also highlighted some functional trends. The core gene set of the secondary replicons is extremely small, probably limited to essential genes or genes that favour their maintenance in the genome, while the other genes are less conserved. This hypothesis agrees with the idea that the primary advantage of secondary replicons could be to facilitate gene acquisition through horizontal gene transfer, resulting in replicons enriched in genes associated with adaptation to different ecological niches. Indeed, secondary replicons are enriched both in genes that could promote adaptation to harsh environments, such as those involved in antibiotic, biocide and metal resistance, and in functional categories related to the exploitation of environmental resources (e.g. carbohydrates), which can complement chromosomal functions. | 2023 | 37185344 |
| 4375 | 16 | 0.9978 | Evidence of a large novel gene pool associated with prokaryotic genomic islands. Microbial genes that are "novel" (no detectable homologs in other species) have become of increasing interest as environmental sampling suggests that there are many more such novel genes in yet-to-be-cultured microorganisms. By analyzing known microbial genomic islands and prophages, we developed criteria for systematic identification of putative genomic islands (clusters of genes of probable horizontal origin in a prokaryotic genome) in 63 prokaryotic genomes, and then characterized the distribution of novel genes and other features. All but a few of the genomes examined contained significantly higher proportions of novel genes in their predicted genomic islands compared with the rest of their genome (Paired t test = 4.43E-14 to 1.27E-18, depending on method). Moreover, the reverse observation (i.e., higher proportions of novel genes outside of islands) never reached statistical significance in any organism examined. We show that this higher proportion of novel genes in predicted genomic islands is not due to less accurate gene prediction in genomic island regions, but likely reflects a genuine increase in novel genes in these regions for both bacteria and archaea. This represents the first comprehensive analysis of novel genes in prokaryotic genomic islands and provides clues regarding the origin of novel genes. Our collective results imply that there are different gene pools associated with recently horizontally transmitted genomic regions versus regions that are primarily vertically inherited. Moreover, there are more novel genes within the gene pool associated with genomic islands. Since genomic islands are frequently associated with a particular microbial adaptation, such as antibiotic resistance, pathogen virulence, or metal resistance, this suggests that microbes may have access to a larger "arsenal" of novel genes for adaptation than previously thought. | 2005 | 16299586 |
| 8383 | 17 | 0.9977 | Novel insights into carbohydrate utilisation, antimicrobial resistance, and sporulation potential in Roseburia intestinalis isolates across diverse geographical locations. Roseburia intestinalis is one of the most abundant and important butyrate-producing human gut anaerobic bacteria that plays an important role in maintaining health and is a potential next-generation probiotic. We investigated the pangenome of 16 distinct strains, isolated over several decades, identifying local and time-specific adaptations. More than 50% of the genes in each individual strain were assigned to the core genome, and 77% of the cloud genes were unique to individual strains, revealing the high level of genome conservation. Co-carriage of the same enzymes involved in carbohydrate binding and degradation in all strains highlighted major pathways in carbohydrate utilization and reveal the importance of xylan, starch and mannose as key growth substrates. A single strain had adapted to use rhamnose as a sole growth substrate, the first time this has been reported. The ubiquitous presence of motility and sporulation gene clusters demonstrates the importance of these phenotypes for gut survival and acquisition of this bacterium. More than half the strains contained functional, potentially transferable, tetracycline resistance genes. This study advances our understanding of the importance of R. intestinalis within the gut ecosystem by elucidating conserved metabolic characteristics among different strains, isolated from different locations. This information will help to devise dietary strategies to increase the abundance of this species providing health benefits. | 2025 | 40089923 |
| 9234 | 18 | 0.9977 | CRISPR provides acquired resistance against viruses in prokaryotes. Clustered regularly interspaced short palindromic repeats (CRISPR) are a distinctive feature of the genomes of most Bacteria and Archaea and are thought to be involved in resistance to bacteriophages. We found that, after viral challenge, bacteria integrated new spacers derived from phage genomic sequences. Removal or addition of particular spacers modified the phage-resistance phenotype of the cell. Thus, CRISPR, together with associated cas genes, provided resistance against phages, and resistance specificity is determined by spacer-phage sequence similarity. | 2007 | 17379808 |
| 9665 | 19 | 0.9977 | Time-calibrated genomic evolution of a monomorphic bacterium during its establishment as an endemic crop pathogen. Horizontal gene transfer is of major evolutionary importance as it allows for the redistribution of phenotypically important genes among lineages. Such genes with essential functions include those involved in resistance to antimicrobial compounds and virulence factors in pathogenic bacteria. Understanding gene turnover at microevolutionary scales is critical to assess the pace of this evolutionary process. Here, we characterized and quantified gene turnover for the epidemic lineage of a bacterial plant pathogen of major agricultural importance worldwide. Relying on a dense geographic sampling spanning 39 years of evolution, we estimated both the dynamics of single nucleotide polymorphism accumulation and gene content turnover. We identified extensive gene content variation among lineages even at the smallest phylogenetic and geographic scales. Gene turnover rate exceeded nucleotide substitution rate by three orders of magnitude. Accessory genes were found preferentially located on plasmids, but we identified a highly plastic chromosomal region hosting ecologically important genes such as transcription activator-like effectors. Whereas most changes in the gene content are probably transient, the rapid spread of a mobile element conferring resistance to copper compounds widely used for the management of plant bacterial pathogens illustrates how some accessory genes can become ubiquitous within a population over short timeframes. | 2021 | 33305421 |