Dark matter in archaeal genomes: a rich source of novel mobile elements, defense systems and secretory complexes


Ontology type: schema:ScholarlyArticle
Open Access: True


Article Info

DATE

2014-08-12

AUTHORS

Kira S. Makarova, Yuri I. Wolf, Patrick Forterre, David Prangishvili, Mart Krupovic, Eugene V. Koonin

ABSTRACT

Microbial genomes encompass a sizable fraction of poorly characterized, narrowly spread fast-evolving genes. Using sensitive methods for sequence comparison and protein structure prediction, we performed a detailed comparative analysis of clusters of such genes, which we denote “dark matter islands”, in archaeal genomes. The dark matter islands comprise up to 20% of archaeal genomes and show remarkable heterogeneity and diversity. Nevertheless, three classes of entities are common in these genomic loci: (a) integrated viral genomes and other mobile elements; (b) defense systems, and (c) secretory and other membrane-associated systems. The dark matter islands in the genomes of thermophiles and mesophiles show similar general trends of gene content, but thermophiles are substantially enriched in predicted membrane proteins whereas mesophiles have a greater proportion of recognizable mobile elements. Based on this analysis, we predict the existence of several novel groups of viruses and mobile elements, previously unnoticed variants of CRISPR-Cas immune systems, and new secretory systems that might be involved in stress response, intermicrobial conflicts and biogenesis of novel, uncharacterized membrane structures.

PAGES

877-893

References to SciGraph publications

  • 2012-12-14. Updated clusters of orthologous genes for Archaea: a complex ancestor of the Archaea and the byways of horizontal gene transfer in BIOLOGY DIRECT
  • 2011-07-14. Unification of Cas protein families and a simple scenario for the origin and evolution of CRISPR-Cas systems in BIOLOGY DIRECT
  • 2008-12. Virus evolution: how far does the double β-barrel viral lineage extend? in NATURE REVIEWS MICROBIOLOGY
  • 2010-02-02. CRISPR interference: RNA-directed adaptive immunity in bacteria and archaea in NATURE REVIEWS GENETICS
  • 2002-06. Filamentous phage integration requires the host recombinases XerC and XerD in NATURE
  • 2011-05-09. Evolution and classification of the CRISPR–Cas systems in NATURE REVIEWS MICROBIOLOGY
  • 2014-02-23. Molecular biology of fuselloviruses and their satellites in EXTREMOPHILES
  • 2011-09-29. SignalP 4.0: discriminating signal peptides from transmembrane regions in NATURE METHODS
  • 2006-08-16. On the origin of microbial ORFans: quantifying the strength of the evidence for viral lateral transfer in BMC ECOLOGY AND EVOLUTION
  • 2014-02-16. DNA-guided DNA interference by a prokaryotic Argonaute in NATURE
  • 2004-01-18. Structure of a bifunctional DNA primase-polymerase in NATURE STRUCTURAL & MOLECULAR BIOLOGY
  • 2014-01-07. Gammasphaerolipovirus, a newly proposed bacteriophage genus, unifies viruses of halophilic archaea and thermophilic bacteria within the novel family Sphaerolipoviridae in ARCHIVES OF VIROLOGY
  • 2009-06-16. A hidden reservoir of integrative elements is the major source of recently acquired foreign genes and ORFans in archaeal and bacterial genomes in GENOME BIOLOGY
  • 2006-03-16. A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action in BIOLOGY DIRECT
  • 2007-11-27. Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea in BIOLOGY DIRECT
  • 2005-08-21. Genome sequence of the chlorinated compound–respiring bacterium Dehalococcoides species strain CBDB1 in NATURE BIOTECHNOLOGY
  • 2009-06-03. Comprehensive comparative-genomic analysis of Type 2 toxin-antitoxin systems and related mobile stress response systems in prokaryotes in BIOLOGY DIRECT
  • 2006-09-19. The ancient Virus World and evolution of cells in BIOLOGY DIRECT
  • 2014-04-02. A guide to genome engineering with programmable nucleases in NATURE REVIEWS GENETICS
  • 1987-09. Gene expression in archaebacteria: Physical mapping of constitutive and UV-inducible transcripts from the Sulfolobus virus-like particle SSV1 in MOLECULAR GENETICS AND GENOMICS
  • 2000-06. Who's your neighbor? New computational approaches for functional genomics in NATURE BIOTECHNOLOGY
Identifiers

    URI

    http://scigraph.springernature.com/pub.10.1007/s00792-014-0672-7

    DOI

    http://dx.doi.org/10.1007/s00792-014-0672-7

    DIMENSIONS

    https://app.dimensions.ai/details/publication/pub.1021608471

    PUBMED

    https://www.ncbi.nlm.nih.gov/pubmed/25113822



    JSON-LD is the canonical representation for SciGraph data.

    TIP: You can open this SciGraph record using an external JSON-LD service such as JSON-LD Playground or Google SDTT.

    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Biological Sciences", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Genetics", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Adaptation, Physiological", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Amino Acid Sequence", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Archaea", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Archaeal Proteins", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Genome, Archaeal", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Hot Temperature", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Interspersed Repetitive Sequences", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Membrane Proteins", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Molecular Sequence Data", 
            "type": "DefinedTerm"
          }
        ], 
        "author": [
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, 20894, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, 20894, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Makarova", 
            "givenName": "Kira S.", 
            "id": "sg:person.0676725351.01", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0676725351.01"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, 20894, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, 20894, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Wolf", 
            "givenName": "Yuri I.", 
            "id": "sg:person.0634453251.89", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0634453251.89"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Institut Pasteur, Unit\u00e9 Biologie Mol\u00e9culaire Du G\u00e8ne Chez Les Extr\u00eamophiles, 25 Rue Du Docteur Roux, 75015, Paris, France", 
              "id": "http://www.grid.ac/institutes/grid.428999.7", 
              "name": [
                "Institut Pasteur, Unit\u00e9 Biologie Mol\u00e9culaire Du G\u00e8ne Chez Les Extr\u00eamophiles, 25 Rue Du Docteur Roux, 75015, Paris, France"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Forterre", 
            "givenName": "Patrick", 
            "id": "sg:person.01170427166.75", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01170427166.75"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Institut Pasteur, Unit\u00e9 Biologie Mol\u00e9culaire Du G\u00e8ne Chez Les Extr\u00eamophiles, 25 Rue Du Docteur Roux, 75015, Paris, France", 
              "id": "http://www.grid.ac/institutes/grid.428999.7", 
              "name": [
                "Institut Pasteur, Unit\u00e9 Biologie Mol\u00e9culaire Du G\u00e8ne Chez Les Extr\u00eamophiles, 25 Rue Du Docteur Roux, 75015, Paris, France"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Prangishvili", 
            "givenName": "David", 
            "id": "sg:person.01332500224.95", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01332500224.95"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Institut Pasteur, Unit\u00e9 Biologie Mol\u00e9culaire Du G\u00e8ne Chez Les Extr\u00eamophiles, 25 Rue Du Docteur Roux, 75015, Paris, France", 
              "id": "http://www.grid.ac/institutes/grid.428999.7", 
              "name": [
                "Institut Pasteur, Unit\u00e9 Biologie Mol\u00e9culaire Du G\u00e8ne Chez Les Extr\u00eamophiles, 25 Rue Du Docteur Roux, 75015, Paris, France"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Krupovic", 
            "givenName": "Mart", 
            "id": "sg:person.01032701646.39", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01032701646.39"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, 20894, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, 20894, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Koonin", 
            "givenName": "Eugene V.", 
            "id": "sg:person.01017015051.78", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01017015051.78"
            ], 
            "type": "Person"
          }
        ], 
        "citation": [
          {
            "id": "sg:pub.10.1186/1745-6150-4-19", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1025245856", 
              "https://doi.org/10.1186/1745-6150-4-19"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1745-6150-1-7", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1011010591", 
              "https://doi.org/10.1186/1745-6150-1-7"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nrg2749", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1005460200", 
              "https://doi.org/10.1038/nrg2749"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature12971", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1052410148", 
              "https://doi.org/10.1038/nature12971"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1007/bf00329653", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1015787158", 
              "https://doi.org/10.1007/bf00329653"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1745-6150-6-38", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1036092892", 
              "https://doi.org/10.1186/1745-6150-6-38"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature00782", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1052419004", 
              "https://doi.org/10.1038/nature00782"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1471-2148-6-63", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1045973194", 
              "https://doi.org/10.1186/1471-2148-6-63"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nrmicro2033", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1002207899", 
              "https://doi.org/10.1038/nrmicro2033"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1007/s00705-013-1970-6", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1031112115", 
              "https://doi.org/10.1007/s00705-013-1970-6"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nmeth.1701", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1049894869", 
              "https://doi.org/10.1038/nmeth.1701"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nsmb723", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1006561522", 
              "https://doi.org/10.1038/nsmb723"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nbt1131", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1012802082", 
              "https://doi.org/10.1038/nbt1131"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1745-6150-2-33", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1004145791", 
              "https://doi.org/10.1186/1745-6150-2-33"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1745-6150-7-46", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1010731156", 
              "https://doi.org/10.1186/1745-6150-7-46"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/76443", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1013118779", 
              "https://doi.org/10.1038/76443"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nrg3686", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1016455001", 
              "https://doi.org/10.1038/nrg3686"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/gb-2009-10-6-r65", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1043023899", 
              "https://doi.org/10.1186/gb-2009-10-6-r65"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1007/s00792-014-0634-0", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1024493732", 
              "https://doi.org/10.1007/s00792-014-0634-0"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nrmicro2577", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1020345829", 
              "https://doi.org/10.1038/nrmicro2577"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1745-6150-1-29", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1034391309", 
              "https://doi.org/10.1186/1745-6150-1-29"
            ], 
            "type": "CreativeWork"
          }
        ], 
        "datePublished": "2014-08-12", 
        "datePublishedReg": "2014-08-12", 
        "description": "Microbial genomes encompass a sizable fraction of poorly characterized, narrowly spread fast-evolving genes. Using sensitive methods for sequences comparison and protein structure prediction, we performed a detailed comparative analysis of clusters of such genes, which we denote \u201cdark matter islands\u201d, in archaeal genomes. The dark matter islands comprise up to 20\u00a0% of archaeal genomes and show remarkable heterogeneity and diversity. Nevertheless, three classes of entities are common in these genomic loci: (a) integrated viral genomes and other mobile elements; (b) defense systems, and (c) secretory and other membrane-associated systems. The dark matter islands in the genome of thermophiles and mesophiles show similar general trends of gene content, but thermophiles are substantially enriched in predicted membrane proteins whereas mesophiles have a greater proportion of recognizable mobile elements. Based on this analysis, we predict the existence of several novel groups of viruses and mobile elements, previously unnoticed variants of CRISPR-Cas immune systems, and new secretory systems that might be involved in stress response, intermicrobial conflicts and biogenesis of novel, uncharacterized membrane structures.", 
        "genre": "article", 
        "id": "sg:pub.10.1007/s00792-014-0672-7", 
        "inLanguage": "en", 
        "isAccessibleForFree": true, 
        "isFundedItemOf": [
          {
            "id": "sg:grant.2726032", 
            "type": "MonetaryGrant"
          }, 
          {
            "id": "sg:grant.2726029", 
            "type": "MonetaryGrant"
          }, 
          {
            "id": "sg:grant.3800492", 
            "type": "MonetaryGrant"
          }
        ], 
        "isPartOf": [
          {
            "id": "sg:journal.1116821", 
            "issn": [
              "1431-0651", 
              "1433-4909"
            ], 
            "name": "Extremophiles", 
            "publisher": "Springer Nature", 
            "type": "Periodical"
          }, 
          {
            "issueNumber": "5", 
            "type": "PublicationIssue"
          }, 
          {
            "type": "PublicationVolume", 
            "volumeNumber": "18"
          }
        ], 
        "keywords": [
          "archaeal genomes", 
          "mobile elements", 
          "CRISPR-Cas immune systems", 
          "defense system", 
          "microbial genomes", 
          "genomic loci", 
          "gene content", 
          "such genes", 
          "protein structure prediction", 
          "membrane-associated systems", 
          "novel mobile elements", 
          "membrane proteins", 
          "sequence comparison", 
          "secretory complex", 
          "genome", 
          "stress response", 
          "secretory system", 
          "viral genome", 
          "novel group", 
          "structure prediction", 
          "membrane structure", 
          "thermophiles", 
          "genes", 
          "mesophiles", 
          "detailed comparative analysis", 
          "islands", 
          "remarkable heterogeneity", 
          "rich source", 
          "biogenesis", 
          "immune system", 
          "loci", 
          "protein", 
          "diversity", 
          "comparative analysis", 
          "secretory", 
          "sizable fraction", 
          "complexes", 
          "greater proportion", 
          "variants", 
          "virus", 
          "elements", 
          "general trend", 
          "clusters", 
          "analysis", 
          "similar general trends", 
          "response", 
          "heterogeneity", 
          "sensitive method", 
          "structure", 
          "content", 
          "proportion", 
          "system", 
          "fraction", 
          "class", 
          "source", 
          "existence", 
          "comparison", 
          "prediction", 
          "group", 
          "matter", 
          "trends", 
          "classes of entities", 
          "entities", 
          "method", 
          "conflict", 
          "dark matter"
        ], 
        "name": "Dark matter in archaeal genomes: a rich source of novel mobile elements, defense systems and secretory complexes", 
        "pagination": "877-893", 
        "productId": [
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "pub.1021608471"
            ]
          }, 
          {
            "name": "doi", 
            "type": "PropertyValue", 
            "value": [
              "10.1007/s00792-014-0672-7"
            ]
          }, 
          {
            "name": "pubmed_id", 
            "type": "PropertyValue", 
            "value": [
              "25113822"
            ]
          }
        ], 
        "sameAs": [
          "https://doi.org/10.1007/s00792-014-0672-7", 
          "https://app.dimensions.ai/details/publication/pub.1021608471"
        ], 
        "sdDataset": "articles", 
        "sdDatePublished": "2022-05-20T07:29", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-springernature-scigraph/baseset/20220519/entities/gbq_results/article/article_615.jsonl", 
        "type": "ScholarlyArticle", 
        "url": "https://doi.org/10.1007/s00792-014-0672-7"
      }
    ]
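As a rough illustration of how the JSON-LD record above can be read with ordinary JSON tooling, the following Python sketch pulls out the title, author names and identifiers. The `RECORD` string is a hand-trimmed excerpt of the record (only two of the six authors are kept), and the helper name `summarize` is hypothetical, not part of any SciGraph API.

```python
import json

# Hand-trimmed excerpt of the JSON-LD record above; two of six authors kept.
RECORD = """
[
  {
    "author": [
      {"familyName": "Makarova", "givenName": "Kira S.", "type": "Person"},
      {"familyName": "Koonin", "givenName": "Eugene V.", "type": "Person"}
    ],
    "name": "Dark matter in archaeal genomes: a rich source of novel mobile elements, defense systems and secretory complexes",
    "pagination": "877-893",
    "productId": [
      {"name": "dimensions_id", "type": "PropertyValue", "value": ["pub.1021608471"]},
      {"name": "doi", "type": "PropertyValue", "value": ["10.1007/s00792-014-0672-7"]},
      {"name": "pubmed_id", "type": "PropertyValue", "value": ["25113822"]}
    ],
    "type": "ScholarlyArticle"
  }
]
"""


def summarize(record_text):
    """Return title, author names and identifiers from a SciGraph-style JSON-LD array.

    Hypothetical helper: it assumes the top-level value is a one-element list,
    as in the record shown above.
    """
    article = json.loads(record_text)[0]
    # Join given and family names in the order the record lists them.
    authors = [f"{a['givenName']} {a['familyName']}" for a in article.get("author", [])]
    # Each productId entry carries a name and a one-element list of values.
    identifiers = {p["name"]: p["value"][0] for p in article.get("productId", [])}
    return {"title": article["name"], "authors": authors, "identifiers": identifiers}


summary = summarize(RECORD)
print(summary["authors"])
print(summary["identifiers"]["doi"])
```

Because JSON-LD is plain JSON, no RDF library is needed for this kind of lookup; a JSON-LD processor would only be required to expand the `@context` into full URIs.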
     


    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/s00792-014-0672-7'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/s00792-014-0672-7'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/s00792-014-0672-7'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/s00792-014-0672-7'
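The four curl commands above differ only in the `Accept` header, so the same content negotiation can be sketched in Python with the standard library. The short format keys (`json-ld`, `nt`, `turtle`, `xml`) and the helper names `build_request` and `fetch` are assumptions made for illustration. The sketch only builds a request object; it does not check whether the endpoint still responds.

```python
import urllib.request

BASE = "https://scigraph.springernature.com/"

# Media types copied from the curl commands above; the short keys are
# hypothetical labels chosen for this sketch.
ACCEPT = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(record_id, fmt):
    """Build (but do not send) a GET request asking for one RDF serialization."""
    if fmt not in ACCEPT:
        raise ValueError(f"unknown format: {fmt!r}")
    return urllib.request.Request(BASE + record_id, headers={"Accept": ACCEPT[fmt]})


def fetch(record_id, fmt, timeout=30):
    """Send the request and return the response body as text (needs network access)."""
    with urllib.request.urlopen(build_request(record_id, fmt), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


req = build_request("pub.10.1007/s00792-014-0672-7", "turtle")
print(req.full_url)
print(req.get_header("Accept"))
```

Calling `fetch("pub.10.1007/s00792-014-0672-7", "nt")` would then correspond to the N-Triples curl command, assuming the service is reachable.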


     

    This table displays all metadata directly associated with this object as RDF triples.

    292 TRIPLES, 22 PREDICATES, 122 URIs, 93 LITERALS, 16 BLANK NODES
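Summary counts of this kind could, in principle, be recomputed from the N-Triples serialization, which puts one triple on each line. The Python sketch below is a minimal line-based tally, not a full N-Triples parser, and how SciGraph itself derives its figures is not stated here, so the numbers need not match exactly. The `SAMPLE` text is hand-written from values in this record (pagination and volume number), and the blank node label `_:b0` is made up.

```python
# Hand-written sample; the blank node label _:b0 is hypothetical.
SAMPLE = """
<http://scigraph.springernature.com/pub.10.1007/s00792-014-0672-7> <http://schema.org/pagination> "877-893" .
<http://scigraph.springernature.com/pub.10.1007/s00792-014-0672-7> <http://schema.org/isPartOf> _:b0 .
_:b0 <http://schema.org/volumeNumber> "18" .
_:b0 <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://schema.org/PublicationVolume> .
"""


def term_kind(term):
    """Classify an N-Triples term by its leading characters."""
    if term.startswith("<"):
        return "uri"
    if term.startswith("_:"):
        return "blank_node"
    return "literal"


def summarize_ntriples(text):
    """Count triples and distinct predicates, URIs, literals and blank nodes."""
    seen = {"uri": set(), "blank_node": set(), "literal": set()}
    predicates = set()
    triples = 0
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        # Subject and predicate never contain whitespace; the object may.
        subject, predicate, rest = line.split(None, 2)
        # Drop the terminating " ." that closes every triple.
        obj = rest[:-1].rstrip() if rest.endswith(".") else rest
        triples += 1
        predicates.add(predicate)
        for term in (subject, predicate, obj):
            seen[term_kind(term)].add(term)
    return {
        "triples": triples,
        "predicates": len(predicates),
        "uris": len(seen["uri"]),
        "literals": len(seen["literal"]),
        "blank_nodes": len(seen["blank_node"]),
    }


counts = summarize_ntriples(SAMPLE)
print(counts)
```

For real data an RDF library would be the safer choice, since it also handles escapes, language tags and datatype suffixes that this sketch treats as opaque text.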

    Subject Predicate Object
    1 sg:pub.10.1007/s00792-014-0672-7 schema:about N1bf1cbcc27f041eb9aa132f950f2973f
    2 N4f64f6726d7c409faecf18c2bdc0dd17
    3 N57ad8ad074414e5ba74681be84e9e30b
    4 Naeb230e5abcd42aaaead91c46ea7a2ad
    5 Nb5fd2bb7a2c140968dff57ef1d10b87d
    6 Nc036d966fe6d4b70909206f524bcd5b8
    7 Ne61b383a57f2438d88d5b0c19b44c0ec
    8 Nf3ebb85385e04d169cb4b541cdbcaa00
    9 Nffaa1084401347b090c1edeb6203f4c9
    10 anzsrc-for:06
    11 anzsrc-for:0604
    12 schema:author Na91b3203cb584255b04a6c116dda2b57
    13 schema:citation sg:pub.10.1007/bf00329653
    14 sg:pub.10.1007/s00705-013-1970-6
    15 sg:pub.10.1007/s00792-014-0634-0
    16 sg:pub.10.1038/76443
    17 sg:pub.10.1038/nature00782
    18 sg:pub.10.1038/nature12971
    19 sg:pub.10.1038/nbt1131
    20 sg:pub.10.1038/nmeth.1701
    21 sg:pub.10.1038/nrg2749
    22 sg:pub.10.1038/nrg3686
    23 sg:pub.10.1038/nrmicro2033
    24 sg:pub.10.1038/nrmicro2577
    25 sg:pub.10.1038/nsmb723
    26 sg:pub.10.1186/1471-2148-6-63
    27 sg:pub.10.1186/1745-6150-1-29
    28 sg:pub.10.1186/1745-6150-1-7
    29 sg:pub.10.1186/1745-6150-2-33
    30 sg:pub.10.1186/1745-6150-4-19
    31 sg:pub.10.1186/1745-6150-6-38
    32 sg:pub.10.1186/1745-6150-7-46
    33 sg:pub.10.1186/gb-2009-10-6-r65
    34 schema:datePublished 2014-08-12
    35 schema:datePublishedReg 2014-08-12
    36 schema:description Microbial genomes encompass a sizable fraction of poorly characterized, narrowly spread fast-evolving genes. Using sensitive methods for sequences comparison and protein structure prediction, we performed a detailed comparative analysis of clusters of such genes, which we denote “dark matter islands”, in archaeal genomes. The dark matter islands comprise up to 20 % of archaeal genomes and show remarkable heterogeneity and diversity. Nevertheless, three classes of entities are common in these genomic loci: (a) integrated viral genomes and other mobile elements; (b) defense systems, and (c) secretory and other membrane-associated systems. The dark matter islands in the genome of thermophiles and mesophiles show similar general trends of gene content, but thermophiles are substantially enriched in predicted membrane proteins whereas mesophiles have a greater proportion of recognizable mobile elements. Based on this analysis, we predict the existence of several novel groups of viruses and mobile elements, previously unnoticed variants of CRISPR-Cas immune systems, and new secretory systems that might be involved in stress response, intermicrobial conflicts and biogenesis of novel, uncharacterized membrane structures.
    37 schema:genre article
    38 schema:inLanguage en
    39 schema:isAccessibleForFree true
    40 schema:isPartOf N005805d82ad141298978ffa113516830
    41 N066e79cd8b3645d39500cd0c84426760
    42 sg:journal.1116821
    43 schema:keywords CRISPR-Cas immune systems
    44 analysis
    45 archaeal genomes
    46 biogenesis
    47 class
    48 classes of entities
    49 clusters
    50 comparative analysis
    51 comparison
    52 complexes
    53 conflict
    54 content
    55 dark matter
    56 defense system
    57 detailed comparative analysis
    58 diversity
    59 elements
    60 entities
    61 existence
    62 fraction
    63 gene content
    64 general trend
    65 genes
    66 genome
    67 genomic loci
    68 greater proportion
    69 group
    70 heterogeneity
    71 immune system
    72 islands
    73 loci
    74 matter
    75 membrane proteins
    76 membrane structure
    77 membrane-associated systems
    78 mesophiles
    79 method
    80 microbial genomes
    81 mobile elements
    82 novel group
    83 novel mobile elements
    84 prediction
    85 proportion
    86 protein
    87 protein structure prediction
    88 remarkable heterogeneity
    89 response
    90 rich source
    91 secretory
    92 secretory complex
    93 secretory system
    94 sensitive method
    95 sequence comparison
    96 similar general trends
    97 sizable fraction
    98 source
    99 stress response
    100 structure
    101 structure prediction
    102 such genes
    103 system
    104 thermophiles
    105 trends
    106 variants
    107 viral genome
    108 virus
    109 schema:name Dark matter in archaeal genomes: a rich source of novel mobile elements, defense systems and secretory complexes
    110 schema:pagination 877-893
    111 schema:productId N08506957343f48ac9bf28db9f68be624
    112 N4857d38b96c44916b6de562c488566b0
    113 Nbfe19654787e49bfaad47e907fc7b80e
    114 schema:sameAs https://app.dimensions.ai/details/publication/pub.1021608471
    115 https://doi.org/10.1007/s00792-014-0672-7
    116 schema:sdDatePublished 2022-05-20T07:29
    117 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    118 schema:sdPublisher N098b24e33c094fc797c584cb9c8ac324
    119 schema:url https://doi.org/10.1007/s00792-014-0672-7
    120 sgo:license sg:explorer/license/
    121 sgo:sdDataset articles
    122 rdf:type schema:ScholarlyArticle
    123 N005805d82ad141298978ffa113516830 schema:issueNumber 5
    124 rdf:type schema:PublicationIssue
    125 N066e79cd8b3645d39500cd0c84426760 schema:volumeNumber 18
    126 rdf:type schema:PublicationVolume
    N07eb57c5997b4d52b47cbcb033ad8420 rdf:first sg:person.01032701646.39
    rdf:rest N4c91030ca92648b5bceab7620d346df4
    N08506957343f48ac9bf28db9f68be624 schema:name dimensions_id
    schema:value pub.1021608471
    rdf:type schema:PropertyValue
    N098b24e33c094fc797c584cb9c8ac324 schema:name Springer Nature - SN SciGraph project
    rdf:type schema:Organization
    N1bf1cbcc27f041eb9aa132f950f2973f schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    schema:name Archaeal Proteins
    rdf:type schema:DefinedTerm
    N1d9599d9afb746f89f5ade9005fd8a84 rdf:first sg:person.01170427166.75
    rdf:rest Nff90fe07a30849098d3ac7f687ab0de0
    N39f127f9191e417285c32536ff443503 rdf:first sg:person.0634453251.89
    rdf:rest N1d9599d9afb746f89f5ade9005fd8a84
    N4857d38b96c44916b6de562c488566b0 schema:name pubmed_id
    schema:value 25113822
    rdf:type schema:PropertyValue
    N4c91030ca92648b5bceab7620d346df4 rdf:first sg:person.01017015051.78
    rdf:rest rdf:nil
    N4f64f6726d7c409faecf18c2bdc0dd17 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    schema:name Molecular Sequence Data
    rdf:type schema:DefinedTerm
    N57ad8ad074414e5ba74681be84e9e30b schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    schema:name Archaea
    rdf:type schema:DefinedTerm
    Na91b3203cb584255b04a6c116dda2b57 rdf:first sg:person.0676725351.01
    rdf:rest N39f127f9191e417285c32536ff443503
    Naeb230e5abcd42aaaead91c46ea7a2ad schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    schema:name Membrane Proteins
    rdf:type schema:DefinedTerm
    Nb5fd2bb7a2c140968dff57ef1d10b87d schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    schema:name Amino Acid Sequence
    rdf:type schema:DefinedTerm
    Nbfe19654787e49bfaad47e907fc7b80e schema:name doi
    schema:value 10.1007/s00792-014-0672-7
    rdf:type schema:PropertyValue
    Nc036d966fe6d4b70909206f524bcd5b8 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    schema:name Genome, Archaeal
    rdf:type schema:DefinedTerm
    Ne61b383a57f2438d88d5b0c19b44c0ec schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    schema:name Hot Temperature
    rdf:type schema:DefinedTerm
    Nf3ebb85385e04d169cb4b541cdbcaa00 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    schema:name Adaptation, Physiological
    rdf:type schema:DefinedTerm
    Nff90fe07a30849098d3ac7f687ab0de0 rdf:first sg:person.01332500224.95
    rdf:rest N07eb57c5997b4d52b47cbcb033ad8420
    Nffaa1084401347b090c1edeb6203f4c9 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    schema:name Interspersed Repetitive Sequences
    rdf:type schema:DefinedTerm
    anzsrc-for:06 schema:inDefinedTermSet anzsrc-for:
    schema:name Biological Sciences
    rdf:type schema:DefinedTerm
    anzsrc-for:0604 schema:inDefinedTermSet anzsrc-for:
    schema:name Genetics
    rdf:type schema:DefinedTerm
    sg:grant.2726029 http://pending.schema.org/fundedItem sg:pub.10.1007/s00792-014-0672-7
    rdf:type schema:MonetaryGrant
    sg:grant.2726032 http://pending.schema.org/fundedItem sg:pub.10.1007/s00792-014-0672-7
    rdf:type schema:MonetaryGrant
    sg:grant.3800492 http://pending.schema.org/fundedItem sg:pub.10.1007/s00792-014-0672-7
    rdf:type schema:MonetaryGrant
    sg:journal.1116821 schema:issn 1431-0651
    1433-4909
    schema:name Extremophiles
    schema:publisher Springer Nature
    rdf:type schema:Periodical
    sg:person.01017015051.78 schema:affiliation grid-institutes:grid.419234.9
    schema:familyName Koonin
    schema:givenName Eugene V.
    schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01017015051.78
    rdf:type schema:Person
    sg:person.01032701646.39 schema:affiliation grid-institutes:grid.428999.7
    schema:familyName Krupovic
    schema:givenName Mart
    schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01032701646.39
    rdf:type schema:Person
    sg:person.01170427166.75 schema:affiliation grid-institutes:grid.428999.7
    schema:familyName Forterre
    schema:givenName Patrick
    schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01170427166.75
    rdf:type schema:Person
    sg:person.01332500224.95 schema:affiliation grid-institutes:grid.428999.7
    schema:familyName Prangishvili
    schema:givenName David
    schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01332500224.95
    rdf:type schema:Person
    sg:person.0634453251.89 schema:affiliation grid-institutes:grid.419234.9
    schema:familyName Wolf
    schema:givenName Yuri I.
    schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0634453251.89
    rdf:type schema:Person
    sg:person.0676725351.01 schema:affiliation grid-institutes:grid.419234.9
    schema:familyName Makarova
    schema:givenName Kira S.
    schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0676725351.01
    rdf:type schema:Person
    sg:pub.10.1007/bf00329653 schema:sameAs https://app.dimensions.ai/details/publication/pub.1015787158
    https://doi.org/10.1007/bf00329653
    rdf:type schema:CreativeWork
    sg:pub.10.1007/s00705-013-1970-6 schema:sameAs https://app.dimensions.ai/details/publication/pub.1031112115
    https://doi.org/10.1007/s00705-013-1970-6
    rdf:type schema:CreativeWork
    sg:pub.10.1007/s00792-014-0634-0 schema:sameAs https://app.dimensions.ai/details/publication/pub.1024493732
    https://doi.org/10.1007/s00792-014-0634-0
    rdf:type schema:CreativeWork
    sg:pub.10.1038/76443 schema:sameAs https://app.dimensions.ai/details/publication/pub.1013118779
    https://doi.org/10.1038/76443
    rdf:type schema:CreativeWork
    sg:pub.10.1038/nature00782 schema:sameAs https://app.dimensions.ai/details/publication/pub.1052419004
    https://doi.org/10.1038/nature00782
    rdf:type schema:CreativeWork
    sg:pub.10.1038/nature12971 schema:sameAs https://app.dimensions.ai/details/publication/pub.1052410148
    https://doi.org/10.1038/nature12971
    rdf:type schema:CreativeWork
    sg:pub.10.1038/nbt1131 schema:sameAs https://app.dimensions.ai/details/publication/pub.1012802082
    https://doi.org/10.1038/nbt1131
    rdf:type schema:CreativeWork
    sg:pub.10.1038/nmeth.1701 schema:sameAs https://app.dimensions.ai/details/publication/pub.1049894869
    https://doi.org/10.1038/nmeth.1701
    rdf:type schema:CreativeWork
    sg:pub.10.1038/nrg2749 schema:sameAs https://app.dimensions.ai/details/publication/pub.1005460200
    https://doi.org/10.1038/nrg2749
    rdf:type schema:CreativeWork
    sg:pub.10.1038/nrg3686 schema:sameAs https://app.dimensions.ai/details/publication/pub.1016455001
    https://doi.org/10.1038/nrg3686
    rdf:type schema:CreativeWork
    sg:pub.10.1038/nrmicro2033 schema:sameAs https://app.dimensions.ai/details/publication/pub.1002207899
    https://doi.org/10.1038/nrmicro2033
    rdf:type schema:CreativeWork
    sg:pub.10.1038/nrmicro2577 schema:sameAs https://app.dimensions.ai/details/publication/pub.1020345829
    https://doi.org/10.1038/nrmicro2577
    rdf:type schema:CreativeWork
    sg:pub.10.1038/nsmb723 schema:sameAs https://app.dimensions.ai/details/publication/pub.1006561522
    https://doi.org/10.1038/nsmb723
    rdf:type schema:CreativeWork
    sg:pub.10.1186/1471-2148-6-63 schema:sameAs https://app.dimensions.ai/details/publication/pub.1045973194
    https://doi.org/10.1186/1471-2148-6-63
    rdf:type schema:CreativeWork
    sg:pub.10.1186/1745-6150-1-29 schema:sameAs https://app.dimensions.ai/details/publication/pub.1034391309
    https://doi.org/10.1186/1745-6150-1-29
    rdf:type schema:CreativeWork
    sg:pub.10.1186/1745-6150-1-7 schema:sameAs https://app.dimensions.ai/details/publication/pub.1011010591
    https://doi.org/10.1186/1745-6150-1-7
    rdf:type schema:CreativeWork
    sg:pub.10.1186/1745-6150-2-33 schema:sameAs https://app.dimensions.ai/details/publication/pub.1004145791
    https://doi.org/10.1186/1745-6150-2-33
    rdf:type schema:CreativeWork
    sg:pub.10.1186/1745-6150-4-19 schema:sameAs https://app.dimensions.ai/details/publication/pub.1025245856
    https://doi.org/10.1186/1745-6150-4-19
    rdf:type schema:CreativeWork
    sg:pub.10.1186/1745-6150-6-38 schema:sameAs https://app.dimensions.ai/details/publication/pub.1036092892
    https://doi.org/10.1186/1745-6150-6-38
    rdf:type schema:CreativeWork
    sg:pub.10.1186/1745-6150-7-46 schema:sameAs https://app.dimensions.ai/details/publication/pub.1010731156
    https://doi.org/10.1186/1745-6150-7-46
    rdf:type schema:CreativeWork
    sg:pub.10.1186/gb-2009-10-6-r65 schema:sameAs https://app.dimensions.ai/details/publication/pub.1043023899
    https://doi.org/10.1186/gb-2009-10-6-r65
    rdf:type schema:CreativeWork
    grid-institutes:grid.419234.9 schema:alternateName National Center for Biotechnology Information, National Library of Medicine, 20894, Bethesda, MD, USA
    schema:name National Center for Biotechnology Information, National Library of Medicine, 20894, Bethesda, MD, USA
    rdf:type schema:Organization
    grid-institutes:grid.428999.7 schema:alternateName Institut Pasteur, Unité Biologie Moléculaire Du Gène Chez Les Extrêmophiles, 25 Rue Du Docteur Roux, 75015, Paris, France
    schema:name Institut Pasteur, Unité Biologie Moléculaire Du Gène Chez Les Extrêmophiles, 25 Rue Du Docteur Roux, 75015, Paris, France
    rdf:type schema:Organization
     



