Systematic artifacts in metagenomes from complex microbial communities


Ontology type: schema:ScholarlyArticle      Open Access: True


Article Info

DATE

2009-11

AUTHORS

Vicente Gomez-Alvarez, Tracy K Teal, Thomas M Schmidt

ABSTRACT

Metagenomics is providing an unprecedented view of the taxonomic diversity, metabolic potential and ecological role of microbial communities in biomes as diverse as the mammalian gastrointestinal tract, the marine water column and soils. However, we have found a systematic error in metagenomes generated by 454-based pyrosequencing that leads to an overestimation of gene and taxon abundance; between 11% and 35% of sequences in a typical metagenome are artificial replicates. Here we document the error in several published and original datasets and offer a web-based solution (http://microbiomes.msu.edu/replicates) for identifying and removing these artifacts.

PAGES

ismej200972

Identifiers

URI

http://scigraph.springernature.com/pub.10.1038/ismej.2009.72

DOI

http://dx.doi.org/10.1038/ismej.2009.72

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1023938739

PUBMED

https://www.ncbi.nlm.nih.gov/pubmed/19587772



JSON-LD is the canonical representation for SciGraph data.


[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0605", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Microbiology", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Biological Sciences", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Base Sequence", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Databases, Genetic", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Metagenome", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Metagenomics", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Molecular Sequence Data", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Sequence Alignment", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Sequence Analysis, DNA", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Soil Microbiology", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Michigan State University", 
          "id": "https://www.grid.ac/institutes/grid.17088.36", 
          "name": [
            "Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Gomez-Alvarez", 
        "givenName": "Vicente", 
        "id": "sg:person.01152341741.73", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01152341741.73"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Michigan State University", 
          "id": "https://www.grid.ac/institutes/grid.17088.36", 
          "name": [
            "Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Teal", 
        "givenName": "Tracy K", 
        "id": "sg:person.0663631777.21", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0663631777.21"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Michigan State University", 
          "id": "https://www.grid.ac/institutes/grid.17088.36", 
          "name": [
            "Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, USA", 
            "Kellogg Biological Station, Michigan State University, East Lansing, MI, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Schmidt", 
        "givenName": "Thomas M", 
        "id": "sg:person.013423766702.17", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013423766702.17"
        ], 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "https://doi.org/10.1371/journal.pone.0002527", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1001679264"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/1471-2105-9-386", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1006083026", 
          "https://doi.org/10.1186/1471-2105-9-386"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1073/pnas.0605127103", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1008547462"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1093/bioinformatics/btl158", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1014668137"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1073/pnas.0708897105", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1016844456"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1371/journal.pone.0003375", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1017375866"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nature05414", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1023893418", 
          "https://doi.org/10.1038/nature05414"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1073/pnas.0711303105", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1025703877"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1073/pnas.0704665104", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1045401726"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nature06810", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1047805213", 
          "https://doi.org/10.1038/nature06810"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nature06513", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1052335368", 
          "https://doi.org/10.1038/nature06513"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://app.dimensions.ai/details/publication/pub.1082873120", 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2009-11", 
    "datePublishedReg": "2009-11-01", 
    "description": "Metagenomics is providing an unprecedented view of the taxonomic diversity, metabolic potential and ecological role of microbial communities in biomes as diverse as the mammalian gastrointestinal tract, the marine water column and soils. However, we have found a systematic error in metagenomes generated by 454-based pyrosequencing that leads to an overestimation of gene and taxon abundance; between 11% and 35% of sequences in a typical metagenome are artificial replicates. Here we document the error in several published and original datasets and offer a web-based solution (http://microbiomes.msu.edu/replicates) for identifying and removing these artifacts.", 
    "genre": "research_article", 
    "id": "sg:pub.10.1038/ismej.2009.72", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": true, 
    "isFundedItemOf": [
      {
        "id": "sg:grant.3082408", 
        "type": "MonetaryGrant"
      }
    ], 
    "isPartOf": [
      {
        "id": "sg:journal.1038436", 
        "issn": [
          "1751-7362", 
          "1751-7370"
        ], 
        "name": "The ISME Journal", 
        "type": "Periodical"
      }, 
      {
        "issueNumber": "11", 
        "type": "PublicationIssue"
      }, 
      {
        "type": "PublicationVolume", 
        "volumeNumber": "3"
      }
    ], 
    "name": "Systematic artifacts in metagenomes from complex microbial communities", 
    "pagination": "ismej200972", 
    "productId": [
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "720be557efd7eec1988cf78605840f675032ca24b22a1318c661637bb527102a"
        ]
      }, 
      {
        "name": "pubmed_id", 
        "type": "PropertyValue", 
        "value": [
          "19587772"
        ]
      }, 
      {
        "name": "nlm_unique_id", 
        "type": "PropertyValue", 
        "value": [
          "101301086"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1038/ismej.2009.72"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1023938739"
        ]
      }
    ], 
    "sameAs": [
      "https://doi.org/10.1038/ismej.2009.72", 
      "https://app.dimensions.ai/details/publication/pub.1023938739"
    ], 
    "sdDataset": "articles", 
    "sdDatePublished": "2019-04-10T12:58", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8659_00000424.jsonl", 
    "type": "ScholarlyArticle", 
    "url": "http://www.nature.com/articles/ismej200972"
  }
]
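The record above is plain JSON, so its main fields can be read with a standard JSON parser. The following is a minimal sketch, not part of SciGraph itself: the helper name `summarize` is hypothetical, and the embedded sample is an abbreviated copy of the record that keeps only the fields the sketch reads.

```python
import json

# Abbreviated copy of the JSON-LD record shown above (assumption: only the
# fields used below are kept; the full record has many more keys).
RECORD_JSON = """
[
  {
    "author": [
      {"familyName": "Gomez-Alvarez", "givenName": "Vicente"},
      {"familyName": "Teal", "givenName": "Tracy K"},
      {"familyName": "Schmidt", "givenName": "Thomas M"}
    ],
    "datePublished": "2009-11",
    "name": "Systematic artifacts in metagenomes from complex microbial communities",
    "productId": [
      {"name": "pubmed_id", "type": "PropertyValue", "value": ["19587772"]},
      {"name": "doi", "type": "PropertyValue", "value": ["10.1038/ismej.2009.72"]}
    ]
  }
]
"""


def summarize(record_json):
    """Return title, authors, date and identifiers from a SciGraph-style record.

    `summarize` is a hypothetical helper written for this sketch.
    """
    # The top level is a one-element list wrapping the article object.
    record = json.loads(record_json)[0]
    # Each productId entry is a name plus a list of values; keep the first value.
    ids = {p["name"]: p["value"][0] for p in record.get("productId", [])}
    authors = [
        "{} {}".format(a["givenName"], a["familyName"])
        for a in record.get("author", [])
    ]
    return {
        "title": record["name"],
        "authors": authors,
        "date": record.get("datePublished"),
        "doi": ids.get("doi"),
        "pubmed_id": ids.get("pubmed_id"),
    }


summary = summarize(RECORD_JSON)
print(summary["title"])
print(", ".join(summary["authors"]))
print(summary["doi"], summary["pubmed_id"])
```

Identifiers are looked up by the `name` of each `productId` entry rather than by position, since the order of entries in the list carries no meaning.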
 


HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1038/ismej.2009.72'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1038/ismej.2009.72'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1038/ismej.2009.72'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1038/ismej.2009.72'
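The four curl calls above differ only in their Accept header. As a rough equivalent in Python, the sketch below builds the same requests with the standard library. The names `ACCEPT`, `build_request` and `fetch` are hypothetical helpers, the short format keys are labels chosen for this sketch, and whether the endpoint still responds is an assumption, so `fetch` is defined but not called here.

```python
import urllib.request

# URL taken from the curl examples above.
BASE_URL = "https://scigraph.springernature.com/pub.10.1038/ismej.2009.72"

# Media types copied from the curl examples; the short keys are labels
# chosen for this sketch.
ACCEPT = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(fmt, url=BASE_URL):
    """Build a GET request asking for one RDF serialization (hypothetical helper)."""
    return urllib.request.Request(url, headers={"Accept": ACCEPT[fmt]})


def fetch(fmt, url=BASE_URL, timeout=30):
    """Send the request and return the response body as text.

    Assumes the service is reachable; not called in this sketch.
    """
    with urllib.request.urlopen(build_request(fmt, url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


req = build_request("turtle")
print(req.full_url)
print(req.get_header("Accept"))  # prints: text/turtle
```

Calling `fetch("json-ld")` would then correspond to the first curl command, provided the service is available.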


 

This table displays all metadata directly associated with this object as RDF triples.

157 TRIPLES      21 PREDICATES      49 URIs      29 LITERALS      17 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1038/ismej.2009.72 schema:about N3fc41b9f81cb4abeb2e80e7515614801
2 N405fdd4b1638464b9dd80a09570cbdbe
3 N6732862bda65496094f9c033ae99048b
4 Na3664215cd264278adc13be816d95de8
5 Nbf7614d9e3b34c709092743f08a40eef
6 Nd096a2904f324d56ba03e29078d0e786
7 Ne12260b12fdd457db28addd85818eba8
8 Neacd0dad922844dd916311792833121e
9 anzsrc-for:06
10 anzsrc-for:0605
11 schema:author N0b8c67fc185445a8a5a188b028b14605
12 schema:citation sg:pub.10.1038/nature05414
13 sg:pub.10.1038/nature06513
14 sg:pub.10.1038/nature06810
15 sg:pub.10.1186/1471-2105-9-386
16 https://app.dimensions.ai/details/publication/pub.1082873120
17 https://doi.org/10.1073/pnas.0605127103
18 https://doi.org/10.1073/pnas.0704665104
19 https://doi.org/10.1073/pnas.0708897105
20 https://doi.org/10.1073/pnas.0711303105
21 https://doi.org/10.1093/bioinformatics/btl158
22 https://doi.org/10.1371/journal.pone.0002527
23 https://doi.org/10.1371/journal.pone.0003375
24 schema:datePublished 2009-11
25 schema:datePublishedReg 2009-11-01
26 schema:description Metagenomics is providing an unprecedented view of the taxonomic diversity, metabolic potential and ecological role of microbial communities in biomes as diverse as the mammalian gastrointestinal tract, the marine water column and soils. However, we have found a systematic error in metagenomes generated by 454-based pyrosequencing that leads to an overestimation of gene and taxon abundance; between 11% and 35% of sequences in a typical metagenome are artificial replicates. Here we document the error in several published and original datasets and offer a web-based solution (http://microbiomes.msu.edu/replicates) for identifying and removing these artifacts.
27 schema:genre research_article
28 schema:inLanguage en
29 schema:isAccessibleForFree true
30 schema:isPartOf Nadc1323412854eefaa77d0d676fd2bf8
31 Nceaabd0c8ef5469686704646eb15cbb5
32 sg:journal.1038436
33 schema:name Systematic artifacts in metagenomes from complex microbial communities
34 schema:pagination ismej200972
35 schema:productId N3a7724366cf94d458529d7615927539d
36 N85f1793268c247e39318dda8e88141a4
37 N90845df003b941e4908be8dff1e9015a
38 N9655c8c6178e4ce6a98b61729a4e0fee
39 Ndef0e39e53ae408d9af6ea4a37d8e373
40 schema:sameAs https://app.dimensions.ai/details/publication/pub.1023938739
41 https://doi.org/10.1038/ismej.2009.72
42 schema:sdDatePublished 2019-04-10T12:58
43 schema:sdLicense https://scigraph.springernature.com/explorer/license/
44 schema:sdPublisher N9806e941cbb1420dbb5f66770140da14
45 schema:url http://www.nature.com/articles/ismej200972
46 sgo:license sg:explorer/license/
47 sgo:sdDataset articles
48 rdf:type schema:ScholarlyArticle
49 N0b8c67fc185445a8a5a188b028b14605 rdf:first sg:person.01152341741.73
50 rdf:rest N9096f66bbba94e65b5d69dd9f9b1ed2d
51 N3a7724366cf94d458529d7615927539d schema:name nlm_unique_id
52 schema:value 101301086
53 rdf:type schema:PropertyValue
54 N3fc41b9f81cb4abeb2e80e7515614801 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
55 schema:name Soil Microbiology
56 rdf:type schema:DefinedTerm
57 N405fdd4b1638464b9dd80a09570cbdbe schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
58 schema:name Metagenome
59 rdf:type schema:DefinedTerm
60 N6732862bda65496094f9c033ae99048b schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
61 schema:name Sequence Analysis, DNA
62 rdf:type schema:DefinedTerm
63 N79bc72f1efef470a98e47600ff6663f0 rdf:first sg:person.013423766702.17
64 rdf:rest rdf:nil
65 N85f1793268c247e39318dda8e88141a4 schema:name readcube_id
66 schema:value 720be557efd7eec1988cf78605840f675032ca24b22a1318c661637bb527102a
67 rdf:type schema:PropertyValue
68 N90845df003b941e4908be8dff1e9015a schema:name pubmed_id
69 schema:value 19587772
70 rdf:type schema:PropertyValue
71 N9096f66bbba94e65b5d69dd9f9b1ed2d rdf:first sg:person.0663631777.21
72 rdf:rest N79bc72f1efef470a98e47600ff6663f0
73 N9655c8c6178e4ce6a98b61729a4e0fee schema:name doi
74 schema:value 10.1038/ismej.2009.72
75 rdf:type schema:PropertyValue
76 N9806e941cbb1420dbb5f66770140da14 schema:name Springer Nature - SN SciGraph project
77 rdf:type schema:Organization
78 Na3664215cd264278adc13be816d95de8 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
79 schema:name Molecular Sequence Data
80 rdf:type schema:DefinedTerm
81 Nadc1323412854eefaa77d0d676fd2bf8 schema:volumeNumber 3
82 rdf:type schema:PublicationVolume
83 Nbf7614d9e3b34c709092743f08a40eef schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
84 schema:name Base Sequence
85 rdf:type schema:DefinedTerm
86 Nceaabd0c8ef5469686704646eb15cbb5 schema:issueNumber 11
87 rdf:type schema:PublicationIssue
88 Nd096a2904f324d56ba03e29078d0e786 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
89 schema:name Sequence Alignment
90 rdf:type schema:DefinedTerm
91 Ndef0e39e53ae408d9af6ea4a37d8e373 schema:name dimensions_id
92 schema:value pub.1023938739
93 rdf:type schema:PropertyValue
94 Ne12260b12fdd457db28addd85818eba8 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
95 schema:name Databases, Genetic
96 rdf:type schema:DefinedTerm
97 Neacd0dad922844dd916311792833121e schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
98 schema:name Metagenomics
99 rdf:type schema:DefinedTerm
100 anzsrc-for:06 schema:inDefinedTermSet anzsrc-for:
101 schema:name Biological Sciences
102 rdf:type schema:DefinedTerm
103 anzsrc-for:0605 schema:inDefinedTermSet anzsrc-for:
104 schema:name Microbiology
105 rdf:type schema:DefinedTerm
106 sg:grant.3082408 http://pending.schema.org/fundedItem sg:pub.10.1038/ismej.2009.72
107 rdf:type schema:MonetaryGrant
108 sg:journal.1038436 schema:issn 1751-7362
109 1751-7370
110 schema:name The ISME Journal
111 rdf:type schema:Periodical
112 sg:person.01152341741.73 schema:affiliation https://www.grid.ac/institutes/grid.17088.36
113 schema:familyName Gomez-Alvarez
114 schema:givenName Vicente
115 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01152341741.73
116 rdf:type schema:Person
117 sg:person.013423766702.17 schema:affiliation https://www.grid.ac/institutes/grid.17088.36
118 schema:familyName Schmidt
119 schema:givenName Thomas M
120 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013423766702.17
121 rdf:type schema:Person
122 sg:person.0663631777.21 schema:affiliation https://www.grid.ac/institutes/grid.17088.36
123 schema:familyName Teal
124 schema:givenName Tracy K
125 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0663631777.21
126 rdf:type schema:Person
127 sg:pub.10.1038/nature05414 schema:sameAs https://app.dimensions.ai/details/publication/pub.1023893418
128 https://doi.org/10.1038/nature05414
129 rdf:type schema:CreativeWork
130 sg:pub.10.1038/nature06513 schema:sameAs https://app.dimensions.ai/details/publication/pub.1052335368
131 https://doi.org/10.1038/nature06513
132 rdf:type schema:CreativeWork
133 sg:pub.10.1038/nature06810 schema:sameAs https://app.dimensions.ai/details/publication/pub.1047805213
134 https://doi.org/10.1038/nature06810
135 rdf:type schema:CreativeWork
136 sg:pub.10.1186/1471-2105-9-386 schema:sameAs https://app.dimensions.ai/details/publication/pub.1006083026
137 https://doi.org/10.1186/1471-2105-9-386
138 rdf:type schema:CreativeWork
139 https://app.dimensions.ai/details/publication/pub.1082873120 rdf:type schema:CreativeWork
140 https://doi.org/10.1073/pnas.0605127103 schema:sameAs https://app.dimensions.ai/details/publication/pub.1008547462
141 rdf:type schema:CreativeWork
142 https://doi.org/10.1073/pnas.0704665104 schema:sameAs https://app.dimensions.ai/details/publication/pub.1045401726
143 rdf:type schema:CreativeWork
144 https://doi.org/10.1073/pnas.0708897105 schema:sameAs https://app.dimensions.ai/details/publication/pub.1016844456
145 rdf:type schema:CreativeWork
146 https://doi.org/10.1073/pnas.0711303105 schema:sameAs https://app.dimensions.ai/details/publication/pub.1025703877
147 rdf:type schema:CreativeWork
148 https://doi.org/10.1093/bioinformatics/btl158 schema:sameAs https://app.dimensions.ai/details/publication/pub.1014668137
149 rdf:type schema:CreativeWork
150 https://doi.org/10.1371/journal.pone.0002527 schema:sameAs https://app.dimensions.ai/details/publication/pub.1001679264
151 rdf:type schema:CreativeWork
152 https://doi.org/10.1371/journal.pone.0003375 schema:sameAs https://app.dimensions.ai/details/publication/pub.1017375866
153 rdf:type schema:CreativeWork
154 https://www.grid.ac/institutes/grid.17088.36 schema:alternateName Michigan State University
155 schema:name Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, USA
156 Kellogg Biological Station, Michigan State University, East Lansing, MI, USA
157 rdf:type schema:Organization
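In the table above, author order is not stored on the article node directly. Row 11 points schema:author at a blank node, and rows 49-50, 71-72 and 63-64 chain three blank nodes together with rdf:first and rdf:rest until rdf:nil. The sketch below shows one way to walk such a chain. The dictionaries are hand-copied from those rows and from the schema:familyName rows, and `walk_rdf_list` is a hypothetical helper, not a SciGraph or RDF library function.

```python
RDF_NIL = "rdf:nil"

# Blank node taken from row 11 (schema:author).
AUTHOR_LIST_HEAD = "N0b8c67fc185445a8a5a188b028b14605"

# node -> (rdf:first, rdf:rest), hand-copied from rows 49-50, 71-72 and 63-64.
LIST_NODES = {
    "N0b8c67fc185445a8a5a188b028b14605": (
        "sg:person.01152341741.73", "N9096f66bbba94e65b5d69dd9f9b1ed2d"),
    "N9096f66bbba94e65b5d69dd9f9b1ed2d": (
        "sg:person.0663631777.21", "N79bc72f1efef470a98e47600ff6663f0"),
    "N79bc72f1efef470a98e47600ff6663f0": (
        "sg:person.013423766702.17", RDF_NIL),
}

# schema:familyName values from rows 113, 118 and 123.
FAMILY_NAMES = {
    "sg:person.01152341741.73": "Gomez-Alvarez",
    "sg:person.013423766702.17": "Schmidt",
    "sg:person.0663631777.21": "Teal",
}


def walk_rdf_list(head, nodes):
    """Follow rdf:rest links from `head` and collect each rdf:first value."""
    items = []
    seen = set()
    node = head
    while node != RDF_NIL:
        if node in seen:
            # Guard against a malformed, cyclic list.
            raise ValueError("cycle in RDF list at " + node)
        seen.add(node)
        first, rest = nodes[node]
        items.append(first)
        node = rest
    return items


ordered_ids = walk_rdf_list(AUTHOR_LIST_HEAD, LIST_NODES)
ordered_names = [FAMILY_NAMES[person_id] for person_id in ordered_ids]
print(ordered_names)
```

The recovered order matches the AUTHORS line at the top of the record, even though the person rows in the table are sorted by identifier rather than by authorship position.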
 



