Comparison of normalization methods for the analysis of metagenomic gene abundance data View Full Text


Ontology type: schema:ScholarlyArticle      Open Access: True


Article Info

DATE

2018-04-20

AUTHORS

Mariana Buongermino Pereira, Mikael Wallroth, Viktor Jonsson, Erik Kristiansson

ABSTRACT

BackgroundIn shotgun metagenomics, microbial communities are studied through direct sequencing of DNA without any prior cultivation. By comparing gene abundances estimated from the generated sequencing reads, functional differences between the communities can be identified. However, gene abundance data is affected by high levels of systematic variability, which can greatly reduce the statistical power and introduce false positives. Normalization, which is the process where systematic variability is identified and removed, is therefore a vital part of the data analysis. A wide range of normalization methods for high-dimensional count data has been proposed but their performance on the analysis of shotgun metagenomic data has not been evaluated.ResultsHere, we present a systematic evaluation of nine normalization methods for gene abundance data. The methods were evaluated through resampling of three comprehensive datasets, creating a realistic setting that preserved the unique characteristics of metagenomic data. Performance was measured in terms of the methods ability to identify differentially abundant genes (DAGs), correctly calculate unbiased p-values and control the false discovery rate (FDR). Our results showed that the choice of normalization method has a large impact on the end results. When the DAGs were asymmetrically present between the experimental conditions, many normalization methods had a reduced true positive rate (TPR) and a high false positive rate (FPR). The methods trimmed mean of M-values (TMM) and relative log expression (RLE) had the overall highest performance and are therefore recommended for the analysis of gene abundance data. For larger sample sizes, CSS also showed satisfactory performance.ConclusionsThis study emphasizes the importance of selecting a suitable normalization methods in the analysis of data from shotgun metagenomics. Our results also demonstrate that improper methods may result in unacceptably high levels of false positives, which in turn may lead to incorrect or obfuscated biological interpretation. More... »

PAGES

274

References to SciGraph publications

  • 2012-09-26. A metagenome-wide association study of gut microbiota in type 2 diabetes in NATURE
  • 2017-04-06. Comparative metagenomics reveals insights into the deep-sea adaptation mechanism of the microorganisms in Iheya hydrothermal fields in WORLD JOURNAL OF MICROBIOLOGY AND BIOTECHNOLOGY
  • 2013-10-20. Metagenomic species profiling using universal phylogenetic marker genes in NATURE METHODS
  • 2013-05-29. Gut metagenome in European women with normal, impaired and diabetic glucose control in NATURE
  • 2014-08-24. A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium in NATURE BIOTECHNOLOGY
  • 2005-08-01. Metagenomics for studying unculturable microorganisms: cutting the Gordian knot in GENOME BIOLOGY
  • 2017-04-11. Intestinal microbiome in children with severe and complicated acute viral gastroenteritis in SCIENTIFIC REPORTS
  • 2010-10-27. Differential expression analysis for sequence count data in GENOME BIOLOGY
  • 2014-12-05. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 in GENOME BIOLOGY
  • 2008-09-19. The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes in BMC BIOINFORMATICS
  • 2010-03-02. A scaling normalization method for differential expression analysis of RNA-seq data in GENOME BIOLOGY
  • 2015-09-07. Tentacle: distributed quantification of genes in metagenomes in GIGASCIENCE
  • 2016-01-25. Statistical evaluation of methods for identification of differentially abundant genes in comparative metagenomics in BMC GENOMICS
  • 2012-05-09. Human gut microbiome viewed across age and geography in NATURE
  • 2013-09-29. Differential abundance analysis for microbial marker-gene surveys in NATURE METHODS
  • 2010-02-18. Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments in BMC BIOINFORMATICS
  • 2017-03-03. Normalization and microbial differential abundance strategies depend upon data characteristics in MICROBIOME
  • 2015-03-25. MUSiCC: a marker genes based framework for metagenomic normalization and accurate profiling of gene abundances in the microbiome in GENOME BIOLOGY
  • 2017-04-21. HirBin: high-resolution identification of differentially abundant functions in metagenomes in BMC GENOMICS
  • Identifiers

    URI

    http://scigraph.springernature.com/pub.10.1186/s12864-018-4637-6

    DOI

    http://dx.doi.org/10.1186/s12864-018-4637-6

    DIMENSIONS

    https://app.dimensions.ai/details/publication/pub.1103494409

    PUBMED

    https://www.ncbi.nlm.nih.gov/pubmed/29678163


    Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
    Incoming Citations Browse incoming citations for this publication using opencitations.net

    JSON-LD is the canonical representation for SciGraph data.

    TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Biological Sciences", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Genetics", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Data Analysis", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Metagenomics", 
            "type": "DefinedTerm"
          }
        ], 
        "author": [
          {
            "affiliation": {
              "alternateName": "Department of Mathematical Sciences, Chalmers University of Technology and University of Gothenburg, SE-412 96, Gothenburg, Sweden", 
              "id": "http://www.grid.ac/institutes/grid.8761.8", 
              "name": [
                "Department of Mathematical Sciences, Chalmers University of Technology and University of Gothenburg, SE-412 96, Gothenburg, Sweden"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Pereira", 
            "givenName": "Mariana Buongermino", 
            "id": "sg:person.0656367524.09", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0656367524.09"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Department of Mathematical Sciences, Chalmers University of Technology and University of Gothenburg, SE-412 96, Gothenburg, Sweden", 
              "id": "http://www.grid.ac/institutes/grid.8761.8", 
              "name": [
                "Department of Mathematical Sciences, Chalmers University of Technology and University of Gothenburg, SE-412 96, Gothenburg, Sweden"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Wallroth", 
            "givenName": "Mikael", 
            "id": "sg:person.011360455425.69", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011360455425.69"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Department of Mathematical Sciences, Chalmers University of Technology and University of Gothenburg, SE-412 96, Gothenburg, Sweden", 
              "id": "http://www.grid.ac/institutes/grid.8761.8", 
              "name": [
                "Department of Mathematical Sciences, Chalmers University of Technology and University of Gothenburg, SE-412 96, Gothenburg, Sweden"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Jonsson", 
            "givenName": "Viktor", 
            "id": "sg:person.0761362305.11", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0761362305.11"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Department of Mathematical Sciences, Chalmers University of Technology and University of Gothenburg, SE-412 96, Gothenburg, Sweden", 
              "id": "http://www.grid.ac/institutes/grid.8761.8", 
              "name": [
                "Department of Mathematical Sciences, Chalmers University of Technology and University of Gothenburg, SE-412 96, Gothenburg, Sweden"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Kristiansson", 
            "givenName": "Erik", 
            "id": "sg:person.01051113471.17", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01051113471.17"
            ], 
            "type": "Person"
          }
        ], 
        "citation": [
          {
            "id": "sg:pub.10.1038/nmeth.2658", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1002139060", 
              "https://doi.org/10.1038/nmeth.2658"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/s13059-015-0610-8", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1029050385", 
              "https://doi.org/10.1186/s13059-015-0610-8"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/s13742-015-0078-1", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1027641814", 
              "https://doi.org/10.1186/s13742-015-0078-1"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature11450", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1004546178", 
              "https://doi.org/10.1038/nature11450"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/srep46130", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1084748698", 
              "https://doi.org/10.1038/srep46130"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/s12864-016-2386-y", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1020096515", 
              "https://doi.org/10.1186/s12864-016-2386-y"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature11053", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1052378845", 
              "https://doi.org/10.1038/nature11053"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/gb-2010-11-10-r106", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1031289083", 
              "https://doi.org/10.1186/gb-2010-11-10-r106"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1471-2105-9-386", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1006083026", 
              "https://doi.org/10.1186/1471-2105-9-386"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature12198", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1002791386", 
              "https://doi.org/10.1038/nature12198"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nmeth.2693", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1028082738", 
              "https://doi.org/10.1038/nmeth.2693"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/s13059-014-0550-8", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1015222646", 
              "https://doi.org/10.1186/s13059-014-0550-8"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/s40168-017-0237-y", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1084252802", 
              "https://doi.org/10.1186/s40168-017-0237-y"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1471-2105-11-94", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1053091615", 
              "https://doi.org/10.1186/1471-2105-11-94"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nbt.2957", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1027683701", 
              "https://doi.org/10.1038/nbt.2957"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/s12864-017-3686-6", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1084954596", 
              "https://doi.org/10.1186/s12864-017-3686-6"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1007/s11274-017-2255-0", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1084519412", 
              "https://doi.org/10.1007/s11274-017-2255-0"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/gb-2010-11-3-r25", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1050509557", 
              "https://doi.org/10.1186/gb-2010-11-3-r25"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/gb-2005-6-8-229", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1029667488", 
              "https://doi.org/10.1186/gb-2005-6-8-229"
            ], 
            "type": "CreativeWork"
          }
        ], 
        "datePublished": "2018-04-20", 
        "datePublishedReg": "2018-04-20", 
        "description": "BackgroundIn shotgun metagenomics, microbial communities are studied through direct sequencing of DNA without any prior cultivation. By comparing gene abundances estimated from the generated sequencing reads, functional differences between the communities can be identified. However, gene abundance data is affected by high levels of systematic variability, which can greatly reduce the statistical power and introduce false positives. Normalization, which is the process where systematic variability is identified and removed, is therefore a vital part of the data analysis. A wide range of normalization methods for high-dimensional count data has been proposed but their performance on the analysis of shotgun metagenomic data has not been evaluated.ResultsHere, we present a systematic evaluation of nine normalization methods for gene abundance data. The methods were evaluated through resampling of three comprehensive datasets, creating a realistic setting that preserved the unique characteristics of metagenomic data. Performance was measured in terms of the methods ability to identify differentially abundant genes (DAGs), correctly calculate unbiased p-values and control the false discovery rate (FDR). Our results showed that the choice of normalization method has a large impact on the end results. When the DAGs were asymmetrically present between the experimental conditions, many normalization methods had a reduced true positive rate (TPR) and a high false positive rate (FPR). The methods trimmed mean of M-values (TMM) and relative log expression (RLE) had the overall highest performance and are therefore recommended for the analysis of gene abundance data. For larger sample sizes, CSS also showed satisfactory performance.ConclusionsThis study emphasizes the importance of selecting a suitable normalization methods in the analysis of data from shotgun metagenomics. Our results also demonstrate that improper methods may result in unacceptably high levels of false positives, which in turn may lead to incorrect or obfuscated biological interpretation.", 
        "genre": "article", 
        "id": "sg:pub.10.1186/s12864-018-4637-6", 
        "isAccessibleForFree": true, 
        "isPartOf": [
          {
            "id": "sg:journal.1023790", 
            "issn": [
              "1471-2164"
            ], 
            "name": "BMC Genomics", 
            "publisher": "Springer Nature", 
            "type": "Periodical"
          }, 
          {
            "issueNumber": "1", 
            "type": "PublicationIssue"
          }, 
          {
            "type": "PublicationVolume", 
            "volumeNumber": "19"
          }
        ], 
        "keywords": [
          "gene abundance data", 
          "abundance data", 
          "shotgun metagenomics", 
          "metagenomic data", 
          "false discovery rate", 
          "shotgun metagenomic data", 
          "microbial communities", 
          "abundant genes", 
          "gene abundance", 
          "sequencing reads", 
          "unbiased p-values", 
          "biological interpretation", 
          "prior cultivation", 
          "functional differences", 
          "metagenomics", 
          "direct sequencing", 
          "discovery rate", 
          "high levels", 
          "high-dimensional count data", 
          "comprehensive dataset", 
          "genes", 
          "sequencing", 
          "DNA", 
          "abundance", 
          "reads", 
          "unacceptably high levels", 
          "expression", 
          "statistical power", 
          "community", 
          "ResultsHere", 
          "cultivation", 
          "false positives", 
          "count data", 
          "variability", 
          "wide range", 
          "normalization method", 
          "analysis", 
          "levels", 
          "unique characteristics", 
          "large impact", 
          "larger sample size", 
          "high false positive rate", 
          "DAG", 
          "experimental conditions", 
          "suitable normalization method", 
          "method's ability", 
          "ability", 
          "satisfactory performance", 
          "data", 
          "realistic settings", 
          "importance", 
          "results", 
          "false positive rate", 
          "rate", 
          "turn", 
          "sample size", 
          "positives", 
          "size", 
          "vital part", 
          "data analysis", 
          "process", 
          "systematic variability", 
          "systematic evaluation", 
          "resampling", 
          "conditions", 
          "ConclusionsThis study", 
          "improper methods", 
          "part", 
          "dataset", 
          "differences", 
          "p-value", 
          "study", 
          "high performance", 
          "analysis of data", 
          "performance", 
          "range", 
          "true positive rate", 
          "impact", 
          "comparison", 
          "overall high performance", 
          "positive rate", 
          "M values", 
          "terms", 
          "end result", 
          "method", 
          "power", 
          "characteristics", 
          "means", 
          "interpretation", 
          "choice", 
          "normalization", 
          "setting", 
          "CSS", 
          "evaluation"
        ], 
        "name": "Comparison of normalization methods for the analysis of metagenomic gene abundance data", 
        "pagination": "274", 
        "productId": [
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "pub.1103494409"
            ]
          }, 
          {
            "name": "doi", 
            "type": "PropertyValue", 
            "value": [
              "10.1186/s12864-018-4637-6"
            ]
          }, 
          {
            "name": "pubmed_id", 
            "type": "PropertyValue", 
            "value": [
              "29678163"
            ]
          }
        ], 
        "sameAs": [
          "https://doi.org/10.1186/s12864-018-4637-6", 
          "https://app.dimensions.ai/details/publication/pub.1103494409"
        ], 
        "sdDataset": "articles", 
        "sdDatePublished": "2022-08-04T17:07", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-springernature-scigraph/baseset/20220804/entities/gbq_results/article/article_772.jsonl", 
        "type": "ScholarlyArticle", 
        "url": "https://doi.org/10.1186/s12864-018-4637-6"
      }
    ]
     

    Download the RDF metadata as:  json-ld nt turtle xml License info

    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/s12864-018-4637-6'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/s12864-018-4637-6'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/s12864-018-4637-6'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/s12864-018-4637-6'


     

    This table displays all metadata directly associated to this object as RDF triples.

    259 TRIPLES      21 PREDICATES      140 URIs      113 LITERALS      9 BLANK NODES

    Subject Predicate Object
    1 sg:pub.10.1186/s12864-018-4637-6 schema:about Nb803de15cc4e4008b423b181920abf4e
    2 Nd9492a36b5864ebf8edaa9b15a4d783f
    3 anzsrc-for:06
    4 anzsrc-for:0604
    5 schema:author N9cb352f7f5bd4b07959b4bcc66ae0056
    6 schema:citation sg:pub.10.1007/s11274-017-2255-0
    7 sg:pub.10.1038/nature11053
    8 sg:pub.10.1038/nature11450
    9 sg:pub.10.1038/nature12198
    10 sg:pub.10.1038/nbt.2957
    11 sg:pub.10.1038/nmeth.2658
    12 sg:pub.10.1038/nmeth.2693
    13 sg:pub.10.1038/srep46130
    14 sg:pub.10.1186/1471-2105-11-94
    15 sg:pub.10.1186/1471-2105-9-386
    16 sg:pub.10.1186/gb-2005-6-8-229
    17 sg:pub.10.1186/gb-2010-11-10-r106
    18 sg:pub.10.1186/gb-2010-11-3-r25
    19 sg:pub.10.1186/s12864-016-2386-y
    20 sg:pub.10.1186/s12864-017-3686-6
    21 sg:pub.10.1186/s13059-014-0550-8
    22 sg:pub.10.1186/s13059-015-0610-8
    23 sg:pub.10.1186/s13742-015-0078-1
    24 sg:pub.10.1186/s40168-017-0237-y
    25 schema:datePublished 2018-04-20
    26 schema:datePublishedReg 2018-04-20
    27 schema:description BackgroundIn shotgun metagenomics, microbial communities are studied through direct sequencing of DNA without any prior cultivation. By comparing gene abundances estimated from the generated sequencing reads, functional differences between the communities can be identified. However, gene abundance data is affected by high levels of systematic variability, which can greatly reduce the statistical power and introduce false positives. Normalization, which is the process where systematic variability is identified and removed, is therefore a vital part of the data analysis. A wide range of normalization methods for high-dimensional count data has been proposed but their performance on the analysis of shotgun metagenomic data has not been evaluated.ResultsHere, we present a systematic evaluation of nine normalization methods for gene abundance data. The methods were evaluated through resampling of three comprehensive datasets, creating a realistic setting that preserved the unique characteristics of metagenomic data. Performance was measured in terms of the methods ability to identify differentially abundant genes (DAGs), correctly calculate unbiased p-values and control the false discovery rate (FDR). Our results showed that the choice of normalization method has a large impact on the end results. When the DAGs were asymmetrically present between the experimental conditions, many normalization methods had a reduced true positive rate (TPR) and a high false positive rate (FPR). The methods trimmed mean of M-values (TMM) and relative log expression (RLE) had the overall highest performance and are therefore recommended for the analysis of gene abundance data. For larger sample sizes, CSS also showed satisfactory performance.ConclusionsThis study emphasizes the importance of selecting a suitable normalization methods in the analysis of data from shotgun metagenomics. Our results also demonstrate that improper methods may result in unacceptably high levels of false positives, which in turn may lead to incorrect or obfuscated biological interpretation.
    28 schema:genre article
    29 schema:isAccessibleForFree true
    30 schema:isPartOf N309443e90e214582a079504f1e53f360
    31 N8761e5b42cbf4a37983ffcf3995a5a06
    32 sg:journal.1023790
    33 schema:keywords CSS
    34 ConclusionsThis study
    35 DAG
    36 DNA
    37 M values
    38 ResultsHere
    39 ability
    40 abundance
    41 abundance data
    42 abundant genes
    43 analysis
    44 analysis of data
    45 biological interpretation
    46 characteristics
    47 choice
    48 community
    49 comparison
    50 comprehensive dataset
    51 conditions
    52 count data
    53 cultivation
    54 data
    55 data analysis
    56 dataset
    57 differences
    58 direct sequencing
    59 discovery rate
    60 end result
    61 evaluation
    62 experimental conditions
    63 expression
    64 false discovery rate
    65 false positive rate
    66 false positives
    67 functional differences
    68 gene abundance
    69 gene abundance data
    70 genes
    71 high false positive rate
    72 high levels
    73 high performance
    74 high-dimensional count data
    75 impact
    76 importance
    77 improper methods
    78 interpretation
    79 large impact
    80 larger sample size
    81 levels
    82 means
    83 metagenomic data
    84 metagenomics
    85 method
    86 method's ability
    87 microbial communities
    88 normalization
    89 normalization method
    90 overall high performance
    91 p-value
    92 part
    93 performance
    94 positive rate
    95 positives
    96 power
    97 prior cultivation
    98 process
    99 range
    100 rate
    101 reads
    102 realistic settings
    103 resampling
    104 results
    105 sample size
    106 satisfactory performance
    107 sequencing
    108 sequencing reads
    109 setting
    110 shotgun metagenomic data
    111 shotgun metagenomics
    112 size
    113 statistical power
    114 study
    115 suitable normalization method
    116 systematic evaluation
    117 systematic variability
    118 terms
    119 true positive rate
    120 turn
    121 unacceptably high levels
    122 unbiased p-values
    123 unique characteristics
    124 variability
    125 vital part
    126 wide range
    127 schema:name Comparison of normalization methods for the analysis of metagenomic gene abundance data
    128 schema:pagination 274
    129 schema:productId N269faaeb8a4f424794b63055f838d295
    130 N392802fca0114dd3af6cf6d2b8985117
    131 Nd1c493e38c1f46dea87985c597c51d4c
    132 schema:sameAs https://app.dimensions.ai/details/publication/pub.1103494409
    133 https://doi.org/10.1186/s12864-018-4637-6
    134 schema:sdDatePublished 2022-08-04T17:07
    135 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    136 schema:sdPublisher N66042b910f324c0f973e4f23a7eb4bd6
    137 schema:url https://doi.org/10.1186/s12864-018-4637-6
    138 sgo:license sg:explorer/license/
    139 sgo:sdDataset articles
    140 rdf:type schema:ScholarlyArticle
    141 N04dad35de0604dec916ff11e964e9e9c rdf:first sg:person.01051113471.17
    142 rdf:rest rdf:nil
    143 N269faaeb8a4f424794b63055f838d295 schema:name pubmed_id
    144 schema:value 29678163
    145 rdf:type schema:PropertyValue
    146 N309443e90e214582a079504f1e53f360 schema:volumeNumber 19
    147 rdf:type schema:PublicationVolume
    148 N392802fca0114dd3af6cf6d2b8985117 schema:name dimensions_id
    149 schema:value pub.1103494409
    150 rdf:type schema:PropertyValue
    151 N45629500ff494eb783d224d8740ed959 rdf:first sg:person.011360455425.69
    152 rdf:rest N8c33c5ed438a44799598384b6a5d30c5
    153 N66042b910f324c0f973e4f23a7eb4bd6 schema:name Springer Nature - SN SciGraph project
    154 rdf:type schema:Organization
    155 N8761e5b42cbf4a37983ffcf3995a5a06 schema:issueNumber 1
    156 rdf:type schema:PublicationIssue
    157 N8c33c5ed438a44799598384b6a5d30c5 rdf:first sg:person.0761362305.11
    158 rdf:rest N04dad35de0604dec916ff11e964e9e9c
    159 N9cb352f7f5bd4b07959b4bcc66ae0056 rdf:first sg:person.0656367524.09
    160 rdf:rest N45629500ff494eb783d224d8740ed959
    161 Nb803de15cc4e4008b423b181920abf4e schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    162 schema:name Metagenomics
    163 rdf:type schema:DefinedTerm
    164 Nd1c493e38c1f46dea87985c597c51d4c schema:name doi
    165 schema:value 10.1186/s12864-018-4637-6
    166 rdf:type schema:PropertyValue
    167 Nd9492a36b5864ebf8edaa9b15a4d783f schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    168 schema:name Data Analysis
    169 rdf:type schema:DefinedTerm
    170 anzsrc-for:06 schema:inDefinedTermSet anzsrc-for:
    171 schema:name Biological Sciences
    172 rdf:type schema:DefinedTerm
    173 anzsrc-for:0604 schema:inDefinedTermSet anzsrc-for:
    174 schema:name Genetics
    175 rdf:type schema:DefinedTerm
    176 sg:journal.1023790 schema:issn 1471-2164
    177 schema:name BMC Genomics
    178 schema:publisher Springer Nature
    179 rdf:type schema:Periodical
    180 sg:person.01051113471.17 schema:affiliation grid-institutes:grid.8761.8
    181 schema:familyName Kristiansson
    182 schema:givenName Erik
    183 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01051113471.17
    184 rdf:type schema:Person
    185 sg:person.011360455425.69 schema:affiliation grid-institutes:grid.8761.8
    186 schema:familyName Wallroth
    187 schema:givenName Mikael
    188 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011360455425.69
    189 rdf:type schema:Person
    190 sg:person.0656367524.09 schema:affiliation grid-institutes:grid.8761.8
    191 schema:familyName Pereira
    192 schema:givenName Mariana Buongermino
    193 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0656367524.09
    194 rdf:type schema:Person
    195 sg:person.0761362305.11 schema:affiliation grid-institutes:grid.8761.8
    196 schema:familyName Jonsson
    197 schema:givenName Viktor
    198 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0761362305.11
    199 rdf:type schema:Person
    200 sg:pub.10.1007/s11274-017-2255-0 schema:sameAs https://app.dimensions.ai/details/publication/pub.1084519412
    201 https://doi.org/10.1007/s11274-017-2255-0
    202 rdf:type schema:CreativeWork
    203 sg:pub.10.1038/nature11053 schema:sameAs https://app.dimensions.ai/details/publication/pub.1052378845
    204 https://doi.org/10.1038/nature11053
    205 rdf:type schema:CreativeWork
    206 sg:pub.10.1038/nature11450 schema:sameAs https://app.dimensions.ai/details/publication/pub.1004546178
    207 https://doi.org/10.1038/nature11450
    208 rdf:type schema:CreativeWork
    209 sg:pub.10.1038/nature12198 schema:sameAs https://app.dimensions.ai/details/publication/pub.1002791386
    210 https://doi.org/10.1038/nature12198
    211 rdf:type schema:CreativeWork
    212 sg:pub.10.1038/nbt.2957 schema:sameAs https://app.dimensions.ai/details/publication/pub.1027683701
    213 https://doi.org/10.1038/nbt.2957
    214 rdf:type schema:CreativeWork
    215 sg:pub.10.1038/nmeth.2658 schema:sameAs https://app.dimensions.ai/details/publication/pub.1002139060
    216 https://doi.org/10.1038/nmeth.2658
    217 rdf:type schema:CreativeWork
    218 sg:pub.10.1038/nmeth.2693 schema:sameAs https://app.dimensions.ai/details/publication/pub.1028082738
    219 https://doi.org/10.1038/nmeth.2693
    220 rdf:type schema:CreativeWork
    221 sg:pub.10.1038/srep46130 schema:sameAs https://app.dimensions.ai/details/publication/pub.1084748698
    222 https://doi.org/10.1038/srep46130
    223 rdf:type schema:CreativeWork
    224 sg:pub.10.1186/1471-2105-11-94 schema:sameAs https://app.dimensions.ai/details/publication/pub.1053091615
    225 https://doi.org/10.1186/1471-2105-11-94
    226 rdf:type schema:CreativeWork
    227 sg:pub.10.1186/1471-2105-9-386 schema:sameAs https://app.dimensions.ai/details/publication/pub.1006083026
    228 https://doi.org/10.1186/1471-2105-9-386
    229 rdf:type schema:CreativeWork
    230 sg:pub.10.1186/gb-2005-6-8-229 schema:sameAs https://app.dimensions.ai/details/publication/pub.1029667488
    231 https://doi.org/10.1186/gb-2005-6-8-229
    232 rdf:type schema:CreativeWork
    233 sg:pub.10.1186/gb-2010-11-10-r106 schema:sameAs https://app.dimensions.ai/details/publication/pub.1031289083
    234 https://doi.org/10.1186/gb-2010-11-10-r106
    235 rdf:type schema:CreativeWork
    236 sg:pub.10.1186/gb-2010-11-3-r25 schema:sameAs https://app.dimensions.ai/details/publication/pub.1050509557
    237 https://doi.org/10.1186/gb-2010-11-3-r25
    238 rdf:type schema:CreativeWork
    239 sg:pub.10.1186/s12864-016-2386-y schema:sameAs https://app.dimensions.ai/details/publication/pub.1020096515
    240 https://doi.org/10.1186/s12864-016-2386-y
    241 rdf:type schema:CreativeWork
    242 sg:pub.10.1186/s12864-017-3686-6 schema:sameAs https://app.dimensions.ai/details/publication/pub.1084954596
    243 https://doi.org/10.1186/s12864-017-3686-6
    244 rdf:type schema:CreativeWork
    245 sg:pub.10.1186/s13059-014-0550-8 schema:sameAs https://app.dimensions.ai/details/publication/pub.1015222646
    246 https://doi.org/10.1186/s13059-014-0550-8
    247 rdf:type schema:CreativeWork
    248 sg:pub.10.1186/s13059-015-0610-8 schema:sameAs https://app.dimensions.ai/details/publication/pub.1029050385
    249 https://doi.org/10.1186/s13059-015-0610-8
    250 rdf:type schema:CreativeWork
    251 sg:pub.10.1186/s13742-015-0078-1 schema:sameAs https://app.dimensions.ai/details/publication/pub.1027641814
    252 https://doi.org/10.1186/s13742-015-0078-1
    253 rdf:type schema:CreativeWork
    254 sg:pub.10.1186/s40168-017-0237-y schema:sameAs https://app.dimensions.ai/details/publication/pub.1084252802
    255 https://doi.org/10.1186/s40168-017-0237-y
    256 rdf:type schema:CreativeWork
    257 grid-institutes:grid.8761.8 schema:alternateName Department of Mathematical Sciences, Chalmers University of Technology and University of Gothenburg, SE-412 96, Gothenburg, Sweden
    258 schema:name Department of Mathematical Sciences, Chalmers University of Technology and University of Gothenburg, SE-412 96, Gothenburg, Sweden
    259 rdf:type schema:Organization
     




    Preview window. Press ESC to close (or click here)


    ...