Kernel Methods for Regression Analysis of Microbiome Compositional Data View Full Text


Ontology type: schema:Chapter     


Chapter Info

DATE

2013-08-15

AUTHORS

Jun Chen , Hongzhe Li

ABSTRACT

With the development of next generation sequencing technologies, the human microbiome can now be studied using direct DNA sequencing. Many human diseases have been shown to be associated with the disorder of the human microbiome. Previous statistical methods for associating the microbiome composition with an outcome such as disease status focus on the association of the abundance of individual taxon or their abundance ratios with the outcome variable. However, the problem of multiple testing leads to loss of power to detect the association. When individual taxon-level association test fails, an overall test, which pools the individually weak association signal, can be applied to test the significance of the effect of the overall microbiome composition on an outcome variable. In this paper, we propose a kernel-based semi-parametric regression method for testing the significance of the effect of the microbiome composition on a continuous or binary outcome. Our method provides the flexibility to incorporate the phylogenetic information into the kernels as well as the ability to naturally adjust for the covariate effects. We evaluate our methods using simulations as well as a real data set on testing the significance of the human gut microbiome composition on body mass index (BMI) while adjusting for total fat intake. Our result suggests that the gut microbiome has a strong effect on BMI and this effect is independent of total fat intake. More... »

PAGES

191-201

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/978-1-4614-7846-1_16

DOI

http://dx.doi.org/10.1007/978-1-4614-7846-1_16

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1035879661


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/01", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Mathematical Sciences", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0104", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Statistics", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA", 
          "id": "http://www.grid.ac/institutes/grid.38142.3c", 
          "name": [
            "Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Chen", 
        "givenName": "Jun", 
        "id": "sg:person.01023412627.52", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01023412627.52"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Department of Biostatistics and Epidemiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA", 
          "id": "http://www.grid.ac/institutes/grid.25879.31", 
          "name": [
            "Department of Biostatistics and Epidemiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Li", 
        "givenName": "Hongzhe", 
        "id": "sg:person.013262445067.87", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013262445067.87"
        ], 
        "type": "Person"
      }
    ], 
    "datePublished": "2013-08-15", 
    "datePublishedReg": "2013-08-15", 
    "description": "With the development of next generation sequencing technologies, the human microbiome can now be studied using direct DNA sequencing. Many human diseases have been shown to be associated with the disorder of the human microbiome. Previous statistical methods for associating the microbiome composition with an outcome such as disease status focus on the association of the abundance of individual taxon or their abundance ratios with the outcome variable. However, the problem of multiple testing leads to loss of power to detect the association. When individual taxon-level association test fails, an overall test, which pools the individually weak association signal, can be applied to test the significance of the effect of the overall microbiome composition on an outcome variable. In this paper, we propose a kernel-based semi-parametric regression method for testing the significance of the effect of the microbiome composition on a continuous or binary outcome. Our method provides the flexibility to incorporate the phylogenetic information into the kernels as well as the ability to naturally adjust for the covariate effects. We evaluate our methods using simulations as well as a real data set on testing the significance of the human gut microbiome composition on body mass index (BMI) while adjusting for total fat intake. Our result suggests that the gut microbiome has a strong effect on BMI and this effect is independent of total fat intake.", 
    "editor": [
      {
        "familyName": "Hu", 
        "givenName": "Mingxiu", 
        "type": "Person"
      }, 
      {
        "familyName": "Liu", 
        "givenName": "Yi", 
        "type": "Person"
      }, 
      {
        "familyName": "Lin", 
        "givenName": "Jianchang", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/978-1-4614-7846-1_16", 
    "isAccessibleForFree": false, 
    "isPartOf": {
      "isbn": [
        "978-1-4614-7845-4", 
        "978-1-4614-7846-1"
      ], 
      "name": "Topics in Applied Statistics", 
      "type": "Book"
    }, 
    "keywords": [
      "body mass index", 
      "total fat intake", 
      "fat intake", 
      "microbiome composition", 
      "gut microbiome composition", 
      "human microbiome", 
      "overall microbiome composition", 
      "mass index", 
      "direct DNA sequencing", 
      "human gut microbiome composition", 
      "gut microbiome", 
      "outcomes", 
      "regression analysis", 
      "intake", 
      "binary outcomes", 
      "human diseases", 
      "microbiome", 
      "association", 
      "DNA sequencing", 
      "disease", 
      "significance", 
      "testing lead", 
      "disorders", 
      "effect", 
      "association signals", 
      "test", 
      "covariate effects", 
      "semi-parametric regression methods", 
      "Association Test", 
      "overall test", 
      "index", 
      "data", 
      "sequencing", 
      "loss", 
      "strong effect", 
      "method", 
      "ability", 
      "loss of power", 
      "development", 
      "regression method", 
      "ratio", 
      "lead", 
      "analysis", 
      "statistical methods", 
      "focus", 
      "results", 
      "information", 
      "next generation", 
      "composition", 
      "generation", 
      "signals", 
      "abundance", 
      "problem", 
      "individual taxa", 
      "technology", 
      "power", 
      "flexibility", 
      "compositional data", 
      "paper", 
      "taxa", 
      "weak association signals", 
      "phylogenetic information", 
      "abundance ratios", 
      "real data", 
      "microbiome compositional data", 
      "previous statistical methods", 
      "kernel", 
      "simulations", 
      "kernel method"
    ], 
    "name": "Kernel Methods for Regression Analysis of Microbiome Compositional Data", 
    "pagination": "191-201", 
    "productId": [
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1035879661"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/978-1-4614-7846-1_16"
        ]
      }
    ], 
    "publisher": {
      "name": "Springer Nature", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/978-1-4614-7846-1_16", 
      "https://app.dimensions.ai/details/publication/pub.1035879661"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2022-09-02T16:14", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-springernature-scigraph/baseset/20220902/entities/gbq_results/chapter/chapter_340.jsonl", 
    "type": "Chapter", 
    "url": "https://doi.org/10.1007/978-1-4614-7846-1_16"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-1-4614-7846-1_16'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-1-4614-7846-1_16'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-1-4614-7846-1_16'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-1-4614-7846-1_16'


 

This table displays all metadata directly associated to this object as RDF triples.

148 TRIPLES      22 PREDICATES      93 URIs      86 LITERALS      7 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/978-1-4614-7846-1_16 schema:about anzsrc-for:01
2 anzsrc-for:0104
3 schema:author N1efefe5a162346658d1c515725ef8edf
4 schema:datePublished 2013-08-15
5 schema:datePublishedReg 2013-08-15
6 schema:description With the development of next generation sequencing technologies, the human microbiome can now be studied using direct DNA sequencing. Many human diseases have been shown to be associated with the disorder of the human microbiome. Previous statistical methods for associating the microbiome composition with an outcome such as disease status focus on the association of the abundance of individual taxon or their abundance ratios with the outcome variable. However, the problem of multiple testing leads to loss of power to detect the association. When individual taxon-level association test fails, an overall test, which pools the individually weak association signal, can be applied to test the significance of the effect of the overall microbiome composition on an outcome variable. In this paper, we propose a kernel-based semi-parametric regression method for testing the significance of the effect of the microbiome composition on a continuous or binary outcome. Our method provides the flexibility to incorporate the phylogenetic information into the kernels as well as the ability to naturally adjust for the covariate effects. We evaluate our methods using simulations as well as a real data set on testing the significance of the human gut microbiome composition on body mass index (BMI) while adjusting for total fat intake. Our result suggests that the gut microbiome has a strong effect on BMI and this effect is independent of total fat intake.
7 schema:editor Nf6dda7bd22cb4e15b6990812c5a26929
8 schema:genre chapter
9 schema:isAccessibleForFree false
10 schema:isPartOf N450f5330b64f4e488892c9b35d9cbbbb
11 schema:keywords Association Test
12 DNA sequencing
13 ability
14 abundance
15 abundance ratios
16 analysis
17 association
18 association signals
19 binary outcomes
20 body mass index
21 composition
22 compositional data
23 covariate effects
24 data
25 development
26 direct DNA sequencing
27 disease
28 disorders
29 effect
30 fat intake
31 flexibility
32 focus
33 generation
34 gut microbiome
35 gut microbiome composition
36 human diseases
37 human gut microbiome composition
38 human microbiome
39 index
40 individual taxa
41 information
42 intake
43 kernel
44 kernel method
45 lead
46 loss
47 loss of power
48 mass index
49 method
50 microbiome
51 microbiome composition
52 microbiome compositional data
53 next generation
54 outcomes
55 overall microbiome composition
56 overall test
57 paper
58 phylogenetic information
59 power
60 previous statistical methods
61 problem
62 ratio
63 real data
64 regression analysis
65 regression method
66 results
67 semi-parametric regression methods
68 sequencing
69 signals
70 significance
71 simulations
72 statistical methods
73 strong effect
74 taxa
75 technology
76 test
77 testing lead
78 total fat intake
79 weak association signals
80 schema:name Kernel Methods for Regression Analysis of Microbiome Compositional Data
81 schema:pagination 191-201
82 schema:productId N83f9df69f05a45c78f146859f9e161dd
83 Ndee7679ec33e4bc0bc01d5343ed29ec0
84 schema:publisher N608c3739039b49a081a0dee45ecf69a7
85 schema:sameAs https://app.dimensions.ai/details/publication/pub.1035879661
86 https://doi.org/10.1007/978-1-4614-7846-1_16
87 schema:sdDatePublished 2022-09-02T16:14
88 schema:sdLicense https://scigraph.springernature.com/explorer/license/
89 schema:sdPublisher N9ff6fd16ca4e4bb08af918dfa50951cb
90 schema:url https://doi.org/10.1007/978-1-4614-7846-1_16
91 sgo:license sg:explorer/license/
92 sgo:sdDataset chapters
93 rdf:type schema:Chapter
94 N1efefe5a162346658d1c515725ef8edf rdf:first sg:person.01023412627.52
95 rdf:rest N8189d1ef41ff4a8ca3ab587b9e2f1768
96 N2c8032aa808e408791f92076ae569a04 schema:familyName Liu
97 schema:givenName Yi
98 rdf:type schema:Person
99 N450f5330b64f4e488892c9b35d9cbbbb schema:isbn 978-1-4614-7845-4
100 978-1-4614-7846-1
101 schema:name Topics in Applied Statistics
102 rdf:type schema:Book
103 N608c3739039b49a081a0dee45ecf69a7 schema:name Springer Nature
104 rdf:type schema:Organisation
105 N8189d1ef41ff4a8ca3ab587b9e2f1768 rdf:first sg:person.013262445067.87
106 rdf:rest rdf:nil
107 N83f9df69f05a45c78f146859f9e161dd schema:name dimensions_id
108 schema:value pub.1035879661
109 rdf:type schema:PropertyValue
110 N947322994a234a21932d3691c224a811 rdf:first Naff5576efc2744dba7c23fd992bca876
111 rdf:rest rdf:nil
112 N9e3f6127154944c58b04cb0826ca5dd1 rdf:first N2c8032aa808e408791f92076ae569a04
113 rdf:rest N947322994a234a21932d3691c224a811
114 N9ff6fd16ca4e4bb08af918dfa50951cb schema:name Springer Nature - SN SciGraph project
115 rdf:type schema:Organization
116 Naff5576efc2744dba7c23fd992bca876 schema:familyName Lin
117 schema:givenName Jianchang
118 rdf:type schema:Person
119 Ndee7679ec33e4bc0bc01d5343ed29ec0 schema:name doi
120 schema:value 10.1007/978-1-4614-7846-1_16
121 rdf:type schema:PropertyValue
122 Nf6dda7bd22cb4e15b6990812c5a26929 rdf:first Nfd0e836a10c94c7dac276f0950827355
123 rdf:rest N9e3f6127154944c58b04cb0826ca5dd1
124 Nfd0e836a10c94c7dac276f0950827355 schema:familyName Hu
125 schema:givenName Mingxiu
126 rdf:type schema:Person
127 anzsrc-for:01 schema:inDefinedTermSet anzsrc-for:
128 schema:name Mathematical Sciences
129 rdf:type schema:DefinedTerm
130 anzsrc-for:0104 schema:inDefinedTermSet anzsrc-for:
131 schema:name Statistics
132 rdf:type schema:DefinedTerm
133 sg:person.01023412627.52 schema:affiliation grid-institutes:grid.38142.3c
134 schema:familyName Chen
135 schema:givenName Jun
136 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01023412627.52
137 rdf:type schema:Person
138 sg:person.013262445067.87 schema:affiliation grid-institutes:grid.25879.31
139 schema:familyName Li
140 schema:givenName Hongzhe
141 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013262445067.87
142 rdf:type schema:Person
143 grid-institutes:grid.25879.31 schema:alternateName Department of Biostatistics and Epidemiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
144 schema:name Department of Biostatistics and Epidemiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
145 rdf:type schema:Organization
146 grid-institutes:grid.38142.3c schema:alternateName Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA
147 schema:name Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA
148 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...