Ontology type: schema:ScholarlyArticle Open Access: True
2014-05-05
AUTHORSAndrew D Fernandes, Jennifer NS Reid, Jean M Macklaim, Thomas A McMurrough, David R Edgell, Gregory B Gloor
ABSTRACTBackgroundExperimental designs that take advantage of high-throughput sequencing to generate datasets include RNA sequencing (RNA-seq), chromatin immunoprecipitation sequencing (ChIP-seq), sequencing of 16S rRNA gene fragments, metagenomic analysis and selective growth experiments. In each case the underlying data are similar and are composed of counts of sequencing reads mapped to a large number of features in each sample. Despite this underlying similarity, the data analysis methods used for these experimental designs are all different, and do not translate across experiments. Alternative methods have been developed in the physical and geological sciences that treat similar data as compositions. Compositional data analysis methods transform the data to relative abundances with the result that the analyses are more robust and reproducible.ResultsData from an in vitro selective growth experiment, an RNA-seq experiment and the Human Microbiome Project 16S rRNA gene abundance dataset were examined by ALDEx2, a compositional data analysis tool that uses Bayesian methods to infer technical and statistical error. The ALDEx2 approach is shown to be suitable for all three types of data: it correctly identifies both the direction and differential abundance of features in the differential growth experiment, it identifies a substantially similar set of differentially expressed genes in the RNA-seq dataset as the leading tools and it identifies as differential the taxa that distinguish the tongue dorsum and buccal mucosa in the Human Microbiome Project dataset. The design of ALDEx2 reduces the number of false positive identifications that result from datasets composed of many features in few samples.ConclusionStatistical analysis of high-throughput sequencing datasets composed of per feature counts showed that the ALDEx2 R package is a simple and robust tool, which can be applied to RNA-seq, 16S rRNA gene sequencing and differential growth datasets, and by extension to other techniques that use a similar approach. More... »
PAGES15
http://scigraph.springernature.com/pub.10.1186/2049-2618-2-15
DOIhttp://dx.doi.org/10.1186/2049-2618-2-15
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1046874717
PUBMEDhttps://www.ncbi.nlm.nih.gov/pubmed/24910773
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Biological Sciences",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Genetics",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "YouKaryote Genomics, London, ON, Canada",
"id": "http://www.grid.ac/institutes/None",
"name": [
"YouKaryote Genomics, London, ON, Canada"
],
"type": "Organization"
},
"familyName": "Fernandes",
"givenName": "Andrew D",
"id": "sg:person.01315170521.35",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01315170521.35"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Department of Biochemistry, Medical Science Building, University of Western Ontario, Richmond St, 1151, N6A 5C1, London, ON, Canada",
"id": "http://www.grid.ac/institutes/grid.39381.30",
"name": [
"Department of Biochemistry, Medical Science Building, University of Western Ontario, Richmond St, 1151, N6A 5C1, London, ON, Canada"
],
"type": "Organization"
},
"familyName": "Reid",
"givenName": "Jennifer NS",
"type": "Person"
},
{
"affiliation": {
"alternateName": "Department of Biochemistry, Medical Science Building, University of Western Ontario, Richmond St, 1151, N6A 5C1, London, ON, Canada",
"id": "http://www.grid.ac/institutes/grid.39381.30",
"name": [
"Department of Biochemistry, Medical Science Building, University of Western Ontario, Richmond St, 1151, N6A 5C1, London, ON, Canada"
],
"type": "Organization"
},
"familyName": "Macklaim",
"givenName": "Jean M",
"id": "sg:person.01306205104.89",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01306205104.89"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Department of Biochemistry, Medical Science Building, University of Western Ontario, Richmond St, 1151, N6A 5C1, London, ON, Canada",
"id": "http://www.grid.ac/institutes/grid.39381.30",
"name": [
"Department of Biochemistry, Medical Science Building, University of Western Ontario, Richmond St, 1151, N6A 5C1, London, ON, Canada"
],
"type": "Organization"
},
"familyName": "McMurrough",
"givenName": "Thomas A",
"id": "sg:person.0742354142.70",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0742354142.70"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Department of Biochemistry, Medical Science Building, University of Western Ontario, Richmond St, 1151, N6A 5C1, London, ON, Canada",
"id": "http://www.grid.ac/institutes/grid.39381.30",
"name": [
"Department of Biochemistry, Medical Science Building, University of Western Ontario, Richmond St, 1151, N6A 5C1, London, ON, Canada"
],
"type": "Organization"
},
"familyName": "Edgell",
"givenName": "David R",
"id": "sg:person.01246046512.37",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01246046512.37"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Department of Biochemistry, Medical Science Building, University of Western Ontario, Richmond St, 1151, N6A 5C1, London, ON, Canada",
"id": "http://www.grid.ac/institutes/grid.39381.30",
"name": [
"Department of Biochemistry, Medical Science Building, University of Western Ontario, Richmond St, 1151, N6A 5C1, London, ON, Canada"
],
"type": "Organization"
},
"familyName": "Gloor",
"givenName": "Gregory B",
"id": "sg:person.013452663514.54",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013452663514.54"
],
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1023/a:1023818214614",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1024177181",
"https://doi.org/10.1023/a:1023818214614"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-14-91",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1016675314",
"https://doi.org/10.1186/1471-2105-14-91"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nmeth.f.303",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1009032055",
"https://doi.org/10.1038/nmeth.f.303"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/978-94-009-4109-0",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1109716595",
"https://doi.org/10.1007/978-94-009-4109-0"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s11004-005-7381-9",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1035536844",
"https://doi.org/10.1007/s11004-005-7381-9"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-14-135",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1016618347",
"https://doi.org/10.1186/1471-2105-14-135"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2010-11-2-106",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1017440825",
"https://doi.org/10.1186/gb-2010-11-2-106"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s004420100716",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1021940115",
"https://doi.org/10.1007/s004420100716"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nprot.2013.099",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1033430059",
"https://doi.org/10.1038/nprot.2013.099"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nrg3129",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1019355616",
"https://doi.org/10.1038/nrg3129"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2010-11-10-r106",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031289083",
"https://doi.org/10.1186/gb-2010-11-10-r106"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-12-449",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1032929685",
"https://doi.org/10.1186/1471-2105-12-449"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2012-13-6-r42",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1038980434",
"https://doi.org/10.1186/gb-2012-13-6-r42"
],
"type": "CreativeWork"
}
],
"datePublished": "2014-05-05",
"datePublishedReg": "2014-05-05",
"description": "BackgroundExperimental designs that take advantage of high-throughput sequencing to generate datasets include RNA sequencing (RNA-seq), chromatin immunoprecipitation sequencing (ChIP-seq), sequencing of 16S rRNA gene fragments, metagenomic analysis and selective growth experiments. In each case the underlying data are similar and are composed of counts of sequencing reads mapped to a large number of features in each sample. Despite this underlying similarity, the data analysis methods used for these experimental designs are all different, and do not translate across experiments. Alternative methods have been developed in the physical and geological sciences that treat similar data as compositions. Compositional data analysis methods transform the data to relative abundances with the result that the analyses are more robust and reproducible.ResultsData from an in vitro selective growth experiment, an RNA-seq experiment and the Human Microbiome Project 16S rRNA gene abundance dataset were examined by ALDEx2, a compositional data analysis tool that uses Bayesian methods to infer technical and statistical error. The ALDEx2 approach is shown to be suitable for all three types of data: it correctly identifies both the direction and differential abundance of features in the differential growth experiment, it identifies a substantially similar set of differentially expressed genes in the RNA-seq dataset as the leading tools and it identifies as differential the taxa that distinguish the tongue dorsum and buccal mucosa in the Human Microbiome Project dataset. The design of ALDEx2 reduces the number of false positive identifications that result from datasets composed of many features in few samples.ConclusionStatistical analysis of high-throughput sequencing datasets composed of per feature counts showed that the ALDEx2 R package is a simple and robust tool, which can be applied to RNA-seq, 16S rRNA gene sequencing and differential growth datasets, and by extension to other techniques that use a similar approach.",
"genre": "article",
"id": "sg:pub.10.1186/2049-2618-2-15",
"isAccessibleForFree": true,
"isPartOf": [
{
"id": "sg:journal.1048878",
"issn": [
"2049-2618"
],
"name": "Microbiome",
"publisher": "Springer Nature",
"type": "Periodical"
},
{
"issueNumber": "1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "2"
}
],
"keywords": [
"selective growth experiments",
"statistical errors",
"data analysis methods",
"high-throughput sequencing datasets",
"compositional data analysis methods",
"compositional data analysis",
"Bayesian methods",
"Human Microbiome Project dataset",
"data analysis tools",
"R package",
"analysis method",
"ALDEx2",
"types of data",
"analysis tools",
"robust tool",
"abundance datasets",
"large number",
"similar approach",
"growth dataset",
"data analysis",
"sequencing datasets",
"RNA-seq experiments",
"alternative method",
"error",
"approach",
"experimental design",
"extension",
"dataset",
"Project dataset",
"design",
"number",
"set",
"tool",
"experiments",
"feature counts",
"package",
"RNA-seq datasets",
"false positive identifications",
"analysis",
"features",
"direction",
"data",
"technique",
"similar set",
"advantages",
"geological sciences",
"cases",
"science",
"results",
"growth experiments",
"similar data",
"types",
"samples",
"identification",
"sequencing reads",
"rRNA gene sequencing",
"similarity",
"RNA-seq",
"high-throughput sequencing",
"gene sequencing",
"rRNA gene fragments",
"differential abundance",
"chromatin immunoprecipitation",
"RNA sequencing",
"gene fragments",
"metagenomic analysis",
"count",
"positive identification",
"relative abundance",
"sequencing",
"abundance",
"composition",
"reads",
"taxa",
"immunoprecipitation",
"genes",
"fragments",
"tongue dorsum",
"dorsum",
"method",
"mucosa",
"ResultsData",
"buccal mucosa"
],
"name": "Unifying the analysis of high-throughput sequencing datasets: characterizing RNA-seq, 16S rRNA gene sequencing and selective growth experiments by compositional data analysis",
"pagination": "15",
"productId": [
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1046874717"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1186/2049-2618-2-15"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"24910773"
]
}
],
"sameAs": [
"https://doi.org/10.1186/2049-2618-2-15",
"https://app.dimensions.ai/details/publication/pub.1046874717"
],
"sdDataset": "articles",
"sdDatePublished": "2022-08-04T17:02",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-springernature-scigraph/baseset/20220804/entities/gbq_results/article/article_644.jsonl",
"type": "ScholarlyArticle",
"url": "https://doi.org/10.1186/2049-2618-2-15"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/2049-2618-2-15'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/2049-2618-2-15'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/2049-2618-2-15'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/2049-2618-2-15'
This table displays all metadata directly associated to this object as RDF triples.
232 TRIPLES
21 PREDICATES
121 URIs
100 LITERALS
7 BLANK NODES