Ontology type: schema:ScholarlyArticle Open Access: True
2016-03-18
AUTHORSRon Wehrens, Jos. A. Hageman, Fred van Eeuwijk, Rik Kooke, Pádraic J. Flood, Erik Wijnker, Joost J. B. Keurentjes, Arjen Lommen, Henriëtte D. L. M. van Eekelen, Robert D. Hall, Roland Mumm, Ric C. H. de Vos
ABSTRACTIntroductionBatch effects in large untargeted metabolomics experiments are almost unavoidable, especially when sensitive detection techniques like mass spectrometry (MS) are employed. In order to obtain peak intensities that are comparable across all batches, corrections need to be performed. Since non-detects, i.e., signals with an intensity too low to be detected with certainty, are common in metabolomics studies, the batch correction methods need to take these into account. ObjectivesThis paper aims to compare several batch correction methods, and investigates the effect of different strategies for handling non-detects.MethodsBatch correction methods usually consist of regression models, possibly also accounting for trends within batches. To fit these models quality control samples (QCs), injected at regular intervals, can be used. Also study samples can be used, provided that the injection order is properly randomized. Normalization methods, not using information on batch labels or injection order, can correct for batch effects as well. Introducing two easy-to-use quality criteria, we assess the merits of these batch correction strategies using three large LC–MS and GC–MS data sets of samples from Arabidopsis thaliana.ResultsThe three data sets have very different characteristics, leading to clearly distinct behaviour of the batch correction strategies studied. Explicit inclusion of information on batch and injection order in general leads to very good corrections; when enough QCs are available, also general normalization approaches perform well. Several approaches are shown to be able to handle non-detects—replacing them with very small numbers such as zero seems the worst of the approaches considered.ConclusionThe use of quality control samples for batch correction leads to good results when enough QCs are available. If an experiment is properly set up, batch correction using the study samples usually leads to a similar high-quality correction, but has the advantage that more metabolites are corrected. The strategy for handling non-detects is important: choosing small values like zero can lead to suboptimal batch corrections. More... »
PAGES88
http://scigraph.springernature.com/pub.10.1007/s11306-016-1015-8
DOIhttp://dx.doi.org/10.1007/s11306-016-1015-8
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1010819562
PUBMEDhttps://www.ncbi.nlm.nih.gov/pubmed/27073351
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/03",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Chemical Sciences",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0301",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Analytical Chemistry",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "Bioscience, Wageningen UR, Wageningen, The Netherlands",
"id": "http://www.grid.ac/institutes/grid.4818.5",
"name": [
"Biometris, Wageningen UR, Wageningen, The Netherlands",
"Bioscience, Wageningen UR, Wageningen, The Netherlands"
],
"type": "Organization"
},
"familyName": "Wehrens",
"givenName": "Ron",
"id": "sg:person.0707004771.27",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0707004771.27"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Biometris, Wageningen UR, Wageningen, The Netherlands",
"id": "http://www.grid.ac/institutes/grid.4818.5",
"name": [
"Biometris, Wageningen UR, Wageningen, The Netherlands"
],
"type": "Organization"
},
"familyName": "Hageman",
"givenName": "Jos. A.",
"id": "sg:person.0770101362.75",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0770101362.75"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Biometris, Wageningen UR, Wageningen, The Netherlands",
"id": "http://www.grid.ac/institutes/grid.4818.5",
"name": [
"Biometris, Wageningen UR, Wageningen, The Netherlands"
],
"type": "Organization"
},
"familyName": "van Eeuwijk",
"givenName": "Fred",
"id": "sg:person.0650756633.82",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0650756633.82"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Laboratory of Plant Physiology, Wageningen UR, Wageningen, The Netherlands",
"id": "http://www.grid.ac/institutes/grid.4818.5",
"name": [
"Laboratory of Genetics, Wageningen UR, Wageningen, The Netherlands",
"Laboratory of Plant Physiology, Wageningen UR, Wageningen, The Netherlands"
],
"type": "Organization"
},
"familyName": "Kooke",
"givenName": "Rik",
"id": "sg:person.0607514543.46",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0607514543.46"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Max Planck Institute For Plant Breeding Research, Cologne, Germany",
"id": "http://www.grid.ac/institutes/grid.419498.9",
"name": [
"Laboratory of Genetics, Wageningen UR, Wageningen, The Netherlands",
"Horticulture and Production Physiology, Wageningen UR, Wageningen, The Netherlands",
"Max Planck Institute For Plant Breeding Research, Cologne, Germany"
],
"type": "Organization"
},
"familyName": "Flood",
"givenName": "P\u00e1draic J.",
"id": "sg:person.01314356234.66",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01314356234.66"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Developmental Biology, Hamburg University, Hamburg, Germany",
"id": "http://www.grid.ac/institutes/grid.9026.d",
"name": [
"Laboratory of Genetics, Wageningen UR, Wageningen, The Netherlands",
"Developmental Biology, Hamburg University, Hamburg, Germany"
],
"type": "Organization"
},
"familyName": "Wijnker",
"givenName": "Erik",
"id": "sg:person.01124432620.04",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01124432620.04"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Laboratory of Genetics, Wageningen UR, Wageningen, The Netherlands",
"id": "http://www.grid.ac/institutes/grid.4818.5",
"name": [
"Laboratory of Genetics, Wageningen UR, Wageningen, The Netherlands"
],
"type": "Organization"
},
"familyName": "Keurentjes",
"givenName": "Joost J. B.",
"id": "sg:person.010104047604.94",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010104047604.94"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "RIKILT, Wageningen UR, Wageningen, The Netherlands",
"id": "http://www.grid.ac/institutes/grid.4818.5",
"name": [
"RIKILT, Wageningen UR, Wageningen, The Netherlands"
],
"type": "Organization"
},
"familyName": "Lommen",
"givenName": "Arjen",
"id": "sg:person.0713030357.17",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0713030357.17"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Bioscience, Wageningen UR, Wageningen, The Netherlands",
"id": "http://www.grid.ac/institutes/grid.4818.5",
"name": [
"Bioscience, Wageningen UR, Wageningen, The Netherlands"
],
"type": "Organization"
},
"familyName": "van Eekelen",
"givenName": "Henri\u00ebtte D. L. M.",
"id": "sg:person.0605446667.25",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0605446667.25"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Laboratory of Plant Physiology, Wageningen UR, Wageningen, The Netherlands",
"id": "http://www.grid.ac/institutes/grid.4818.5",
"name": [
"Bioscience, Wageningen UR, Wageningen, The Netherlands",
"Laboratory of Plant Physiology, Wageningen UR, Wageningen, The Netherlands"
],
"type": "Organization"
},
"familyName": "Hall",
"givenName": "Robert D.",
"id": "sg:person.0704576323.20",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0704576323.20"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Bioscience, Wageningen UR, Wageningen, The Netherlands",
"id": "http://www.grid.ac/institutes/grid.4818.5",
"name": [
"Bioscience, Wageningen UR, Wageningen, The Netherlands"
],
"type": "Organization"
},
"familyName": "Mumm",
"givenName": "Roland",
"id": "sg:person.01366416351.05",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01366416351.05"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Bioscience, Wageningen UR, Wageningen, The Netherlands",
"id": "http://www.grid.ac/institutes/grid.4818.5",
"name": [
"Bioscience, Wageningen UR, Wageningen, The Netherlands"
],
"type": "Organization"
},
"familyName": "de Vos",
"givenName": "Ric C. H.",
"id": "sg:person.014575024362.22",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014575024362.22"
],
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1007/s11306-015-0925-1",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1016892332",
"https://doi.org/10.1007/s11306-015-0925-1"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nrm3314",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1023146162",
"https://doi.org/10.1038/nrm3314"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/978-3-642-17841-2",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1026545920",
"https://doi.org/10.1007/978-3-642-17841-2"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/978-0-387-77318-6",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1018506708",
"https://doi.org/10.1007/978-0-387-77318-6"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt.2931",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1008162673",
"https://doi.org/10.1038/nbt.2931"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s11306-014-0742-y",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1041306727",
"https://doi.org/10.1007/s11306-014-0742-y"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s11306-014-0625-2",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1003227544",
"https://doi.org/10.1007/s11306-014-0625-2"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nprot.2007.95",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1029308877",
"https://doi.org/10.1038/nprot.2007.95"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2172-9-59",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1007480674",
"https://doi.org/10.1186/1471-2172-9-59"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/978-1-4757-1904-8",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031639131",
"https://doi.org/10.1007/978-1-4757-1904-8"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s00216-013-6856-7",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1035732243",
"https://doi.org/10.1007/s00216-013-6856-7"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nprot.2011.335",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1015396743",
"https://doi.org/10.1038/nprot.2011.335"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/978-1-61779-594-7_6",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1052181974",
"https://doi.org/10.1007/978-1-61779-594-7_6"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/ng.1042",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1044562954",
"https://doi.org/10.1038/ng.1042"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s11306-012-0434-4",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1002880572",
"https://doi.org/10.1007/s11306-012-0434-4"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s11306-011-0368-2",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1042363301",
"https://doi.org/10.1007/s11306-011-0368-2"
],
"type": "CreativeWork"
}
],
"datePublished": "2016-03-18",
"datePublishedReg": "2016-03-18",
"description": "IntroductionBatch effects in large untargeted metabolomics experiments are almost unavoidable, especially when sensitive detection techniques like mass spectrometry (MS) are employed. In order to obtain peak intensities that are comparable across all batches, corrections need to be performed. Since non-detects, i.e., signals with an intensity too low to be detected with certainty, are common in metabolomics studies, the batch correction methods need to take these into account.\nObjectivesThis paper aims to compare several batch correction methods, and investigates the effect of different strategies for handling non-detects.MethodsBatch correction methods usually consist of regression models, possibly also accounting for trends within batches. To fit these models quality control samples (QCs), injected at regular intervals, can be used. Also study samples can be used, provided that the injection order is properly randomized. Normalization methods, not using information on batch labels or injection order, can correct for batch effects as well. Introducing two easy-to-use quality criteria, we assess the merits of these batch correction strategies using three large LC\u2013MS and GC\u2013MS data sets of samples from Arabidopsis thaliana.ResultsThe three data sets have very different characteristics, leading to clearly distinct behaviour of the batch correction strategies studied. Explicit inclusion of information on batch and injection order in general leads to very good corrections; when enough QCs are available, also general normalization approaches perform well. Several approaches are shown to be able to handle non-detects\u2014replacing them with very small numbers such as zero seems the worst of the approaches considered.ConclusionThe use of quality control samples for batch correction leads to good results when enough QCs are available. If an experiment is properly set up, batch correction using the study samples usually leads to a similar high-quality correction, but has the advantage that more metabolites are corrected. The strategy for handling non-detects is important: choosing small values like zero can lead to suboptimal batch corrections.",
"genre": "article",
"id": "sg:pub.10.1007/s11306-016-1015-8",
"inLanguage": "en",
"isAccessibleForFree": true,
"isPartOf": [
{
"id": "sg:journal.1036887",
"issn": [
"1573-3882",
"1573-3890"
],
"name": "Metabolomics",
"publisher": "Springer Nature",
"type": "Periodical"
},
{
"issueNumber": "5",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "12"
}
],
"keywords": [
"untargeted mass spectrometry",
"quality control samples",
"better correction",
"study sample",
"injection order",
"control samples",
"regression models",
"metabolomics studies",
"regular intervals",
"more metabolites",
"mass spectrometry",
"effect",
"small number",
"LC-MS",
"metabolites",
"samples",
"metabolomics",
"strategies",
"correction",
"sensitive detection techniques",
"interval",
"better results",
"criteria",
"batch correction",
"study",
"quality criteria",
"different strategies",
"use",
"method",
"information",
"correction strategy",
"inclusion",
"intensity",
"certainty",
"number",
"lead",
"approach",
"trends",
"batch effects",
"results",
"characteristics",
"metabolomics experiments",
"different characteristics",
"untargeted metabolomics experiments",
"labels",
"values",
"spectrometry",
"order",
"technique",
"model",
"experiments",
"GC-MS data sets",
"data sets",
"batch",
"signals",
"distinct behaviors",
"advantages",
"behavior",
"normalization approach",
"normalization method",
"general lead",
"account",
"set",
"peak intensity",
"correction method",
"detection techniques",
"explicit inclusion",
"merits",
"paper",
"small values",
"batch correction methods",
"Arabidopsis thaliana",
"high-quality correction",
"thaliana"
],
"name": "Improved batch correction in untargeted MS-based metabolomics",
"pagination": "88",
"productId": [
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1010819562"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1007/s11306-016-1015-8"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"27073351"
]
}
],
"sameAs": [
"https://doi.org/10.1007/s11306-016-1015-8",
"https://app.dimensions.ai/details/publication/pub.1010819562"
],
"sdDataset": "articles",
"sdDatePublished": "2022-05-10T10:13",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-springernature-scigraph/baseset/20220509/entities/gbq_results/article/article_711.jsonl",
"type": "ScholarlyArticle",
"url": "https://doi.org/10.1007/s11306-016-1015-8"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/s11306-016-1015-8'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/s11306-016-1015-8'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/s11306-016-1015-8'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/s11306-016-1015-8'
This table displays all metadata directly associated to this object as RDF triples.
294 TRIPLES
22 PREDICATES
116 URIs
92 LITERALS
7 BLANK NODES