Ontology type: schema:ScholarlyArticle Open Access: True
2017-12
AUTHORSCeline Everaert, Manuel Luypaert, Jesper L. V. Maag, Quek Xiu Cheng, Marcel E. Dinger, Jan Hellemans, Pieter Mestdagh
ABSTRACTRNA-sequencing has become the gold standard for whole-transcriptome gene expression quantification. Multiple algorithms have been developed to derive gene counts from sequencing reads. While a number of benchmarking studies have been conducted, the question remains how individual methods perform at accurately quantifying gene expression levels from RNA-sequencing reads. We performed an independent benchmarking study using RNA-sequencing data from the well established MAQCA and MAQCB reference samples. RNA-sequencing reads were processed using five workflows (Tophat-HTSeq, Tophat-Cufflinks, STAR-HTSeq, Kallisto and Salmon) and resulting gene expression measurements were compared to expression data generated by wet-lab validated qPCR assays for all protein coding genes. All methods showed high gene expression correlations with qPCR data. When comparing gene expression fold changes between MAQCA and MAQCB samples, about 85% of the genes showed consistent results between RNA-sequencing and qPCR data. Of note, each method revealed a small but specific gene set with inconsistent expression measurements. A significant proportion of these method-specific inconsistent genes were reproducibly identified in independent datasets. These genes were typically smaller, had fewer exons, and were lower expressed compared to genes with consistent expression measurements. We propose that careful validation is warranted when evaluating RNA-seq based expression profiles for this specific gene set. More... »
PAGES1559
http://scigraph.springernature.com/pub.10.1038/s41598-017-01617-3
DOIhttp://dx.doi.org/10.1038/s41598-017-01617-3
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1085213169
PUBMEDhttps://www.ncbi.nlm.nih.gov/pubmed/28484260
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Genetics",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Biological Sciences",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "Ghent University",
"id": "https://www.grid.ac/institutes/grid.5342.0",
"name": [
"Center for Medical Genetics, Ghent University, Ghent, Belgium",
"Cancer Research Institute Ghent, Ghent University, Ghent, Belgium",
"Bioinformatics Institute Ghent N2N, Ghent University, Ghent, Belgium"
],
"type": "Organization"
},
"familyName": "Everaert",
"givenName": "Celine",
"id": "sg:person.016034630741.15",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016034630741.15"
],
"type": "Person"
},
{
"affiliation": {
"name": [
"Biogazelle, Ghent, Belgium"
],
"type": "Organization"
},
"familyName": "Luypaert",
"givenName": "Manuel",
"id": "sg:person.013066207627.77",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013066207627.77"
],
"type": "Person"
},
{
"affiliation": {
"name": [
"Kinghorn Cancer Center, Sydney, Australia"
],
"type": "Organization"
},
"familyName": "Maag",
"givenName": "Jesper L. V.",
"id": "sg:person.01306110600.10",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01306110600.10"
],
"type": "Person"
},
{
"affiliation": {
"name": [
"Kinghorn Cancer Center, Sydney, Australia"
],
"type": "Organization"
},
"familyName": "Cheng",
"givenName": "Quek Xiu",
"type": "Person"
},
{
"affiliation": {
"name": [
"Kinghorn Cancer Center, Sydney, Australia"
],
"type": "Organization"
},
"familyName": "Dinger",
"givenName": "Marcel E.",
"id": "sg:person.01335510163.78",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01335510163.78"
],
"type": "Person"
},
{
"affiliation": {
"name": [
"Biogazelle, Ghent, Belgium"
],
"type": "Organization"
},
"familyName": "Hellemans",
"givenName": "Jan",
"id": "sg:person.01337400337.70",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01337400337.70"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Ghent University",
"id": "https://www.grid.ac/institutes/grid.5342.0",
"name": [
"Center for Medical Genetics, Ghent University, Ghent, Belgium",
"Cancer Research Institute Ghent, Ghent University, Ghent, Belgium",
"Bioinformatics Institute Ghent N2N, Ghent University, Ghent, Belgium"
],
"type": "Organization"
},
"familyName": "Mestdagh",
"givenName": "Pieter",
"id": "sg:person.01174241331.86",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01174241331.86"
],
"type": "Person"
}
],
"citation": [
{
"id": "https://doi.org/10.1016/j.molcel.2004.12.004",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1000823839"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt.2862",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1011219673",
"https://doi.org/10.1038/nbt.2862"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btp120",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1012425816"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s13059-014-0550-8",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1015222646",
"https://doi.org/10.1186/s13059-014-0550-8"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s13059-014-0550-8",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1015222646",
"https://doi.org/10.1186/s13059-014-0550-8"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gkv007",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1016098431"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btp616",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1023247882"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt.3519",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1024493480",
"https://doi.org/10.1038/nbt.3519"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s13059-015-0734-x",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1025237425",
"https://doi.org/10.1186/s13059-015-0734-x"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s13059-016-1060-7",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1026981892",
"https://doi.org/10.1186/s13059-016-1060-7"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s13059-016-1060-7",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1026981892",
"https://doi.org/10.1186/s13059-016-1060-7"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt.2957",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1027683701",
"https://doi.org/10.1038/nbt.2957"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nprot.2012.016",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1030124536",
"https://doi.org/10.1038/nprot.2012.016"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-7-276",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031404240",
"https://doi.org/10.1186/1471-2105-7-276"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s12064-012-0162-3",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1036112334",
"https://doi.org/10.1007/s12064-012-0162-3"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt1239",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1037875102",
"https://doi.org/10.1038/nbt1239"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt1239",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1037875102",
"https://doi.org/10.1038/nbt1239"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s13059-016-0940-1",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1044116170",
"https://doi.org/10.1186/s13059-016-0940-1"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2009-10-6-r64",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1044394180",
"https://doi.org/10.1186/gb-2009-10-6-r64"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-8-461",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1044855683",
"https://doi.org/10.1186/1471-2105-8-461"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/srep16923",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1045031591",
"https://doi.org/10.1038/srep16923"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nmeth.1226",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1045381177",
"https://doi.org/10.1038/nmeth.1226"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nmeth.3014",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1045449900",
"https://doi.org/10.1038/nmeth.3014"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btu638",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1053282140"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/bts635",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1053365587"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/embc.2013.6609583",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1078796277"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nmeth.4197",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1084129290",
"https://doi.org/10.1038/nmeth.4197"
],
"type": "CreativeWork"
}
],
"datePublished": "2017-12",
"datePublishedReg": "2017-12-01",
"description": "RNA-sequencing has become the gold standard for whole-transcriptome gene expression quantification. Multiple algorithms have been developed to derive gene counts from sequencing reads. While a number of benchmarking studies have been conducted, the question remains how individual methods perform at accurately quantifying gene expression levels from RNA-sequencing reads. We performed an independent benchmarking study using RNA-sequencing data from the well established MAQCA and MAQCB reference samples. RNA-sequencing reads were processed using five workflows (Tophat-HTSeq, Tophat-Cufflinks, STAR-HTSeq, Kallisto and Salmon) and resulting gene expression measurements were compared to expression data generated by wet-lab validated qPCR assays for all protein coding genes. All methods showed high gene expression correlations with qPCR data. When comparing gene expression fold changes between MAQCA and MAQCB samples, about 85% of the genes showed consistent results between RNA-sequencing and qPCR data. Of note, each method revealed a small but specific gene set with inconsistent expression measurements. A significant proportion of these method-specific inconsistent genes were reproducibly identified in independent datasets. These genes were typically smaller, had fewer exons, and were lower expressed compared to genes with consistent expression measurements. We propose that careful validation is warranted when evaluating RNA-seq based expression profiles for this specific gene set.",
"genre": "research_article",
"id": "sg:pub.10.1038/s41598-017-01617-3",
"inLanguage": [
"en"
],
"isAccessibleForFree": true,
"isPartOf": [
{
"id": "sg:journal.1045337",
"issn": [
"2045-2322"
],
"name": "Scientific Reports",
"type": "Periodical"
},
{
"issueNumber": "1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "7"
}
],
"name": "Benchmarking of RNA-sequencing analysis workflows using whole-transcriptome RT-qPCR expression data",
"pagination": "1559",
"productId": [
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"8c50c7368d39b0052c6ad18965ea8600cb8782a2c7e6afdc1dcefa22f8752f3f"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"28484260"
]
},
{
"name": "nlm_unique_id",
"type": "PropertyValue",
"value": [
"101563288"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1038/s41598-017-01617-3"
]
},
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1085213169"
]
}
],
"sameAs": [
"https://doi.org/10.1038/s41598-017-01617-3",
"https://app.dimensions.ai/details/publication/pub.1085213169"
],
"sdDataset": "articles",
"sdDatePublished": "2019-04-11T00:30",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8695_00000600.jsonl",
"type": "ScholarlyArticle",
"url": "https://www.nature.com/articles/s41598-017-01617-3"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1038/s41598-017-01617-3'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1038/s41598-017-01617-3'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1038/s41598-017-01617-3'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1038/s41598-017-01617-3'
This table displays all metadata directly associated to this object as RDF triples.
210 TRIPLES
21 PREDICATES
53 URIs
21 LITERALS
9 BLANK NODES