Ontology type: schema:ScholarlyArticle
Open Access: True
2022-05-06
AUTHORS: Jianzhi Yang, Mark J.P. Chaisson
ABSTRACT: Variant benchmarking is often performed by comparing a test callset to a gold standard set of variants. In repetitive regions of the genome, it may be difficult to establish what is the truth for a call, for example, when different alignment scoring metrics provide equally supported but different variant calls on the same data. Here, we provide an alternative approach, TT-Mars, that takes advantage of the recent production of high-quality haplotype-resolved genome assemblies by providing false discovery rates for variant calls based on how well their call reflects the content of the assembly, rather than comparing calls themselves.
PAGES: 110
http://scigraph.springernature.com/pub.10.1186/s13059-022-02666-2
DOI: http://dx.doi.org/10.1186/s13059-022-02666-2
DIMENSIONS: https://app.dimensions.ai/details/publication/pub.1147714464
PUBMED: https://www.ncbi.nlm.nih.gov/pubmed/35524317
JSON-LD is the canonical representation for SciGraph data.
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information and Computing Sciences",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0806",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information Systems",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Benchmarking",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Genome",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Haplotypes",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "High-Throughput Nucleotide Sequencing",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Polymorphism, Single Nucleotide",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Software",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA",
"id": "http://www.grid.ac/institutes/grid.42505.36",
"name": [
"Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA"
],
"type": "Organization"
},
"familyName": "Yang",
"givenName": "Jianzhi",
"id": "sg:person.012264014074.17",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012264014074.17"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA",
"id": "http://www.grid.ac/institutes/grid.42505.36",
"name": [
"Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA"
],
"type": "Organization"
},
"familyName": "Chaisson",
"givenName": "Mark J.P.",
"id": "sg:person.012610254333.24",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012610254333.24"
],
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1186/s13059-021-02380-5",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1138323905",
"https://doi.org/10.1186/s13059-021-02380-5"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41586-020-2371-0",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1127898772",
"https://doi.org/10.1038/s41586-020-2371-0"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s13059-019-1828-7",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1122758449",
"https://doi.org/10.1186/s13059-019-1828-7"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41586-018-0566-4",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1107222820",
"https://doi.org/10.1038/s41586-018-0566-4"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt.1754",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1019307928",
"https://doi.org/10.1038/nbt.1754"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s12864-015-1479-3",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1052180130",
"https://doi.org/10.1186/s12864-015-1479-3"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41592-018-0054-7",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1105534003",
"https://doi.org/10.1038/s41592-018-0054-7"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41467-018-08148-z",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1113482416",
"https://doi.org/10.1038/s41467-018-08148-z"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41592-018-0001-7",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1103690602",
"https://doi.org/10.1038/s41592-018-0001-7"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s12864-016-2366-2",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1026798286",
"https://doi.org/10.1186/s12864-016-2366-2"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41467-020-18564-9",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1131073477",
"https://doi.org/10.1038/s41467-020-18564-9"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41587-019-0217-9",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1120285368",
"https://doi.org/10.1038/s41587-019-0217-9"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2014-15-6-r84",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1019478599",
"https://doi.org/10.1186/gb-2014-15-6-r84"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2010-11-1-r1",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1012784779",
"https://doi.org/10.1186/gb-2010-11-1-r1"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41467-019-13993-7",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1123915232",
"https://doi.org/10.1038/s41467-019-13993-7"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nature14962",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1049666571",
"https://doi.org/10.1038/nature14962"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41467-016-0009-6",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1083776852",
"https://doi.org/10.1038/s41467-016-0009-6"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s13059-020-02207-9",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1134304494",
"https://doi.org/10.1186/s13059-020-02207-9"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41592-020-01056-5",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1135019727",
"https://doi.org/10.1038/s41592-020-01056-5"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/gim.2017.86",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1086112279",
"https://doi.org/10.1038/gim.2017.86"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41586-020-2287-8",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1127888051",
"https://doi.org/10.1038/s41586-020-2287-8"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s13059-018-1612-0",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1111097116",
"https://doi.org/10.1186/s13059-018-1612-0"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41467-018-07882-8",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1110929190",
"https://doi.org/10.1038/s41467-018-07882-8"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41587-020-0538-8",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1128479298",
"https://doi.org/10.1038/s41587-020-0538-8"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nnano.2009.12",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1006588299",
"https://doi.org/10.1038/nnano.2009.12"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nature15394",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031274231",
"https://doi.org/10.1038/nature15394"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s13073-017-0512-3",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1100201473",
"https://doi.org/10.1186/s13073-017-0512-3"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s13073-018-0606-6",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1110465224",
"https://doi.org/10.1186/s13073-018-0606-6"
],
"type": "CreativeWork"
}
],
"datePublished": "2022-05-06",
"datePublishedReg": "2022-05-06",
"description": "Variant benchmarking is often performed by comparing a test callset to a gold standard set of variants. In repetitive regions of the genome, it may be difficult to establish what is the truth for a call, for example, when different alignment scoring metrics provide equally supported but different variant calls on the same data. Here, we provide an alternative approach, TT-Mars, that takes advantage of the recent production of high-quality haplotype-resolved genome assemblies by providing false discovery rates for variant calls based on how well their call reflects the content of the assembly, rather than comparing calls themselves.",
"genre": "article",
"id": "sg:pub.10.1186/s13059-022-02666-2",
"inLanguage": "en",
"isAccessibleForFree": true,
"isFundedItemOf": [
{
"id": "sg:grant.8557065",
"type": "MonetaryGrant"
},
{
"id": "sg:grant.9753435",
"type": "MonetaryGrant"
}
],
"isPartOf": [
{
"id": "sg:journal.1023439",
"issn": [
"1474-760X",
"1465-6906"
],
"name": "Genome Biology",
"publisher": "Springer Nature",
"type": "Periodical"
},
{
"issueNumber": "1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "23"
}
],
"keywords": [
"gold standard set",
"variant calls",
"same data",
"standard set",
"calls",
"metrics",
"variant assessment",
"benchmarking",
"callsets",
"set",
"different alignments",
"alternative approach",
"truth",
"alignment",
"advantages",
"false discovery rate",
"discovery rate",
"example",
"data",
"genome assembly",
"variants",
"content",
"haplotype-resolved assemblies",
"repetitive regions",
"assembly",
"recent production",
"assessment",
"rate",
"region",
"genome",
"production",
"approach",
"haplotype-resolved genome assemblies"
],
"name": "TT-Mars: structural variants assessment based on haplotype-resolved assemblies",
"pagination": "110",
"productId": [
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1147714464"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1186/s13059-022-02666-2"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"35524317"
]
}
],
"sameAs": [
"https://doi.org/10.1186/s13059-022-02666-2",
"https://app.dimensions.ai/details/publication/pub.1147714464"
],
"sdDataset": "articles",
"sdDatePublished": "2022-06-01T22:25",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-springernature-scigraph/baseset/20220601/entities/gbq_results/article/article_927.jsonl",
"type": "ScholarlyArticle",
"url": "https://doi.org/10.1186/s13059-022-02666-2"
}
]
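As a minimal sketch of how a record like the one above might be consumed, the following Python snippet parses a trimmed copy of the JSON-LD (only a few fields are reproduced, with values copied from the record) and pulls out the title, the author names, and the identifiers listed under "productId". The helper names product_ids and author_names are hypothetical and are not part of any SciGraph tooling.

```python
import json

# Trimmed copy of the record above; only the fields used below are kept.
RECORD_JSON = """
[
  {
    "author": [
      {"familyName": "Yang", "givenName": "Jianzhi", "type": "Person"},
      {"familyName": "Chaisson", "givenName": "Mark J.P.", "type": "Person"}
    ],
    "name": "TT-Mars: structural variants assessment based on haplotype-resolved assemblies",
    "productId": [
      {"name": "dimensions_id", "type": "PropertyValue", "value": ["pub.1147714464"]},
      {"name": "doi", "type": "PropertyValue", "value": ["10.1186/s13059-022-02666-2"]},
      {"name": "pubmed_id", "type": "PropertyValue", "value": ["35524317"]}
    ],
    "type": "ScholarlyArticle"
  }
]
"""


def product_ids(record):
    """Map each productId name to its first value (hypothetical helper)."""
    return {
        entry["name"]: entry["value"][0]
        for entry in record.get("productId", [])
        if entry.get("value")
    }


def author_names(record):
    """Return 'givenName familyName' strings in record order (hypothetical helper)."""
    return [f'{a["givenName"]} {a["familyName"]}' for a in record.get("author", [])]


# The document is a JSON array holding a single record object.
record = json.loads(RECORD_JSON)[0]
ids = product_ids(record)

print(record["name"])
print(author_names(record))
print(ids["doi"], ids["pubmed_id"])
```

The record is treated here as plain JSON, which is enough for simple lookups; resolving the "@context" would need a JSON-LD processor.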
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/s13059-022-02666-2'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/s13059-022-02666-2'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/s13059-022-02666-2'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/s13059-022-02666-2'
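The four curl commands above differ only in their Accept header, so the same content negotiation can be sketched in Python with the standard library. This is an illustration under assumptions: the short format keys ("json-ld", "nt", "turtle", "xml") and the function names build_request and fetch are chosen for this sketch, and it assumes the endpoint still honours these headers. Only the request is built when the module runs; fetch is defined but not called.

```python
import urllib.request

RECORD_URL = "https://scigraph.springernature.com/pub.10.1186/s13059-022-02666-2"

# Accept headers copied from the curl commands above; the keys are
# labels made up for this sketch.
ACCEPT = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(fmt, url=RECORD_URL):
    """Build a GET request asking for the given serialization (no network I/O)."""
    return urllib.request.Request(url, headers={"Accept": ACCEPT[fmt]})


def fetch(fmt, url=RECORD_URL, timeout=30):
    """Send the request and return the response body as text (needs network access)."""
    with urllib.request.urlopen(build_request(fmt, url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


if __name__ == "__main__":
    req = build_request("turtle")
    print(req.full_url, req.get_header("Accept"))
```

Calling fetch("nt") would correspond to the N-Triples curl command; an unknown format key raises KeyError rather than silently falling back to a default.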
The metadata directly associated with this object, expressed as RDF triples, is summarized by the following counts:
242 TRIPLES
22 PREDICATES
93 URIs
57 LITERALS
13 BLANK NODES