Ontology type: schema:ScholarlyArticle Open Access: True
2019-12
AUTHORSMan Tang, Mohammad Shabbir Hasan, Hongxiao Zhu, Liqing Zhang, Xiaowei Wu
ABSTRACTBACKGROUND: Accurate and reliable identification of sequence variants, including single nucleotide polymorphisms (SNPs) and insertion-deletion polymorphisms (INDELs), plays a fundamental role in next-generation sequencing (NGS) applications. Existing methods for calling these variants often make simplified assumptions of positional independence and fail to leverage the dependence between genotypes at nearby loci that is caused by linkage disequilibrium (LD). RESULTS AND CONCLUSION: We propose vi-HMM, a hidden Markov model (HMM)-based method for calling SNPs and INDELs in mapped short-read data. This method allows transitions between hidden states (defined as "SNP," "Ins," "Del," and "Match") of adjacent genomic bases and determines an optimal hidden state path by using the Viterbi algorithm. The inferred hidden state path provides a direct solution to the identification of SNPs and INDELs. Simulation studies show that, under various sequencing depths, vi-HMM outperforms commonly used variant calling methods in terms of sensitivity and F1 score. When applied to the real data, vi-HMM demonstrates higher accuracy in calling SNPs and INDELs. More... »
PAGES9
http://scigraph.springernature.com/pub.10.1186/s40246-019-0194-6
DOIhttp://dx.doi.org/10.1186/s40246-019-0194-6
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1112315693
PUBMEDhttps://www.ncbi.nlm.nih.gov/pubmed/30795817
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Genetics",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Biological Sciences",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "Virginia Tech",
"id": "https://www.grid.ac/institutes/grid.438526.e",
"name": [
"Department of Statistics, Virginia Tech, 250 Drillfield Drive, 24061, Blacksburg, VA, USA"
],
"type": "Organization"
},
"familyName": "Tang",
"givenName": "Man",
"type": "Person"
},
{
"affiliation": {
"alternateName": "Virginia Tech",
"id": "https://www.grid.ac/institutes/grid.438526.e",
"name": [
"Department of Computer Science, Virginia Tech, 225 Stanger Street, 24060, Blacksburg, VA, USA"
],
"type": "Organization"
},
"familyName": "Hasan",
"givenName": "Mohammad Shabbir",
"type": "Person"
},
{
"affiliation": {
"alternateName": "Virginia Tech",
"id": "https://www.grid.ac/institutes/grid.438526.e",
"name": [
"Department of Statistics, Virginia Tech, 250 Drillfield Drive, 24061, Blacksburg, VA, USA"
],
"type": "Organization"
},
"familyName": "Zhu",
"givenName": "Hongxiao",
"type": "Person"
},
{
"affiliation": {
"alternateName": "Virginia Tech",
"id": "https://www.grid.ac/institutes/grid.438526.e",
"name": [
"Department of Computer Science, Virginia Tech, 225 Stanger Street, 24060, Blacksburg, VA, USA"
],
"type": "Organization"
},
"familyName": "Zhang",
"givenName": "Liqing",
"type": "Person"
},
{
"affiliation": {
"alternateName": "Virginia Tech",
"id": "https://www.grid.ac/institutes/grid.438526.e",
"name": [
"Department of Statistics, Virginia Tech, 250 Drillfield Drive, 24061, Blacksburg, VA, USA"
],
"type": "Organization"
},
"familyName": "Wu",
"givenName": "Xiaowei",
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1007/s00439-012-1213-z",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1002421548",
"https://doi.org/10.1007/s00439-012-1213-z"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s00439-012-1213-z",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1002421548",
"https://doi.org/10.1007/s00439-012-1213-z"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/29.1.308",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1005817660"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nmeth.1923",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1006541515",
"https://doi.org/10.1038/nmeth.1923"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btp373",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1008270984"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/ng.806",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1010244476",
"https://doi.org/10.1038/ng.806"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-7-4",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1011318612",
"https://doi.org/10.1186/1471-2105-7-4"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt.2835",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1015051370",
"https://doi.org/10.1038/nbt.2835"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1146/annurev.genom.9.081307.164359",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1015853776"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1101/gr.6151507",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1020576715"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bib/bbs086",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1021271185"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1101/gr.115907.110",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1021970032"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gkt372",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1022061847"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s40246-015-0042-2",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1022669794",
"https://doi.org/10.1186/s40246-015-0042-2"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btp352",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1023014918"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nrg2626",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1023911485",
"https://doi.org/10.1038/nrg2626"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nrg2626",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1023911485",
"https://doi.org/10.1038/nrg2626"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2012-13-2-r15",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1028349968",
"https://doi.org/10.1186/gb-2012-13-2-r15"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.2174/138920209789177575",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1029905740"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/hmg/ddq400",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031822826"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/hmg/ddq400",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031822826"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/bf02458837",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1033367959",
"https://doi.org/10.1007/bf02458837"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/bf02458837",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1033367959",
"https://doi.org/10.1007/bf02458837"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/ng.3036",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1037327856",
"https://doi.org/10.1038/ng.3036"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1101/gr.112326.110",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1039404689"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1101/gr.078212.108",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1047542880"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btt512",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1048188329"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-7-1",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1052642623",
"https://doi.org/10.1186/1471-2105-7-1"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/5.18626",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1061178979"
],
"type": "CreativeWork"
}
],
"datePublished": "2019-12",
"datePublishedReg": "2019-12-01",
"description": "BACKGROUND: Accurate and reliable identification of sequence variants, including single nucleotide polymorphisms (SNPs) and insertion-deletion polymorphisms (INDELs), plays a fundamental role in next-generation sequencing (NGS) applications. Existing methods for calling these variants often make simplified assumptions of positional independence and fail to leverage the dependence between genotypes at nearby loci that is caused by linkage disequilibrium (LD).\nRESULTS AND CONCLUSION: We propose vi-HMM, a hidden Markov model (HMM)-based method for calling SNPs and INDELs in mapped short-read data. This method allows transitions between hidden states (defined as \"SNP,\" \"Ins,\" \"Del,\" and \"Match\") of adjacent genomic bases and determines an optimal hidden state path by using the Viterbi algorithm. The inferred hidden state path provides a direct solution to the identification of SNPs and INDELs. Simulation studies show that, under various sequencing depths, vi-HMM outperforms commonly used variant calling methods in terms of sensitivity and F1 score. When applied to the real data, vi-HMM demonstrates higher accuracy in calling SNPs and INDELs.",
"genre": "research_article",
"id": "sg:pub.10.1186/s40246-019-0194-6",
"inLanguage": [
"en"
],
"isAccessibleForFree": true,
"isFundedItemOf": [
{
"id": "sg:grant.5541338",
"type": "MonetaryGrant"
}
],
"isPartOf": [
{
"id": "sg:journal.1033252",
"issn": [
"1473-9542",
"1479-7364"
],
"name": "Human Genomics",
"type": "Periodical"
},
{
"issueNumber": "1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "13"
}
],
"name": "vi-HMM: a novel HMM-based method for sequence variant identification in short-read data",
"pagination": "9",
"productId": [
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"9c52be42093afe9270aafe99db21cf95eb277ba6181dac642c8001cdd8f8e2e6"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"30795817"
]
},
{
"name": "nlm_unique_id",
"type": "PropertyValue",
"value": [
"101202210"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1186/s40246-019-0194-6"
]
},
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1112315693"
]
}
],
"sameAs": [
"https://doi.org/10.1186/s40246-019-0194-6",
"https://app.dimensions.ai/details/publication/pub.1112315693"
],
"sdDataset": "articles",
"sdDatePublished": "2019-04-11T13:19",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000368_0000000368/records_78959_00000001.jsonl",
"type": "ScholarlyArticle",
"url": "https://link.springer.com/10.1186%2Fs40246-019-0194-6"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/s40246-019-0194-6'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/s40246-019-0194-6'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/s40246-019-0194-6'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/s40246-019-0194-6'
This table displays all metadata directly associated to this object as RDF triples.
181 TRIPLES
21 PREDICATES
54 URIs
21 LITERALS
9 BLANK NODES