Ontology type: schema:ScholarlyArticle Open Access: True
2019-01-28
AUTHORSThomas D. S. Sutton, Adam G. Clooney, Feargal J. Ryan, R. Paul Ross, Colin Hill
ABSTRACTBackgroundThe viral component of microbial communities plays a vital role in driving bacterial diversity, facilitating nutrient turnover and shaping community composition. Despite their importance, the vast majority of viral sequences are poorly annotated and share little or no homology to reference databases. As a result, investigation of the viral metagenome (virome) relies heavily on de novo assembly of short sequencing reads to recover compositional and functional information. Metagenomic assembly is particularly challenging for virome data, often resulting in fragmented assemblies and poor recovery of viral community members. Despite the essential role of assembly in virome analysis and difficulties posed by these data, current assembly comparisons have been limited to subsections of virome studies or bacterial datasets.DesignThis study presents the most comprehensive virome assembly comparison to date, featuring 16 metagenomic assembly approaches which have featured in human virome studies. Assemblers were assessed using four independent virome datasets, namely, simulated reads, two mock communities, viromes spiked with a known phage and human gut viromes.ResultsAssembly performance varied significantly across all test datasets, with SPAdes (meta) performing consistently well. Performance of MIRA and VICUNA varied, highlighting the importance of using a range of datasets when comparing assembly programs. It was also found that while some assemblers addressed the challenges of virome data better than others, all assemblers had limitations. Low read coverage and genomic repeats resulted in assemblies with poor genome recovery, high degrees of fragmentation and low-accuracy contigs across all assemblers. These limitations must be considered when setting thresholds for downstream analysis and when drawing conclusions from virome data. More... »
PAGES12
http://scigraph.springernature.com/pub.10.1186/s40168-019-0626-5
DOIhttp://dx.doi.org/10.1186/s40168-019-0626-5
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1111751750
PUBMEDhttps://www.ncbi.nlm.nih.gov/pubmed/30691529
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Biological Sciences",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Genetics",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Bacteriophages",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Databases, Factual",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Gastrointestinal Microbiome",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Gene Library",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Genome, Viral",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Humans",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Sequence Analysis, DNA",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "School for Microbiology, University College Cork, Cork, Ireland",
"id": "http://www.grid.ac/institutes/grid.7872.a",
"name": [
"APC Microbiome Ireland, Cork, Ireland",
"School for Microbiology, University College Cork, Cork, Ireland"
],
"type": "Organization"
},
"familyName": "Sutton",
"givenName": "Thomas D. S.",
"id": "sg:person.015670045004.66",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.015670045004.66"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "School for Microbiology, University College Cork, Cork, Ireland",
"id": "http://www.grid.ac/institutes/grid.7872.a",
"name": [
"APC Microbiome Ireland, Cork, Ireland",
"School for Microbiology, University College Cork, Cork, Ireland"
],
"type": "Organization"
},
"familyName": "Clooney",
"givenName": "Adam G.",
"id": "sg:person.01271720363.91",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01271720363.91"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Present Address: South Australian Health and Medical Research Institute, Adelaide, Australia",
"id": "http://www.grid.ac/institutes/grid.430453.5",
"name": [
"APC Microbiome Ireland, Cork, Ireland",
"School for Microbiology, University College Cork, Cork, Ireland",
"Present Address: South Australian Health and Medical Research Institute, Adelaide, Australia"
],
"type": "Organization"
},
"familyName": "Ryan",
"givenName": "Feargal J.",
"id": "sg:person.01041657615.09",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01041657615.09"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Teagasc Food Research Centre, Fermoy, Cork, Ireland",
"id": "http://www.grid.ac/institutes/grid.6435.4",
"name": [
"APC Microbiome Ireland, Cork, Ireland",
"School for Microbiology, University College Cork, Cork, Ireland",
"Teagasc Food Research Centre, Fermoy, Cork, Ireland"
],
"type": "Organization"
},
"familyName": "Ross",
"givenName": "R. Paul",
"id": "sg:person.014605670062.43",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014605670062.43"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "School for Microbiology, University College Cork, Cork, Ireland",
"id": "http://www.grid.ac/institutes/grid.7872.a",
"name": [
"APC Microbiome Ireland, Cork, Ireland",
"School for Microbiology, University College Cork, Cork, Ireland"
],
"type": "Organization"
},
"familyName": "Hill",
"givenName": "Colin",
"id": "sg:person.01202224511.99",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01202224511.99"
],
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1186/1471-2164-13-475",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1008570077",
"https://doi.org/10.1186/1471-2164-13-475"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2164-15-989",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1040304359",
"https://doi.org/10.1186/1471-2164-15-989"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s40168-018-0446-z",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1103148891",
"https://doi.org/10.1186/s40168-018-0446-z"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nmeth1043",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1047202519",
"https://doi.org/10.1038/nmeth1043"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s13100-017-0095-y",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1092064437",
"https://doi.org/10.1186/s13100-017-0095-y"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/ismej.2008.35",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1047643385",
"https://doi.org/10.1038/ismej.2008.35"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/ncomms5498",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1047569017",
"https://doi.org/10.1038/ncomms5498"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s12250-016-3896-0",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1085125783",
"https://doi.org/10.1007/s12250-016-3896-0"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s12864-017-3679-5",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1084803333",
"https://doi.org/10.1186/s12864-017-3679-5"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s12864-017-4429-4",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1100433735",
"https://doi.org/10.1186/s12864-017-4429-4"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/srep19233",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1051309616",
"https://doi.org/10.1038/srep19233"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-15-126",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1021130525",
"https://doi.org/10.1186/1471-2105-15-126"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nm.3950",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1044921131",
"https://doi.org/10.1038/nm.3950"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2164-15-37",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1013038725",
"https://doi.org/10.1186/1471-2164-15-37"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/2047-217x-1-18",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1016521225",
"https://doi.org/10.1186/2047-217x-1-18"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2012-13-12-r122",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1019755831",
"https://doi.org/10.1186/gb-2012-13-12-r122"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nmeth.4458",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1092065319",
"https://doi.org/10.1038/nmeth.4458"
],
"type": "CreativeWork"
}
],
"datePublished": "2019-01-28",
"datePublishedReg": "2019-01-28",
"description": "BackgroundThe viral component of microbial communities plays a vital role in driving bacterial diversity, facilitating nutrient turnover and shaping community composition. Despite their importance, the vast majority of viral sequences are poorly annotated and share little or no homology to reference databases. As a result, investigation of the viral metagenome (virome) relies heavily on de novo assembly of short sequencing reads to recover compositional and functional information. Metagenomic assembly is particularly challenging for virome data, often resulting in fragmented assemblies and poor recovery of viral community members. Despite the essential role of assembly in virome analysis and difficulties posed by these data, current assembly comparisons have been limited to subsections of virome studies or bacterial datasets.DesignThis study presents the most comprehensive virome assembly comparison to date, featuring 16 metagenomic assembly approaches which have featured in human virome studies. Assemblers were assessed using four independent virome datasets, namely, simulated reads, two mock communities, viromes spiked with a known phage and human gut viromes.ResultsAssembly performance varied significantly across all test datasets, with SPAdes (meta) performing consistently well. Performance of MIRA and VICUNA varied, highlighting the importance of using a range of datasets when comparing assembly programs. It was also found that while some assemblers addressed the challenges of virome data better than others, all assemblers had limitations. Low read coverage and genomic repeats resulted in assemblies with poor genome recovery, high degrees of fragmentation and low-accuracy contigs across all assemblers. These limitations must be considered when setting thresholds for downstream analysis and when drawing conclusions from virome data.",
"genre": "article",
"id": "sg:pub.10.1186/s40168-019-0626-5",
"inLanguage": "en",
"isAccessibleForFree": true,
"isFundedItemOf": [
{
"id": "sg:grant.7670448",
"type": "MonetaryGrant"
},
{
"id": "sg:grant.3984108",
"type": "MonetaryGrant"
}
],
"isPartOf": [
{
"id": "sg:journal.1048878",
"issn": [
"2049-2618"
],
"name": "Microbiome",
"publisher": "Springer Nature",
"type": "Periodical"
},
{
"issueNumber": "1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "7"
}
],
"keywords": [
"range of datasets",
"virome data",
"metagenomic assembly approaches",
"test dataset",
"assembly software",
"simulated reads",
"assembly programs",
"dataset",
"assembly comparison",
"assemblers",
"bacterial datasets",
"reference database",
"short sequencing reads",
"downstream analysis",
"metagenomic assembly",
"vital role",
"sequencing reads",
"software",
"genomic repeats",
"performance",
"functional information",
"critical impact",
"fragmented assemblies",
"data",
"database",
"reads",
"information",
"SPADE",
"limitations",
"challenges",
"virome studies",
"community",
"de novo assembly",
"high degree",
"viral metagenomes",
"coverage",
"mock communities",
"community composition",
"novo assembly",
"difficulties",
"assembly approach",
"microbial communities",
"community members",
"bacterial diversity",
"nutrient turnover",
"genome recovery",
"subsections",
"virome analysis",
"program",
"sequence",
"human gut",
"viral components",
"analysis",
"comparison",
"Mira",
"components",
"vast majority",
"viral sequences",
"essential role",
"results",
"importance",
"choice",
"assembly",
"metagenomes",
"contigs",
"homology",
"diversity",
"repeats",
"virome",
"phages",
"impact",
"vicuna",
"role",
"degree",
"turnover",
"study",
"date",
"gut",
"fragmentation",
"members",
"range",
"recovery",
"characterisation",
"composition",
"majority",
"investigation",
"conclusion",
"approach",
"poor recovery",
"DesignThis study"
],
"name": "Choice of assembly software has a critical impact on virome characterisation",
"pagination": "12",
"productId": [
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1111751750"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1186/s40168-019-0626-5"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"30691529"
]
}
],
"sameAs": [
"https://doi.org/10.1186/s40168-019-0626-5",
"https://app.dimensions.ai/details/publication/pub.1111751750"
],
"sdDataset": "articles",
"sdDatePublished": "2022-05-20T07:35",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-springernature-scigraph/baseset/20220519/entities/gbq_results/article/article_802.jsonl",
"type": "ScholarlyArticle",
"url": "https://doi.org/10.1186/s40168-019-0626-5"
}
]
Download the RDF metadata as:Â json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/s40168-019-0626-5'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/s40168-019-0626-5'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/s40168-019-0626-5'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/s40168-019-0626-5'
This table displays all metadata directly associated to this object as RDF triples.
290 TRIPLES
22 PREDICATES
140 URIs
115 LITERALS
14 BLANK NODES