2018-05-03
AUTHORS ABSTRACTLong-read sequencing technologies have become increasingly popular due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast Saccharomyces cerevisiae has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here, we present a modular computational framework named long-read sequencing data analysis for yeasts (LRSDAY), the first one-stop solution that streamlines this process. Starting from the raw sequencing reads, LRSDAY can produce chromosome-level genome assembly and comprehensive genome annotation in a highly automated manner with minimal manual intervention, which is not possible using any alternative tool available to date. The annotated genomic features include centromeres, protein-coding genes, tRNAs, transposable elements (TEs), and telomere-associated elements. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable to virtually any eukaryotic organism. When applying LRSDAY to an S. cerevisiae strain, it takes ∼41 h to generate a complete and well-annotated genome from ∼100× Pacific Biosciences (PacBio) running the basic workflow with four threads. Basic experience working within the Linux command-line environment is recommended for carrying out the analysis using LRSDAY. More... »
PAGES1213
http://scigraph.springernature.com/pub.10.1038/nprot.2018.025
DOIhttp://dx.doi.org/10.1038/nprot.2018.025
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1103764887
PUBMEDhttps://www.ncbi.nlm.nih.gov/pubmed/29725120
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Genetics",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Biological Sciences",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "Institute of Research on Cancer and Aging in Nice",
"id": "https://www.grid.ac/institutes/grid.463830.a",
"name": [
"Universit\u00e9 C\u00f4te d'Azur, CNRS, INSERM, IRCAN, Nice, France."
],
"type": "Organization"
},
"familyName": "Yue",
"givenName": "Jia-Xing",
"id": "sg:person.0646257562.14",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0646257562.14"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Institute of Research on Cancer and Aging in Nice",
"id": "https://www.grid.ac/institutes/grid.463830.a",
"name": [
"Universit\u00e9 C\u00f4te d'Azur, CNRS, INSERM, IRCAN, Nice, France."
],
"type": "Organization"
},
"familyName": "Liti",
"givenName": "Gianni",
"id": "sg:person.01140234414.73",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01140234414.73"
],
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1038/nmeth.2474",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1002897135",
"https://doi.org/10.1038/nmeth.2474"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1126/science.aae0344",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1007526601"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btw152",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1008120144"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btu280",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1014952353"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gkr1293",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1016726853"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.cell.2016.08.020",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1017989680"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nmeth.4035",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1019059120",
"https://doi.org/10.1038/nmeth.4035"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1371/journal.pone.0112963",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1019307347"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt.1754",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1019307928",
"https://doi.org/10.1038/nbt.1754"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1371/journal.pone.0092621",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1022509597"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gkh379",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1029024910"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2008-9-1-r7",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1029779393",
"https://doi.org/10.1186/gb-2008-9-1-r7"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-13-237",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1030139446",
"https://doi.org/10.1186/1471-2105-13-237"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1101/gr.185538.114",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031054093"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1534/g3.116.029389",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031056399"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1534/g3.116.029389",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031056399"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nature15714",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031293561",
"https://doi.org/10.1038/nature15714"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1101/gr.107524.110",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1032096953"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1111/mec.13341",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1032840841"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/molbev/msu037",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1032915689"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt.2280",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1033360952",
"https://doi.org/10.1038/nbt.2280"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2164-9-614",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1036294383",
"https://doi.org/10.1186/1471-2164-9-614"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-12-491",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1039713283",
"https://doi.org/10.1186/1471-2105-12-491"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-12-491",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1039713283",
"https://doi.org/10.1186/1471-2105-12-491"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt.3238",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1040313779",
"https://doi.org/10.1038/nbt.3238"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1101/gr.10.4.516",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1040860989"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gki487",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1044813607"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/978-3-642-40453-5_17",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1050868498",
"https://doi.org/10.1007/978-3-642-40453-5_17"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1101/gr.191395.115",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1051820620"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1126/science.274.5287.546",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1062554574"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1126/science.283.5405.1168",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1062564255"
],
"type": "CreativeWork"
},
{
"id": "https://app.dimensions.ai/details/publication/pub.1077141234",
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/ng.3802",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1084129143",
"https://doi.org/10.1038/ng.3802"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/gigascience/giw018",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1084184782"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1101/gr.215087.116",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1084197434"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/ng.3847",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1084862877",
"https://doi.org/10.1038/ng.3847"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/ng.3847",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1084862877",
"https://doi.org/10.1038/ng.3847"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nature22380",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1085463905",
"https://doi.org/10.1038/nature22380"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nature22380",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1085463905",
"https://doi.org/10.1038/nature22380"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41598-017-03996-z",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1086055771",
"https://doi.org/10.1038/s41598-017-03996-z"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt.4060",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1100685340",
"https://doi.org/10.1038/nbt.4060"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/s41586-018-0030-5",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1103250606",
"https://doi.org/10.1038/s41586-018-0030-5"
],
"type": "CreativeWork"
}
],
"datePublished": "2018-05-03",
"datePublishedReg": "2018-05-03",
"description": "Long-read sequencing technologies have become increasingly popular due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast Saccharomyces cerevisiae has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here, we present a modular computational framework named long-read sequencing data analysis for yeasts (LRSDAY), the first one-stop solution that streamlines this process. Starting from the raw sequencing reads, LRSDAY can produce chromosome-level genome assembly and comprehensive genome annotation in a highly automated manner with minimal manual intervention, which is not possible using any alternative tool available to date. The annotated genomic features include centromeres, protein-coding genes, tRNAs, transposable elements (TEs), and telomere-associated elements. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable to virtually any eukaryotic organism. When applying LRSDAY to an S. cerevisiae strain, it takes \u223c41 h to generate a complete and well-annotated genome from \u223c100\u00d7 Pacific Biosciences (PacBio) running the basic workflow with four threads. Basic experience working within the Linux command-line environment is recommended for carrying out the analysis using LRSDAY.",
"genre": "research_article",
"id": "sg:pub.10.1038/nprot.2018.025",
"inLanguage": [
"en"
],
"isAccessibleForFree": false,
"isFundedItemOf": [
{
"id": "sg:grant.7738556",
"type": "MonetaryGrant"
},
{
"id": "sg:grant.3797357",
"type": "MonetaryGrant"
},
{
"id": "sg:grant.3732087",
"type": "MonetaryGrant"
}
],
"isPartOf": [
{
"id": "sg:journal.1037502",
"issn": [
"1754-2189",
"1750-2799"
],
"name": "Nature Protocols",
"type": "Periodical"
},
{
"issueNumber": "6",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "13"
}
],
"name": "Long-read sequencing data analysis for yeasts",
"pagination": "1213",
"productId": [
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"76d3e967d5d7076aa7ccbd290455d5748912eae7119ab9110638cd6af705b7fc"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"29725120"
]
},
{
"name": "nlm_unique_id",
"type": "PropertyValue",
"value": [
"101284307"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1038/nprot.2018.025"
]
},
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1103764887"
]
}
],
"sameAs": [
"https://doi.org/10.1038/nprot.2018.025",
"https://app.dimensions.ai/details/publication/pub.1103764887"
],
"sdDataset": "articles",
"sdDatePublished": "2019-04-11T14:34",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000373_0000000373/records_13109_00000002.jsonl",
"type": "ScholarlyArticle",
"url": "https://www.nature.com/articles/nprot.2018.025"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1038/nprot.2018.025'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1038/nprot.2018.025'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1038/nprot.2018.025'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1038/nprot.2018.025'
This table displays all metadata directly associated to this object as RDF triples.
212 TRIPLES
21 PREDICATES
66 URIs
20 LITERALS
9 BLANK NODES