Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation


Ontology type: schema:ScholarlyArticle      Open Access: True


Article Info

DATE

2010-05

AUTHORS

Cole Trapnell, Brian A Williams, Geo Pertea, Ali Mortazavi, Gordon Kwan, Marijke J van Baren, Steven L Salzberg, Barbara J Wold, Lior Pachter

ABSTRACT

High-throughput mRNA sequencing (RNA-Seq) promises simultaneous transcript discovery and abundance estimation. However, this would require algorithms that are not restricted by prior gene annotations and that account for alternative transcription and splicing. Here we introduce such algorithms in an open-source software program called Cufflinks. To test Cufflinks, we sequenced and analyzed >430 million paired 75-bp RNA-Seq reads from a mouse myoblast cell line over a differentiation time series. We detected 13,692 known transcripts and 3,724 previously unannotated ones, 62% of which are supported by independent expression data or by homologous genes in other species. Over the time series, 330 genes showed complete switches in the dominant transcription start site (TSS) or splice isoform, and we observed more subtle shifts in 1,304 other genes. These results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.

PAGES

511


Journal

TITLE

Nature Biotechnology

ISSUE

5

VOLUME

28

Identifiers

URI

http://scigraph.springernature.com/pub.10.1038/nbt.1621

DOI

http://dx.doi.org/10.1038/nbt.1621

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1031035095

PUBMED

https://www.ncbi.nlm.nih.gov/pubmed/20436464



JSON-LD is the canonical representation for SciGraph data.


[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Genetics", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Biological Sciences", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Algorithms", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Animals", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Cell Differentiation", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Cell Line", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Gene Expression Profiling", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Genome", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Mice", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Oligonucleotide Array Sequence Analysis", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Protein Isoforms", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Proto-Oncogene Proteins c-myc", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "RNA, Messenger", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Sequence Analysis, RNA", 
        "type": "DefinedTerm"
      }, 
      {
        "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
        "name": "Software", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "University of California, Berkeley", 
          "id": "https://www.grid.ac/institutes/grid.47840.3f", 
          "name": [
            "Department of Computer Science, University of Maryland, College Park, Maryland, USA.", 
            "Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, USA.", 
            "Department of Mathematics, University of California, Berkeley, California, USA."
          ], 
          "type": "Organization"
        }, 
        "familyName": "Trapnell", 
        "givenName": "Cole", 
        "id": "sg:person.01247375013.30", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01247375013.30"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "California Institute of Technology", 
          "id": "https://www.grid.ac/institutes/grid.20861.3d", 
          "name": [
            "Division of Biology and Beckman Institute, California Institute of Technology, Pasadena, California, USA."
          ], 
          "type": "Organization"
        }, 
        "familyName": "Williams", 
        "givenName": "Brian A", 
        "id": "sg:person.0741140331.06", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0741140331.06"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Maryland, College Park", 
          "id": "https://www.grid.ac/institutes/grid.164295.d", 
          "name": [
            "Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, USA."
          ], 
          "type": "Organization"
        }, 
        "familyName": "Pertea", 
        "givenName": "Geo", 
        "id": "sg:person.01024612415.46", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01024612415.46"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "California Institute of Technology", 
          "id": "https://www.grid.ac/institutes/grid.20861.3d", 
          "name": [
            "Division of Biology and Beckman Institute, California Institute of Technology, Pasadena, California, USA."
          ], 
          "type": "Organization"
        }, 
        "familyName": "Mortazavi", 
        "givenName": "Ali", 
        "id": "sg:person.0673025131.06", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0673025131.06"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "California Institute of Technology", 
          "id": "https://www.grid.ac/institutes/grid.20861.3d", 
          "name": [
            "Division of Biology and Beckman Institute, California Institute of Technology, Pasadena, California, USA."
          ], 
          "type": "Organization"
        }, 
        "familyName": "Kwan", 
        "givenName": "Gordon", 
        "id": "sg:person.0774300251.64", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0774300251.64"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Washington University in St. Louis", 
          "id": "https://www.grid.ac/institutes/grid.4367.6", 
          "name": [
            "Genome Sciences Center, Washington University in St. Louis, St. Louis, Missouri, USA."
          ], 
          "type": "Organization"
        }, 
        "familyName": "van Baren", 
        "givenName": "Marijke J", 
        "id": "sg:person.01042023041.45", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01042023041.45"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Maryland, College Park", 
          "id": "https://www.grid.ac/institutes/grid.164295.d", 
          "name": [
            "Department of Computer Science, University of Maryland, College Park, Maryland, USA.", 
            "Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, USA."
          ], 
          "type": "Organization"
        }, 
        "familyName": "Salzberg", 
        "givenName": "Steven L", 
        "id": "sg:person.01223441713.02", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01223441713.02"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "California Institute of Technology", 
          "id": "https://www.grid.ac/institutes/grid.20861.3d", 
          "name": [
            "Division of Biology and Beckman Institute, California Institute of Technology, Pasadena, California, USA."
          ], 
          "type": "Organization"
        }, 
        "familyName": "Wold", 
        "givenName": "Barbara J", 
        "id": "sg:person.01026162263.73", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01026162263.73"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of California, Berkeley", 
          "id": "https://www.grid.ac/institutes/grid.47840.3f", 
          "name": [
            "Department of Mathematics, University of California, Berkeley, California, USA.", 
            "Department of Molecular and Cell Biology, University of California, Berkeley, California, USA.", 
            "Department of Computer Science, University of California, Berkeley, California, USA."
          ], 
          "type": "Organization"
        }, 
        "familyName": "Pachter", 
        "givenName": "Lior", 
        "id": "sg:person.0672165566.75", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0672165566.75"
        ], 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "sg:pub.10.1038/nature08195", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1007963177", 
          "https://doi.org/10.1038/nature08195"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nature07638", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1009442631", 
          "https://doi.org/10.1038/nature07638"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1371/journal.pcbi.1000074", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1010292273"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nmeth.1371", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1011651858", 
          "https://doi.org/10.1038/nmeth.1371"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1093/bioinformatics/btp120", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1012425816"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/gb-2008-9-12-r175", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1016469219", 
          "https://doi.org/10.1186/gb-2008-9-12-r175"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1126/science.1141319", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1018702276"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1093/bioinformatics/btp352", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1023014918"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1128/mcb.6.5.1412", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1025601272"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1242/jcs.004739", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1025879815"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1093/bioinformatics/btp544", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1026453845"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nature07509", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1029002744", 
          "https://doi.org/10.1038/nature07509"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nature05676", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1030633547", 
          "https://doi.org/10.1038/nature05676"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1242/dev.01874", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1032416057"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/s0955-0674(96)80091-3", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1038486419"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1093/bioinformatics/btp113", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1044688303"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1093/bioinformatics/btp692", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1045138418"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nmeth.1226", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1045381177", 
          "https://doi.org/10.1038/nmeth.1226"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1101/gr.079558.108", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1045837493"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nmeth.1223", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1048586936", 
          "https://doi.org/10.1038/nmeth.1223"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/gb-2009-10-3-r25", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1049583368", 
          "https://doi.org/10.1186/gb-2009-10-3-r25"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1093/nar/gkg770", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1050668915"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nature07672", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1051133532", 
          "https://doi.org/10.1038/nature07672"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1111/j.1432-0436.1977.tb01507.x", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1051515183"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/1471-2105-11-94", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1053091615", 
          "https://doi.org/10.1186/1471-2105-11-94"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1126/science.1158441", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1062457766"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.2307/1969503", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1069674881"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2010-05", 
    "datePublishedReg": "2010-05-01", 
    "description": "High-throughput mRNA sequencing (RNA-Seq) promises simultaneous transcript discovery and abundance estimation. However, this would require algorithms that are not restricted by prior gene annotations and that account for alternative transcription and splicing. Here we introduce such algorithms in an open-source software program called Cufflinks. To test Cufflinks, we sequenced and analyzed >430 million paired 75-bp RNA-Seq reads from a mouse myoblast cell line over a differentiation time series. We detected 13,692 known transcripts and 3,724 previously unannotated ones, 62% of which are supported by independent expression data or by homologous genes in other species. Over the time series, 330 genes showed complete switches in the dominant transcription start site (TSS) or splice isoform, and we observed more subtle shifts in 1,304 other genes. These results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.", 
    "genre": "research_article", 
    "id": "sg:pub.10.1038/nbt.1621", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": true, 
    "isFundedItemOf": [
      {
        "id": "sg:grant.2529425", 
        "type": "MonetaryGrant"
      }, 
      {
        "id": "sg:grant.2545461", 
        "type": "MonetaryGrant"
      }, 
      {
        "id": "sg:grant.2519905", 
        "type": "MonetaryGrant"
      }, 
      {
        "id": "sg:grant.2699343", 
        "type": "MonetaryGrant"
      }
    ], 
    "isPartOf": [
      {
        "id": "sg:journal.1115214", 
        "issn": [
          "1087-0156", 
          "1546-1696"
        ], 
        "name": "Nature Biotechnology", 
        "type": "Periodical"
      }, 
      {
        "issueNumber": "5", 
        "type": "PublicationIssue"
      }, 
      {
        "type": "PublicationVolume", 
        "volumeNumber": "28"
      }
    ], 
    "name": "Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation", 
    "pagination": "511", 
    "productId": [
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "93d984f48bd8806cf117d2f18bb9a253fdb7a71a5557f3278a3ff33017297c98"
        ]
      }, 
      {
        "name": "pubmed_id", 
        "type": "PropertyValue", 
        "value": [
          "20436464"
        ]
      }, 
      {
        "name": "nlm_unique_id", 
        "type": "PropertyValue", 
        "value": [
          "9604648"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1038/nbt.1621"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1031035095"
        ]
      }
    ], 
    "sameAs": [
      "https://doi.org/10.1038/nbt.1621", 
      "https://app.dimensions.ai/details/publication/pub.1031035095"
    ], 
    "sdDataset": "articles", 
    "sdDatePublished": "2019-04-10T13:00", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8659_00000435.jsonl", 
    "type": "ScholarlyArticle", 
    "url": "https://www.nature.com/articles/nbt.1621"
  }
]
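
As a rough illustration of how a record shaped like the JSON-LD above might be consumed, the following Python sketch pulls out the author names and the `productId` identifiers. The `SAMPLE` string is a trimmed copy of a few fields from this record, and the `summarize` helper is a hypothetical name introduced here, not part of any SciGraph tooling.

```python
import json

# Trimmed sample with the same shape as the record above (only the fields used here).
SAMPLE = """
[
  {
    "id": "sg:pub.10.1038/nbt.1621",
    "author": [
      {"familyName": "Trapnell", "givenName": "Cole", "type": "Person"},
      {"familyName": "Williams", "givenName": "Brian A", "type": "Person"}
    ],
    "productId": [
      {"name": "pubmed_id", "type": "PropertyValue", "value": ["20436464"]},
      {"name": "doi", "type": "PropertyValue", "value": ["10.1038/nbt.1621"]}
    ]
  }
]
"""


def summarize(record):
    """Hypothetical helper: collect author names and identifiers from one record."""
    authors = [
        f"{a['givenName']} {a['familyName']}" for a in record.get("author", [])
    ]
    # Each productId entry holds a name and a list of values; keep the first value.
    identifiers = {
        p["name"]: p["value"][0] for p in record.get("productId", []) if p.get("value")
    }
    return {"id": record.get("id"), "authors": authors, "identifiers": identifiers}


# The top-level JSON-LD document is a list holding a single record.
record = json.loads(SAMPLE)[0]
summary = summarize(record)
print(summary["authors"])
print(summary["identifiers"]["doi"])
```

Because JSON-LD is plain JSON, the standard `json` module is enough for this kind of field extraction; a dedicated JSON-LD processor would only be needed to expand the `@context`.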
 


HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular linked data format that is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1038/nbt.1621'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1038/nbt.1621'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1038/nbt.1621'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1038/nbt.1621'
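
The four curl commands above differ only in their `Accept` header, so the same content negotiation can be sketched in Python with the standard library. The format keys (`json-ld`, `nt`, `turtle`, `xml`) and the helper names `build_request` and `fetch` are assumptions made for this example; the URL and MIME types are taken from the curl commands. Whether the endpoint still responds is not checked here, so the final lines only build a request and do not send it.

```python
import urllib.request

BASE = "https://scigraph.springernature.com/"

# MIME types copied from the curl commands above; the dictionary keys are hypothetical labels.
ACCEPT = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(pub_id, fmt="json-ld"):
    """Build (but do not send) a GET request asking for one RDF serialization."""
    return urllib.request.Request(BASE + pub_id, headers={"Accept": ACCEPT[fmt]})


def fetch(pub_id, fmt="json-ld", timeout=30):
    """Send the request and return the response body as text (requires network access)."""
    with urllib.request.urlopen(build_request(pub_id, fmt), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


# Inspect the request that would be sent for the Turtle serialization.
req = build_request("pub.10.1038/nbt.1621", "turtle")
print(req.full_url)
print(req.get_header("Accept"))
```

Calling `fetch("pub.10.1038/nbt.1621")` would perform the equivalent of the first curl command, assuming the service is reachable.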


 

This table displays all metadata directly associated with this object as RDF triples.

291 TRIPLES      21 PREDICATES      69 URIs      34 LITERALS      22 BLANK NODES
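
In the triples for this record, `schema:author` points at a blank node, and the author order is carried by a chain of `rdf:first` / `rdf:rest` pairs that ends at `rdf:nil`. The Python sketch below walks that chain to recover the ordered author identifiers. The node and person identifiers are copied from this record's triples; the dictionary layout and the `walk_rdf_list` helper are assumptions made for illustration, not how any particular RDF library represents lists.

```python
RDF_NIL = "rdf:nil"

# Blank node -> (rdf:first, rdf:rest), copied from the triples of this record.
AUTHOR_LIST = {
    "Nfc41054be0ee4232bc67a5b77fdf3440": ("sg:person.01247375013.30", "N55320618c06f4737ba228fbdf2c4a94f"),
    "N55320618c06f4737ba228fbdf2c4a94f": ("sg:person.0741140331.06", "N86d01ecf45c2402a9156c2df8de14ecf"),
    "N86d01ecf45c2402a9156c2df8de14ecf": ("sg:person.01024612415.46", "N659e9b04939b4a2da6105969804599db"),
    "N659e9b04939b4a2da6105969804599db": ("sg:person.0673025131.06", "Ndebe13e7b61a45698bb4e591bfd99796"),
    "Ndebe13e7b61a45698bb4e591bfd99796": ("sg:person.0774300251.64", "Nc13ef451617a454ab17bc07f963995fb"),
    "Nc13ef451617a454ab17bc07f963995fb": ("sg:person.01042023041.45", "N93da9b3f24fc4049a6a64b166a6f1b21"),
    "N93da9b3f24fc4049a6a64b166a6f1b21": ("sg:person.01223441713.02", "Naff555e765a4487984473645a6bf622a"),
    "Naff555e765a4487984473645a6bf622a": ("sg:person.01026162263.73", "Ncf6b59b2006a42bcbd8da4c40115cb4e"),
    "Ncf6b59b2006a42bcbd8da4c40115cb4e": ("sg:person.0672165566.75", RDF_NIL),
}


def walk_rdf_list(head, nodes):
    """Follow rdf:rest links from `head`, collecting each rdf:first value in order."""
    items = []
    seen = set()
    node = head
    while node != RDF_NIL:
        if node in seen:
            # Guard against malformed data that loops back on itself.
            raise ValueError(f"cycle detected at {node}")
        seen.add(node)
        first, rest = nodes[node]
        items.append(first)
        node = rest
    return items


# The head node is the object of schema:author for this publication.
authors = walk_rdf_list("Nfc41054be0ee4232bc67a5b77fdf3440", AUTHOR_LIST)
for position, person in enumerate(authors, start=1):
    print(position, person)
```

The recovered order matches the author order given in the article info, from `sg:person.01247375013.30` (Trapnell) through `sg:person.0672165566.75` (Pachter).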

Subject Predicate Object
1 sg:pub.10.1038/nbt.1621 schema:about N16986e8014c24486a204aba7ffec74b3
2 N34543939cc9d4c108d0b70ca5597b958
3 N48124179428e4e989b4a958974a081cd
4 N4ae447ea2c1f48b0b380a29a35e483dd
5 N68f4f035d1c04c488804e344ab814404
6 N7bd18a2c244a4b3583509177409f63f7
7 N8c89f9877f7e495c9c9f61b50f8a90ed
8 N90b05a8580ca4bb0bf3aed7efb462c4f
9 Nb729ef0d71a249b080e7098f31c01397
10 Nc80d725a10354999b8779f8f666e9a93
11 Ncdf2d6f0fa0447d4802fe0ecbf36cd40
12 Nec4590ddaf984224a84ec31db0a13ca0
13 Nfbb6844533de40e5b5f28a04d910f3de
14 anzsrc-for:06
15 anzsrc-for:0604
16 schema:author Nfc41054be0ee4232bc67a5b77fdf3440
17 schema:citation sg:pub.10.1038/nature05676
18 sg:pub.10.1038/nature07509
19 sg:pub.10.1038/nature07638
20 sg:pub.10.1038/nature07672
21 sg:pub.10.1038/nature08195
22 sg:pub.10.1038/nmeth.1223
23 sg:pub.10.1038/nmeth.1226
24 sg:pub.10.1038/nmeth.1371
25 sg:pub.10.1186/1471-2105-11-94
26 sg:pub.10.1186/gb-2008-9-12-r175
27 sg:pub.10.1186/gb-2009-10-3-r25
28 https://doi.org/10.1016/s0955-0674(96)80091-3
29 https://doi.org/10.1093/bioinformatics/btp113
30 https://doi.org/10.1093/bioinformatics/btp120
31 https://doi.org/10.1093/bioinformatics/btp352
32 https://doi.org/10.1093/bioinformatics/btp544
33 https://doi.org/10.1093/bioinformatics/btp692
34 https://doi.org/10.1093/nar/gkg770
35 https://doi.org/10.1101/gr.079558.108
36 https://doi.org/10.1111/j.1432-0436.1977.tb01507.x
37 https://doi.org/10.1126/science.1141319
38 https://doi.org/10.1126/science.1158441
39 https://doi.org/10.1128/mcb.6.5.1412
40 https://doi.org/10.1242/dev.01874
41 https://doi.org/10.1242/jcs.004739
42 https://doi.org/10.1371/journal.pcbi.1000074
43 https://doi.org/10.2307/1969503
44 schema:datePublished 2010-05
45 schema:datePublishedReg 2010-05-01
46 schema:description High-throughput mRNA sequencing (RNA-Seq) promises simultaneous transcript discovery and abundance estimation. However, this would require algorithms that are not restricted by prior gene annotations and that account for alternative transcription and splicing. Here we introduce such algorithms in an open-source software program called Cufflinks. To test Cufflinks, we sequenced and analyzed >430 million paired 75-bp RNA-Seq reads from a mouse myoblast cell line over a differentiation time series. We detected 13,692 known transcripts and 3,724 previously unannotated ones, 62% of which are supported by independent expression data or by homologous genes in other species. Over the time series, 330 genes showed complete switches in the dominant transcription start site (TSS) or splice isoform, and we observed more subtle shifts in 1,304 other genes. These results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.
47 schema:genre research_article
48 schema:inLanguage en
49 schema:isAccessibleForFree true
50 schema:isPartOf N9caedd2207104f15ab6a358b740929fa
51 Nf5da8dbf3e4242d68aa8781f2981778a
52 sg:journal.1115214
53 schema:name Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation
54 schema:pagination 511
55 schema:productId N01e5aaad4917439381597bb9fee8b11d
56 N371f7d8e641745839c33c3a9e0b05747
57 Naa9a95e8190f4b32a0b540aae2fbb9a8
58 Nb5b7e7cf12d643f0a7e6508166e44246
59 Nc4eebd4c02fa4dfa9144028802f06cca
60 schema:sameAs https://app.dimensions.ai/details/publication/pub.1031035095
61 https://doi.org/10.1038/nbt.1621
62 schema:sdDatePublished 2019-04-10T13:00
63 schema:sdLicense https://scigraph.springernature.com/explorer/license/
64 schema:sdPublisher N6fd119c779ca45a3982903a8de7d3a1e
65 schema:url https://www.nature.com/articles/nbt.1621
66 sgo:license sg:explorer/license/
67 sgo:sdDataset articles
68 rdf:type schema:ScholarlyArticle
69 N01e5aaad4917439381597bb9fee8b11d schema:name pubmed_id
70 schema:value 20436464
71 rdf:type schema:PropertyValue
72 N16986e8014c24486a204aba7ffec74b3 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
73 schema:name Cell Differentiation
74 rdf:type schema:DefinedTerm
75 N34543939cc9d4c108d0b70ca5597b958 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
76 schema:name Software
77 rdf:type schema:DefinedTerm
78 N371f7d8e641745839c33c3a9e0b05747 schema:name doi
79 schema:value 10.1038/nbt.1621
80 rdf:type schema:PropertyValue
81 N48124179428e4e989b4a958974a081cd schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
82 schema:name Protein Isoforms
83 rdf:type schema:DefinedTerm
84 N4ae447ea2c1f48b0b380a29a35e483dd schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
85 schema:name Cell Line
86 rdf:type schema:DefinedTerm
87 N55320618c06f4737ba228fbdf2c4a94f rdf:first sg:person.0741140331.06
88 rdf:rest N86d01ecf45c2402a9156c2df8de14ecf
89 N659e9b04939b4a2da6105969804599db rdf:first sg:person.0673025131.06
90 rdf:rest Ndebe13e7b61a45698bb4e591bfd99796
91 N68f4f035d1c04c488804e344ab814404 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
92 schema:name Mice
93 rdf:type schema:DefinedTerm
94 N6fd119c779ca45a3982903a8de7d3a1e schema:name Springer Nature - SN SciGraph project
95 rdf:type schema:Organization
96 N7bd18a2c244a4b3583509177409f63f7 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
97 schema:name Algorithms
98 rdf:type schema:DefinedTerm
99 N86d01ecf45c2402a9156c2df8de14ecf rdf:first sg:person.01024612415.46
100 rdf:rest N659e9b04939b4a2da6105969804599db
101 N8c89f9877f7e495c9c9f61b50f8a90ed schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
102 schema:name Sequence Analysis, RNA
103 rdf:type schema:DefinedTerm
104 N90b05a8580ca4bb0bf3aed7efb462c4f schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
105 schema:name RNA, Messenger
106 rdf:type schema:DefinedTerm
107 N93da9b3f24fc4049a6a64b166a6f1b21 rdf:first sg:person.01223441713.02
108 rdf:rest Naff555e765a4487984473645a6bf622a
109 N9caedd2207104f15ab6a358b740929fa schema:issueNumber 5
110 rdf:type schema:PublicationIssue
111 Naa9a95e8190f4b32a0b540aae2fbb9a8 schema:name readcube_id
112 schema:value 93d984f48bd8806cf117d2f18bb9a253fdb7a71a5557f3278a3ff33017297c98
113 rdf:type schema:PropertyValue
114 Naff555e765a4487984473645a6bf622a rdf:first sg:person.01026162263.73
115 rdf:rest Ncf6b59b2006a42bcbd8da4c40115cb4e
116 Nb5b7e7cf12d643f0a7e6508166e44246 schema:name nlm_unique_id
117 schema:value 9604648
118 rdf:type schema:PropertyValue
119 Nb729ef0d71a249b080e7098f31c01397 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
120 schema:name Gene Expression Profiling
121 rdf:type schema:DefinedTerm
122 Nc13ef451617a454ab17bc07f963995fb rdf:first sg:person.01042023041.45
123 rdf:rest N93da9b3f24fc4049a6a64b166a6f1b21
124 Nc4eebd4c02fa4dfa9144028802f06cca schema:name dimensions_id
125 schema:value pub.1031035095
126 rdf:type schema:PropertyValue
127 Nc80d725a10354999b8779f8f666e9a93 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
128 schema:name Genome
129 rdf:type schema:DefinedTerm
130 Ncdf2d6f0fa0447d4802fe0ecbf36cd40 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
131 schema:name Oligonucleotide Array Sequence Analysis
132 rdf:type schema:DefinedTerm
133 Ncf6b59b2006a42bcbd8da4c40115cb4e rdf:first sg:person.0672165566.75
134 rdf:rest rdf:nil
135 Ndebe13e7b61a45698bb4e591bfd99796 rdf:first sg:person.0774300251.64
136 rdf:rest Nc13ef451617a454ab17bc07f963995fb
137 Nec4590ddaf984224a84ec31db0a13ca0 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
138 schema:name Animals
139 rdf:type schema:DefinedTerm
140 Nf5da8dbf3e4242d68aa8781f2981778a schema:volumeNumber 28
141 rdf:type schema:PublicationVolume
142 Nfbb6844533de40e5b5f28a04d910f3de schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
143 schema:name Proto-Oncogene Proteins c-myc
144 rdf:type schema:DefinedTerm
145 Nfc41054be0ee4232bc67a5b77fdf3440 rdf:first sg:person.01247375013.30
146 rdf:rest N55320618c06f4737ba228fbdf2c4a94f
147 anzsrc-for:06 schema:inDefinedTermSet anzsrc-for:
148 schema:name Biological Sciences
149 rdf:type schema:DefinedTerm
150 anzsrc-for:0604 schema:inDefinedTermSet anzsrc-for:
151 schema:name Genetics
152 rdf:type schema:DefinedTerm
153 sg:grant.2519905 http://pending.schema.org/fundedItem sg:pub.10.1038/nbt.1621
154 rdf:type schema:MonetaryGrant
155 sg:grant.2529425 http://pending.schema.org/fundedItem sg:pub.10.1038/nbt.1621
156 rdf:type schema:MonetaryGrant
157 sg:grant.2545461 http://pending.schema.org/fundedItem sg:pub.10.1038/nbt.1621
158 rdf:type schema:MonetaryGrant
159 sg:grant.2699343 http://pending.schema.org/fundedItem sg:pub.10.1038/nbt.1621
160 rdf:type schema:MonetaryGrant
161 sg:journal.1115214 schema:issn 1087-0156
162 1546-1696
163 schema:name Nature Biotechnology
164 rdf:type schema:Periodical
165 sg:person.01024612415.46 schema:affiliation https://www.grid.ac/institutes/grid.164295.d
166 schema:familyName Pertea
167 schema:givenName Geo
168 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01024612415.46
169 rdf:type schema:Person
170 sg:person.01026162263.73 schema:affiliation https://www.grid.ac/institutes/grid.20861.3d
171 schema:familyName Wold
172 schema:givenName Barbara J
173 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01026162263.73
174 rdf:type schema:Person
175 sg:person.01042023041.45 schema:affiliation https://www.grid.ac/institutes/grid.4367.6
176 schema:familyName van Baren
177 schema:givenName Marijke J
178 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01042023041.45
179 rdf:type schema:Person
180 sg:person.01223441713.02 schema:affiliation https://www.grid.ac/institutes/grid.164295.d
181 schema:familyName Salzberg
182 schema:givenName Steven L
183 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01223441713.02
184 rdf:type schema:Person
185 sg:person.01247375013.30 schema:affiliation https://www.grid.ac/institutes/grid.47840.3f
186 schema:familyName Trapnell
187 schema:givenName Cole
188 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01247375013.30
189 rdf:type schema:Person
190 sg:person.0672165566.75 schema:affiliation https://www.grid.ac/institutes/grid.47840.3f
191 schema:familyName Pachter
192 schema:givenName Lior
193 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0672165566.75
194 rdf:type schema:Person
195 sg:person.0673025131.06 schema:affiliation https://www.grid.ac/institutes/grid.20861.3d
196 schema:familyName Mortazavi
197 schema:givenName Ali
198 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0673025131.06
199 rdf:type schema:Person
200 sg:person.0741140331.06 schema:affiliation https://www.grid.ac/institutes/grid.20861.3d
201 schema:familyName Williams
202 schema:givenName Brian A
203 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0741140331.06
204 rdf:type schema:Person
205 sg:person.0774300251.64 schema:affiliation https://www.grid.ac/institutes/grid.20861.3d
206 schema:familyName Kwan
207 schema:givenName Gordon
208 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0774300251.64
209 rdf:type schema:Person
210 sg:pub.10.1038/nature05676 schema:sameAs https://app.dimensions.ai/details/publication/pub.1030633547
211 https://doi.org/10.1038/nature05676
212 rdf:type schema:CreativeWork
213 sg:pub.10.1038/nature07509 schema:sameAs https://app.dimensions.ai/details/publication/pub.1029002744
214 https://doi.org/10.1038/nature07509
215 rdf:type schema:CreativeWork
216 sg:pub.10.1038/nature07638 schema:sameAs https://app.dimensions.ai/details/publication/pub.1009442631
217 https://doi.org/10.1038/nature07638
218 rdf:type schema:CreativeWork
219 sg:pub.10.1038/nature07672 schema:sameAs https://app.dimensions.ai/details/publication/pub.1051133532
220 https://doi.org/10.1038/nature07672
221 rdf:type schema:CreativeWork
222 sg:pub.10.1038/nature08195 schema:sameAs https://app.dimensions.ai/details/publication/pub.1007963177
223 https://doi.org/10.1038/nature08195
224 rdf:type schema:CreativeWork
225 sg:pub.10.1038/nmeth.1223 schema:sameAs https://app.dimensions.ai/details/publication/pub.1048586936
226 https://doi.org/10.1038/nmeth.1223
227 rdf:type schema:CreativeWork
228 sg:pub.10.1038/nmeth.1226 schema:sameAs https://app.dimensions.ai/details/publication/pub.1045381177
229 https://doi.org/10.1038/nmeth.1226
230 rdf:type schema:CreativeWork
231 sg:pub.10.1038/nmeth.1371 schema:sameAs https://app.dimensions.ai/details/publication/pub.1011651858
232 https://doi.org/10.1038/nmeth.1371
233 rdf:type schema:CreativeWork
234 sg:pub.10.1186/1471-2105-11-94 schema:sameAs https://app.dimensions.ai/details/publication/pub.1053091615
235 https://doi.org/10.1186/1471-2105-11-94
236 rdf:type schema:CreativeWork
237 sg:pub.10.1186/gb-2008-9-12-r175 schema:sameAs https://app.dimensions.ai/details/publication/pub.1016469219
238 https://doi.org/10.1186/gb-2008-9-12-r175
239 rdf:type schema:CreativeWork
240 sg:pub.10.1186/gb-2009-10-3-r25 schema:sameAs https://app.dimensions.ai/details/publication/pub.1049583368
241 https://doi.org/10.1186/gb-2009-10-3-r25
242 rdf:type schema:CreativeWork
243 https://doi.org/10.1016/s0955-0674(96)80091-3 schema:sameAs https://app.dimensions.ai/details/publication/pub.1038486419
244 rdf:type schema:CreativeWork
245 https://doi.org/10.1093/bioinformatics/btp113 schema:sameAs https://app.dimensions.ai/details/publication/pub.1044688303
246 rdf:type schema:CreativeWork
247 https://doi.org/10.1093/bioinformatics/btp120 schema:sameAs https://app.dimensions.ai/details/publication/pub.1012425816
248 rdf:type schema:CreativeWork
249 https://doi.org/10.1093/bioinformatics/btp352 schema:sameAs https://app.dimensions.ai/details/publication/pub.1023014918
250 rdf:type schema:CreativeWork
251 https://doi.org/10.1093/bioinformatics/btp544 schema:sameAs https://app.dimensions.ai/details/publication/pub.1026453845
252 rdf:type schema:CreativeWork
253 https://doi.org/10.1093/bioinformatics/btp692 schema:sameAs https://app.dimensions.ai/details/publication/pub.1045138418
254 rdf:type schema:CreativeWork
255 https://doi.org/10.1093/nar/gkg770 schema:sameAs https://app.dimensions.ai/details/publication/pub.1050668915
256 rdf:type schema:CreativeWork
257 https://doi.org/10.1101/gr.079558.108 schema:sameAs https://app.dimensions.ai/details/publication/pub.1045837493
258 rdf:type schema:CreativeWork
259 https://doi.org/10.1111/j.1432-0436.1977.tb01507.x schema:sameAs https://app.dimensions.ai/details/publication/pub.1051515183
260 rdf:type schema:CreativeWork
261 https://doi.org/10.1126/science.1141319 schema:sameAs https://app.dimensions.ai/details/publication/pub.1018702276
262 rdf:type schema:CreativeWork
263 https://doi.org/10.1126/science.1158441 schema:sameAs https://app.dimensions.ai/details/publication/pub.1062457766
264 rdf:type schema:CreativeWork
265 https://doi.org/10.1128/mcb.6.5.1412 schema:sameAs https://app.dimensions.ai/details/publication/pub.1025601272
266 rdf:type schema:CreativeWork
267 https://doi.org/10.1242/dev.01874 schema:sameAs https://app.dimensions.ai/details/publication/pub.1032416057
268 rdf:type schema:CreativeWork
269 https://doi.org/10.1242/jcs.004739 schema:sameAs https://app.dimensions.ai/details/publication/pub.1025879815
270 rdf:type schema:CreativeWork
271 https://doi.org/10.1371/journal.pcbi.1000074 schema:sameAs https://app.dimensions.ai/details/publication/pub.1010292273
272 rdf:type schema:CreativeWork
273 https://doi.org/10.2307/1969503 schema:sameAs https://app.dimensions.ai/details/publication/pub.1069674881
274 rdf:type schema:CreativeWork
275 https://www.grid.ac/institutes/grid.164295.d schema:alternateName University of Maryland, College Park
276 schema:name Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, USA.
277 Department of Computer Science, University of Maryland, College Park, Maryland, USA.
278 rdf:type schema:Organization
279 https://www.grid.ac/institutes/grid.20861.3d schema:alternateName California Institute of Technology
280 schema:name Division of Biology and Beckman Institute, California Institute of Technology, Pasadena, California, USA.
281 rdf:type schema:Organization
282 https://www.grid.ac/institutes/grid.4367.6 schema:alternateName Washington University in St. Louis
283 schema:name Genome Sciences Center, Washington University in St. Louis, St. Louis, Missouri, USA.
284 rdf:type schema:Organization
285 https://www.grid.ac/institutes/grid.47840.3f schema:alternateName University of California, Berkeley
286 schema:name Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, USA.
287 Department of Computer Science, University of California, Berkeley, California, USA.
288 Department of Computer Science, University of Maryland, College Park, Maryland, USA.
289 Department of Mathematics, University of California, Berkeley, California, USA.
290 Department of Molecular and Cell Biology, University of California, Berkeley, California, USA.
291 rdf:type schema:Organization
 



