SeqAnt: A web service to rapidly identify and annotate DNA sequence variations


Ontology type: schema:ScholarlyArticle      Open Access: True


Article Info

DATE

2010-09-20

AUTHORS

Amol Carl Shetty, Prashanth Athri, Kajari Mondal, Vanessa L Horner, Karyn Meltz Steinberg, Viren Patel, Tamara Caspary, David J Cutler, Michael E Zwick

ABSTRACT

Background: The enormous throughput and low cost of second-generation sequencing platforms now allow research and clinical geneticists to routinely perform single experiments that identify tens of thousands to millions of variant sites. Existing methods to annotate variant sites using information from publicly available databases via web browsers are too slow to be useful for the large sequencing datasets being routinely generated by geneticists. Because sequence annotation of variant sites is required before functional characterization can proceed, the lack of a high-throughput pipeline to efficiently annotate variant sites can act as a significant bottleneck in genetics research.

Results: SeqAnt (Sequence Annotator) is an open source web service and software package that rapidly annotates DNA sequence variants and identifies recessive or compound heterozygous loci in human, mouse, fly, and worm genome sequencing experiments. Variants are characterized with respect to their functional type, frequency, and evolutionary conservation. Annotated variants can be viewed in a web browser, downloaded as a tab-delimited text file, or directly uploaded in BED format to the UCSC genome browser. To demonstrate the speed of SeqAnt, we annotated a series of publicly available datasets that ranged in size from 37 to 3,439,107 variant sites. The total time to completely annotate these data ranged from 0.17 seconds to 28 minutes 49.8 seconds.

Conclusion: SeqAnt is an open source web service and software package that overcomes a critical bottleneck facing research and clinical geneticists using second-generation sequencing platforms. SeqAnt will prove especially useful for those investigators who lack dedicated bioinformatics personnel or infrastructure in their laboratories.

PAGES

471

References to SciGraph publications

  • 2001-02-15. Initial sequencing and analysis of the human genome in NATURE
  • 2007-10-14. Direct selection of human genomic loci by microarray hybridization in NATURE METHODS
  • 2008-10-09. Next-generation DNA sequencing in NATURE BIOTECHNOLOGY
  • 2008-04. The complete genome of an individual by massively parallel DNA sequencing in NATURE
  • 2009-11-13. Exome sequencing identifies the cause of a mendelian disorder in NATURE GENETICS
  • 2005-01. Direct genomic selection in NATURE METHODS
  • 2010-01-28. Target-enrichment strategies for next-generation sequencing in NATURE METHODS
  • 2009-02-01. Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing in NATURE BIOTECHNOLOGY
  • 2007-10-14. Multiplex amplification of large sets of human exons in NATURE METHODS
  • 1993-01. A point mutation in the FMR-1 gene associated with fragile X mental retardation in NATURE GENETICS
  • 2009-08-10. Single-molecule sequencing of an individual human genome in NATURE BIOTECHNOLOGY
  • 2009-08-16. Targeted capture and massively parallel sequencing of 12 human exomes in NATURE
  • 2009-07-08. A highly annotated whole-genome sequence of a Korean individual in NATURE
  • 2007-10-14. Microarray-based genomic selection for high-throughput resequencing in NATURE METHODS
  • 2004-05. Advanced sequencing technologies: methods and goals in NATURE REVIEWS GENETICS
  • 2008-11. Accurate whole human genome sequencing using reversible terminator chemistry in NATURE
  • 2007-11-04. Genome-wide in situ exon capture for selective resequencing in NATURE GENETICS

Identifiers

    URI

    http://scigraph.springernature.com/pub.10.1186/1471-2105-11-471

    DOI

    http://dx.doi.org/10.1186/1471-2105-11-471

    DIMENSIONS

    https://app.dimensions.ai/details/publication/pub.1015906429

    PUBMED

    https://www.ncbi.nlm.nih.gov/pubmed/20854673



    JSON-LD is the canonical representation for SciGraph data.


    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Biological Sciences", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Genetics", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Animals", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Base Sequence", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Databases, Genetic", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Genetic Variation", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Genomics", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Humans", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Internet", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Mice", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Molecular Sequence Annotation", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Sequence Analysis, DNA", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Software", 
            "type": "DefinedTerm"
          }
        ], 
        "author": [
          {
            "affiliation": {
              "alternateName": "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA", 
              "id": "http://www.grid.ac/institutes/grid.189967.8", 
              "name": [
                "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Shetty", 
            "givenName": "Amol Carl", 
            "id": "sg:person.0705703554.98", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0705703554.98"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA", 
              "id": "http://www.grid.ac/institutes/grid.189967.8", 
              "name": [
                "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Athri", 
            "givenName": "Prashanth", 
            "id": "sg:person.01214413052.03", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01214413052.03"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA", 
              "id": "http://www.grid.ac/institutes/grid.189967.8", 
              "name": [
                "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Mondal", 
            "givenName": "Kajari", 
            "id": "sg:person.0632333233.12", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0632333233.12"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA", 
              "id": "http://www.grid.ac/institutes/grid.189967.8", 
              "name": [
                "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Horner", 
            "givenName": "Vanessa L", 
            "id": "sg:person.01300060075.86", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01300060075.86"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Graduate Program in Population Biology, Ecology and Evolution, Emory University, Atlanta, GA, USA", 
              "id": "http://www.grid.ac/institutes/grid.189967.8", 
              "name": [
                "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA", 
                "Graduate Program in Population Biology, Ecology and Evolution, Emory University, Atlanta, GA, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Steinberg", 
            "givenName": "Karyn Meltz", 
            "id": "sg:person.01366500323.34", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01366500323.34"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA", 
              "id": "http://www.grid.ac/institutes/grid.189967.8", 
              "name": [
                "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Patel", 
            "givenName": "Viren", 
            "id": "sg:person.01172134543.75", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01172134543.75"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA", 
              "id": "http://www.grid.ac/institutes/grid.189967.8", 
              "name": [
                "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Caspary", 
            "givenName": "Tamara", 
            "id": "sg:person.0655032652.43", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0655032652.43"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA", 
              "id": "http://www.grid.ac/institutes/grid.189967.8", 
              "name": [
                "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Cutler", 
            "givenName": "David J", 
            "id": "sg:person.015711065457.76", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.015711065457.76"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA", 
              "id": "http://www.grid.ac/institutes/grid.189967.8", 
              "name": [
                "Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Zwick", 
            "givenName": "Michael E", 
            "id": "sg:person.01177236633.13", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01177236633.13"
            ], 
            "type": "Person"
          }
        ], 
        "citation": [
          {
            "id": "sg:pub.10.1038/35057062", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1042854081", 
              "https://doi.org/10.1038/35057062"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/ng.2007.42", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1012402265", 
              "https://doi.org/10.1038/ng.2007.42"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nmeth.1419", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1042265948", 
              "https://doi.org/10.1038/nmeth.1419"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nbt.1523", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1040653661", 
              "https://doi.org/10.1038/nbt.1523"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature06884", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1047672670", 
              "https://doi.org/10.1038/nature06884"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nmeth1110", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1045927978", 
              "https://doi.org/10.1038/nmeth1110"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature08250", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1038593056", 
              "https://doi.org/10.1038/nature08250"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/ng.499", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1021123766", 
              "https://doi.org/10.1038/ng.499"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nmeth0105-63", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1051721477", 
              "https://doi.org/10.1038/nmeth0105-63"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nbt.1561", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1008087076", 
              "https://doi.org/10.1038/nbt.1561"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature08211", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1030217088", 
              "https://doi.org/10.1038/nature08211"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/ng0193-31", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1018483390", 
              "https://doi.org/10.1038/ng0193-31"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nmeth1109", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1009647980", 
              "https://doi.org/10.1038/nmeth1109"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature07517", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1052925719", 
              "https://doi.org/10.1038/nature07517"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nmeth1111", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1025042325", 
              "https://doi.org/10.1038/nmeth1111"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nbt1486", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1005954516", 
              "https://doi.org/10.1038/nbt1486"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nrg1325", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1041884975", 
              "https://doi.org/10.1038/nrg1325"
            ], 
            "type": "CreativeWork"
          }
        ], 
        "datePublished": "2010-09-20", 
        "datePublishedReg": "2010-09-20", 
        "description": "BackgroundThe enormous throughput and low cost of second-generation sequencing platforms now allow research and clinical geneticists to routinely perform single experiments that identify tens of thousands to millions of variant sites. Existing methods to annotate variant sites using information from publicly available databases via web browsers are too slow to be useful for the large sequencing datasets being routinely generated by geneticists. Because sequence annotation of variant sites is required before functional characterization can proceed, the lack of a high-throughput pipeline to efficiently annotate variant sites can act as a significant bottleneck in genetics research.ResultsSeqAnt (Seq uence An notator) is an open source web service and software package that rapidly annotates DNA sequence variants and identifies recessive or compound heterozygous loci in human, mouse, fly, and worm genome sequencing experiments. Variants are characterized with respect to their functional type, frequency, and evolutionary conservation. Annotated variants can be viewed on a web browser, downloaded in a tab-delimited text file, or directly uploaded in a BED format to the UCSC genome browser. To demonstrate the speed of SeqAnt, we annotated a series of publicly available datasets that ranged in size from 37 to 3,439,107 variant sites. The total time to completely annotate these data completely ranged from 0.17 seconds to 28 minutes 49.8 seconds.ConclusionSeqAnt is an open source web service and software package that overcomes a critical bottleneck facing research and clinical geneticists using second-generation sequencing platforms. SeqAnt will prove especially useful for those investigators who lack dedicated bioinformatics personnel or infrastructure in their laboratories.", 
        "genre": "article", 
        "id": "sg:pub.10.1186/1471-2105-11-471", 
        "isAccessibleForFree": true, 
        "isFundedItemOf": [
          {
            "id": "sg:grant.2686632", 
            "type": "MonetaryGrant"
          }, 
          {
            "id": "sg:grant.2705144", 
            "type": "MonetaryGrant"
          }, 
          {
            "id": "sg:grant.2423340", 
            "type": "MonetaryGrant"
          }, 
          {
            "id": "sg:grant.2686701", 
            "type": "MonetaryGrant"
          }, 
          {
            "id": "sg:grant.2705241", 
            "type": "MonetaryGrant"
          }, 
          {
            "id": "sg:grant.2551135", 
            "type": "MonetaryGrant"
          }, 
          {
            "id": "sg:grant.2423243", 
            "type": "MonetaryGrant"
          }, 
          {
            "id": "sg:grant.2384583", 
            "type": "MonetaryGrant"
          }
        ], 
        "isPartOf": [
          {
            "id": "sg:journal.1023786", 
            "issn": [
              "1471-2105"
            ], 
            "name": "BMC Bioinformatics", 
            "publisher": "Springer Nature", 
            "type": "Periodical"
          }, 
          {
            "issueNumber": "1", 
            "type": "PublicationIssue"
          }, 
          {
            "type": "PublicationVolume", 
            "volumeNumber": "11"
          }
        ], 
        "keywords": [
          "open-source web service", 
          "web services", 
          "web browser", 
          "software package", 
          "tab-delimited text file", 
          "large sequencing datasets", 
          "enormous throughput", 
          "text files", 
          "available datasets", 
          "browser", 
          "genome sequencing experiments", 
          "UCSC Genome Browser", 
          "high-throughput pipeline", 
          "tens of thousands", 
          "significant bottleneck", 
          "genome browser", 
          "available databases", 
          "sequence annotation", 
          "critical bottleneck", 
          "BED format", 
          "dataset", 
          "services", 
          "bottleneck", 
          "platform", 
          "second-generation sequencing platforms", 
          "sequencing datasets", 
          "sequencing experiments", 
          "low cost", 
          "annotation", 
          "package", 
          "throughput", 
          "files", 
          "infrastructure", 
          "format", 
          "sequencing platforms", 
          "pipeline", 
          "database", 
          "seconds", 
          "information", 
          "total time", 
          "millions", 
          "research", 
          "thousands", 
          "experiments", 
          "cost", 
          "variant sites", 
          "speed", 
          "variants", 
          "identifies", 
          "data", 
          "method", 
          "personnel", 
          "tens", 
          "time", 
          "single experiment", 
          "humans", 
          "lack", 
          "respect", 
          "clinical geneticists", 
          "types", 
          "geneticists", 
          "size", 
          "laboratory", 
          "series", 
          "genetic research", 
          "investigators", 
          "sites", 
          "variation", 
          "DNA sequence variants", 
          "sequence variants", 
          "DNA sequence variation", 
          "frequency", 
          "conservation", 
          "functional types", 
          "characterization", 
          "evolutionary conservation", 
          "heterozygous loci", 
          "functional characterization", 
          "sequence variation", 
          "loci", 
          "mice"
        ], 
        "name": "SeqAnt: A web service to rapidly identify and annotate DNA sequence variations", 
        "pagination": "471", 
        "productId": [
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "pub.1015906429"
            ]
          }, 
          {
            "name": "doi", 
            "type": "PropertyValue", 
            "value": [
              "10.1186/1471-2105-11-471"
            ]
          }, 
          {
            "name": "pubmed_id", 
            "type": "PropertyValue", 
            "value": [
              "20854673"
            ]
          }
        ], 
        "sameAs": [
          "https://doi.org/10.1186/1471-2105-11-471", 
          "https://app.dimensions.ai/details/publication/pub.1015906429"
        ], 
        "sdDataset": "articles", 
        "sdDatePublished": "2022-10-01T06:36", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-springernature-scigraph/baseset/20221001/entities/gbq_results/article/article_519.jsonl", 
        "type": "ScholarlyArticle", 
        "url": "https://doi.org/10.1186/1471-2105-11-471"
      }
    ]
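Because the record above is ordinary JSON, it can be read with any JSON parser. The following is a minimal sketch, assuming a trimmed excerpt of the record (two of the nine authors, two of the three `productId` entries); the names `SAMPLE` and `summarize` are illustrative and not part of SciGraph.

```python
import json

# Trimmed excerpt of the record shown above (assumption: only a few
# fields and two authors are kept to keep the example short).
SAMPLE = """
[
  {
    "name": "SeqAnt: A web service to rapidly identify and annotate DNA sequence variations",
    "datePublished": "2010-09-20",
    "author": [
      {"familyName": "Shetty", "givenName": "Amol Carl", "type": "Person"},
      {"familyName": "Athri", "givenName": "Prashanth", "type": "Person"}
    ],
    "productId": [
      {"name": "doi", "type": "PropertyValue", "value": ["10.1186/1471-2105-11-471"]},
      {"name": "pubmed_id", "type": "PropertyValue", "value": ["20854673"]}
    ]
  }
]
"""


def summarize(records):
    """Pull title, date, author names, and identifiers from the first record."""
    rec = records[0]
    # Each productId entry maps an identifier name to a one-element list.
    ids = {p["name"]: p["value"][0] for p in rec.get("productId", [])}
    authors = [f'{a["givenName"]} {a["familyName"]}' for a in rec.get("author", [])]
    return {
        "title": rec["name"],
        "date": rec.get("datePublished"),
        "authors": authors,
        "ids": ids,
    }


summary = summarize(json.loads(SAMPLE))
print(summary["title"])
print(summary["authors"])
print(summary["ids"])
```

The same function should work on the full record, since it only relies on the `name`, `datePublished`, `author`, and `productId` keys that appear in it.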
     


    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-11-471'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-11-471'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-11-471'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-11-471'
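The four curl commands above differ only in their `Accept` header, so the same content negotiation can be sketched in Python with the standard library. This is a minimal sketch under stated assumptions: the names `ACCEPT_TYPES`, `build_request`, and `fetch` are illustrative, and whether the endpoint still responds is not something this record confirms.

```python
import urllib.request

# URL taken from the curl commands above.
RECORD_URL = "https://scigraph.springernature.com/pub.10.1186/1471-2105-11-471"

# Media types copied from the curl commands; the short keys are
# hypothetical labels chosen for this example.
ACCEPT_TYPES = {
    "json-ld": "application/ld+json",
    "n-triples": "application/n-triples",
    "turtle": "text/turtle",
    "rdf-xml": "application/rdf+xml",
}


def build_request(fmt, url=RECORD_URL):
    """Build a GET request whose Accept header selects the RDF serialization."""
    return urllib.request.Request(url, headers={"Accept": ACCEPT_TYPES[fmt]})


def fetch(fmt, url=RECORD_URL, timeout=30):
    """Send the request and return the response body as text (needs network)."""
    with urllib.request.urlopen(build_request(fmt, url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


# Building the request does not touch the network, so it is safe to inspect.
req = build_request("turtle")
print(req.get_method(), req.full_url)
print(req.get_header("Accept"))
```

Calling `fetch("json-ld")` would perform the actual download; it is left uncalled here so the sketch runs offline.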


     

    This table displays all metadata directly associated with this object as RDF triples.

    327 TRIPLES      21 PREDICATES      134 URIs      109 LITERALS      18 BLANK NODES

    Subject Predicate Object
    1 sg:pub.10.1186/1471-2105-11-471 schema:about N1be4f0d665a34a5082168438e6326b69
    2 N2a49e40dabe44a228389909198996b46
    3 N2e0234e2f511485e990905c1bd2e5f1b
    4 N405c0179a38e442fbaa3d8bf96af79a5
    5 N5a2d2453a6654f41a48da575840da464
    6 N6a9d31b7b2584cb98a4a1edca70deb25
    7 N90c80091c8544c3db6321128cac973fe
    8 Naba28e7348624bf3afc942888779263f
    9 Nb9a99864b00a44cfb5bfd3e0d43cbf9c
    10 Ncb11b834801044318cd49decb24f8aae
    11 Nfdaf1a2fa76c422a947c07ff53b63eed
    12 anzsrc-for:06
    13 anzsrc-for:0604
    14 schema:author Ne1c55ec83b244aa1ad7d1ee65185bdeb
    15 schema:citation sg:pub.10.1038/35057062
    16 sg:pub.10.1038/nature06884
    17 sg:pub.10.1038/nature07517
    18 sg:pub.10.1038/nature08211
    19 sg:pub.10.1038/nature08250
    20 sg:pub.10.1038/nbt.1523
    21 sg:pub.10.1038/nbt.1561
    22 sg:pub.10.1038/nbt1486
    23 sg:pub.10.1038/ng.2007.42
    24 sg:pub.10.1038/ng.499
    25 sg:pub.10.1038/ng0193-31
    26 sg:pub.10.1038/nmeth.1419
    27 sg:pub.10.1038/nmeth0105-63
    28 sg:pub.10.1038/nmeth1109
    29 sg:pub.10.1038/nmeth1110
    30 sg:pub.10.1038/nmeth1111
    31 sg:pub.10.1038/nrg1325
    32 schema:datePublished 2010-09-20
    33 schema:datePublishedReg 2010-09-20
    34 schema:description BackgroundThe enormous throughput and low cost of second-generation sequencing platforms now allow research and clinical geneticists to routinely perform single experiments that identify tens of thousands to millions of variant sites. Existing methods to annotate variant sites using information from publicly available databases via web browsers are too slow to be useful for the large sequencing datasets being routinely generated by geneticists. Because sequence annotation of variant sites is required before functional characterization can proceed, the lack of a high-throughput pipeline to efficiently annotate variant sites can act as a significant bottleneck in genetics research.ResultsSeqAnt (Seq uence An notator) is an open source web service and software package that rapidly annotates DNA sequence variants and identifies recessive or compound heterozygous loci in human, mouse, fly, and worm genome sequencing experiments. Variants are characterized with respect to their functional type, frequency, and evolutionary conservation. Annotated variants can be viewed on a web browser, downloaded in a tab-delimited text file, or directly uploaded in a BED format to the UCSC genome browser. To demonstrate the speed of SeqAnt, we annotated a series of publicly available datasets that ranged in size from 37 to 3,439,107 variant sites. The total time to completely annotate these data completely ranged from 0.17 seconds to 28 minutes 49.8 seconds.ConclusionSeqAnt is an open source web service and software package that overcomes a critical bottleneck facing research and clinical geneticists using second-generation sequencing platforms. SeqAnt will prove especially useful for those investigators who lack dedicated bioinformatics personnel or infrastructure in their laboratories.
    35 schema:genre article
    36 schema:isAccessibleForFree true
    37 schema:isPartOf N02b8839f5a6a49349ce1100f13cd08a9
    38 N180cba74b8624668af3c4baf6a38c7df
    39 sg:journal.1023786
    40 schema:keywords BED format
    41 DNA sequence variants
    42 DNA sequence variation
    43 UCSC Genome Browser
    44 annotation
    45 available databases
    46 available datasets
    47 bottleneck
    48 browser
    49 characterization
    50 clinical geneticists
    51 conservation
    52 cost
    53 critical bottleneck
    54 data
    55 database
    56 dataset
    57 enormous throughput
    58 evolutionary conservation
    59 experiments
    60 files
    61 format
    62 frequency
    63 functional characterization
    64 functional types
    65 genetic research
    66 geneticists
    67 genome browser
    68 genome sequencing experiments
    69 heterozygous loci
    70 high-throughput pipeline
    71 humans
    72 identifies
    73 information
    74 infrastructure
    75 investigators
    76 laboratory
    77 lack
    78 large sequencing datasets
    79 loci
    80 low cost
    81 method
    82 mice
    83 millions
    84 open-source web service
    85 package
    86 personnel
    87 pipeline
    88 platform
    89 research
    90 respect
    91 second-generation sequencing platforms
    92 seconds
    93 sequence annotation
    94 sequence variants
    95 sequence variation
    96 sequencing datasets
    97 sequencing experiments
    98 sequencing platforms
    99 series
    100 services
    101 significant bottleneck
    102 single experiment
    103 sites
    104 size
    105 software package
    106 speed
    107 tab-delimited text file
    108 tens
    109 tens of thousands
    110 text files
    111 thousands
    112 throughput
    113 time
    114 total time
    115 types
    116 variant sites
    117 variants
    118 variation
    119 web browser
    120 web services
    121 schema:name SeqAnt: A web service to rapidly identify and annotate DNA sequence variations
    122 schema:pagination 471
    123 schema:productId N3d1eba4c5ade4082902a7dbee9893c77
    124 N69ae19cd8cef4611b9f788f98122ef78
    125 N8d694a1023984ef6b81936feac5eb7fd
    126 schema:sameAs https://app.dimensions.ai/details/publication/pub.1015906429
    127 https://doi.org/10.1186/1471-2105-11-471
    128 schema:sdDatePublished 2022-10-01T06:36
    129 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    130 schema:sdPublisher N7d18649aa48848d6ab33a4a16eaf627a
    131 schema:url https://doi.org/10.1186/1471-2105-11-471
    132 sgo:license sg:explorer/license/
    133 sgo:sdDataset articles
    134 rdf:type schema:ScholarlyArticle
    135 N02b8839f5a6a49349ce1100f13cd08a9 schema:volumeNumber 11
    136 rdf:type schema:PublicationVolume
    137 N065840bc799148b8a4f8d07284a5fcf7 rdf:first sg:person.01177236633.13
    138 rdf:rest rdf:nil
    139 N0d32d9d892b34fbd9a451b97c6318c85 rdf:first sg:person.0655032652.43
    140 rdf:rest N9d57a9c72d7d4c2ca6ec3be9f1b32290
    141 N0fed71db1d3147c5b3c64531ec63577a rdf:first sg:person.01300060075.86
    142 rdf:rest N81c009e948bf4a83b9122c6d21b172f1
    143 N180cba74b8624668af3c4baf6a38c7df schema:issueNumber 1
    144 rdf:type schema:PublicationIssue
    145 N1be4f0d665a34a5082168438e6326b69 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    146 schema:name Base Sequence
    147 rdf:type schema:DefinedTerm
    148 N2a49e40dabe44a228389909198996b46 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    149 schema:name Genetic Variation
    150 rdf:type schema:DefinedTerm
    151 N2e0234e2f511485e990905c1bd2e5f1b schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    152 schema:name Humans
    153 rdf:type schema:DefinedTerm
    154 N3d1eba4c5ade4082902a7dbee9893c77 schema:name doi
    155 schema:value 10.1186/1471-2105-11-471
    156 rdf:type schema:PropertyValue
    157 N405c0179a38e442fbaa3d8bf96af79a5 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    158 schema:name Genomics
    159 rdf:type schema:DefinedTerm
    160 N5a2d2453a6654f41a48da575840da464 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    161 schema:name Software
    162 rdf:type schema:DefinedTerm
    163 N69ae19cd8cef4611b9f788f98122ef78 schema:name dimensions_id
    164 schema:value pub.1015906429
    165 rdf:type schema:PropertyValue
    166 N6a9d31b7b2584cb98a4a1edca70deb25 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    167 schema:name Animals
    168 rdf:type schema:DefinedTerm
    169 N6e1d3ebf75b647c3845e6d300ef756aa rdf:first sg:person.01172134543.75
    170 rdf:rest N0d32d9d892b34fbd9a451b97c6318c85
    171 N7d18649aa48848d6ab33a4a16eaf627a schema:name Springer Nature - SN SciGraph project
    172 rdf:type schema:Organization
    173 N81c009e948bf4a83b9122c6d21b172f1 rdf:first sg:person.01366500323.34
    174 rdf:rest N6e1d3ebf75b647c3845e6d300ef756aa
    175 N8d694a1023984ef6b81936feac5eb7fd schema:name pubmed_id
    176 schema:value 20854673
    177 rdf:type schema:PropertyValue
    178 N90c80091c8544c3db6321128cac973fe schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    179 schema:name Sequence Analysis, DNA
    180 rdf:type schema:DefinedTerm
    181 N9d57a9c72d7d4c2ca6ec3be9f1b32290 rdf:first sg:person.015711065457.76
    182 rdf:rest N065840bc799148b8a4f8d07284a5fcf7
    183 Na9eddc65012b42029569bd12b83986ba rdf:first sg:person.0632333233.12
    184 rdf:rest N0fed71db1d3147c5b3c64531ec63577a
    185 Naba28e7348624bf3afc942888779263f schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    186 schema:name Internet
    187 rdf:type schema:DefinedTerm
    188 Nb9a99864b00a44cfb5bfd3e0d43cbf9c schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    189 schema:name Molecular Sequence Annotation
    190 rdf:type schema:DefinedTerm
    191 Ncb11b834801044318cd49decb24f8aae schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    192 schema:name Mice
    193 rdf:type schema:DefinedTerm
    194 Nd7d14467c2b74a43b235e57cb64cb798 rdf:first sg:person.01214413052.03
    195 rdf:rest Na9eddc65012b42029569bd12b83986ba
    196 Ne1c55ec83b244aa1ad7d1ee65185bdeb rdf:first sg:person.0705703554.98
    197 rdf:rest Nd7d14467c2b74a43b235e57cb64cb798
    198 Nfdaf1a2fa76c422a947c07ff53b63eed schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    199 schema:name Databases, Genetic
    200 rdf:type schema:DefinedTerm
    201 anzsrc-for:06 schema:inDefinedTermSet anzsrc-for:
    202 schema:name Biological Sciences
    203 rdf:type schema:DefinedTerm
    204 anzsrc-for:0604 schema:inDefinedTermSet anzsrc-for:
    205 schema:name Genetics
    206 rdf:type schema:DefinedTerm
    207 sg:grant.2384583 http://pending.schema.org/fundedItem sg:pub.10.1186/1471-2105-11-471
    208 rdf:type schema:MonetaryGrant
    209 sg:grant.2423243 http://pending.schema.org/fundedItem sg:pub.10.1186/1471-2105-11-471
    210 rdf:type schema:MonetaryGrant
    211 sg:grant.2423340 http://pending.schema.org/fundedItem sg:pub.10.1186/1471-2105-11-471
    212 rdf:type schema:MonetaryGrant
    213 sg:grant.2551135 http://pending.schema.org/fundedItem sg:pub.10.1186/1471-2105-11-471
    214 rdf:type schema:MonetaryGrant
    215 sg:grant.2686632 http://pending.schema.org/fundedItem sg:pub.10.1186/1471-2105-11-471
    216 rdf:type schema:MonetaryGrant
    217 sg:grant.2686701 http://pending.schema.org/fundedItem sg:pub.10.1186/1471-2105-11-471
    218 rdf:type schema:MonetaryGrant
    219 sg:grant.2705144 http://pending.schema.org/fundedItem sg:pub.10.1186/1471-2105-11-471
    220 rdf:type schema:MonetaryGrant
    221 sg:grant.2705241 http://pending.schema.org/fundedItem sg:pub.10.1186/1471-2105-11-471
    222 rdf:type schema:MonetaryGrant
    223 sg:journal.1023786 schema:issn 1471-2105
    224 schema:name BMC Bioinformatics
    225 schema:publisher Springer Nature
    226 rdf:type schema:Periodical
    227 sg:person.01172134543.75 schema:affiliation grid-institutes:grid.189967.8
    228 schema:familyName Patel
    229 schema:givenName Viren
    230 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01172134543.75
    231 rdf:type schema:Person
    232 sg:person.01177236633.13 schema:affiliation grid-institutes:grid.189967.8
    233 schema:familyName Zwick
    234 schema:givenName Michael E
    235 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01177236633.13
    236 rdf:type schema:Person
    237 sg:person.01214413052.03 schema:affiliation grid-institutes:grid.189967.8
    238 schema:familyName Athri
    239 schema:givenName Prashanth
    240 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01214413052.03
    241 rdf:type schema:Person
    242 sg:person.01300060075.86 schema:affiliation grid-institutes:grid.189967.8
    243 schema:familyName Horner
    244 schema:givenName Vanessa L
    245 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01300060075.86
    246 rdf:type schema:Person
    247 sg:person.01366500323.34 schema:affiliation grid-institutes:grid.189967.8
    248 schema:familyName Steinberg
    249 schema:givenName Karyn Meltz
    250 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01366500323.34
    251 rdf:type schema:Person
    252 sg:person.015711065457.76 schema:affiliation grid-institutes:grid.189967.8
    253 schema:familyName Cutler
    254 schema:givenName David J
    255 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.015711065457.76
    256 rdf:type schema:Person
    257 sg:person.0632333233.12 schema:affiliation grid-institutes:grid.189967.8
    258 schema:familyName Mondal
    259 schema:givenName Kajari
    260 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0632333233.12
    261 rdf:type schema:Person
    262 sg:person.0655032652.43 schema:affiliation grid-institutes:grid.189967.8
    263 schema:familyName Caspary
    264 schema:givenName Tamara
    265 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0655032652.43
    266 rdf:type schema:Person
    267 sg:person.0705703554.98 schema:affiliation grid-institutes:grid.189967.8
    268 schema:familyName Shetty
    269 schema:givenName Amol Carl
    270 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0705703554.98
    271 rdf:type schema:Person
    272 sg:pub.10.1038/35057062 schema:sameAs https://app.dimensions.ai/details/publication/pub.1042854081
    273 https://doi.org/10.1038/35057062
    274 rdf:type schema:CreativeWork
    275 sg:pub.10.1038/nature06884 schema:sameAs https://app.dimensions.ai/details/publication/pub.1047672670
    276 https://doi.org/10.1038/nature06884
    277 rdf:type schema:CreativeWork
    278 sg:pub.10.1038/nature07517 schema:sameAs https://app.dimensions.ai/details/publication/pub.1052925719
    279 https://doi.org/10.1038/nature07517
    280 rdf:type schema:CreativeWork
    281 sg:pub.10.1038/nature08211 schema:sameAs https://app.dimensions.ai/details/publication/pub.1030217088
    282 https://doi.org/10.1038/nature08211
    283 rdf:type schema:CreativeWork
    284 sg:pub.10.1038/nature08250 schema:sameAs https://app.dimensions.ai/details/publication/pub.1038593056
    285 https://doi.org/10.1038/nature08250
    286 rdf:type schema:CreativeWork
    287 sg:pub.10.1038/nbt.1523 schema:sameAs https://app.dimensions.ai/details/publication/pub.1040653661
    288 https://doi.org/10.1038/nbt.1523
    289 rdf:type schema:CreativeWork
    290 sg:pub.10.1038/nbt.1561 schema:sameAs https://app.dimensions.ai/details/publication/pub.1008087076
    291 https://doi.org/10.1038/nbt.1561
    292 rdf:type schema:CreativeWork
    293 sg:pub.10.1038/nbt1486 schema:sameAs https://app.dimensions.ai/details/publication/pub.1005954516
    294 https://doi.org/10.1038/nbt1486
    295 rdf:type schema:CreativeWork
    296 sg:pub.10.1038/ng.2007.42 schema:sameAs https://app.dimensions.ai/details/publication/pub.1012402265
    297 https://doi.org/10.1038/ng.2007.42
    298 rdf:type schema:CreativeWork
    299 sg:pub.10.1038/ng.499 schema:sameAs https://app.dimensions.ai/details/publication/pub.1021123766
    300 https://doi.org/10.1038/ng.499
    301 rdf:type schema:CreativeWork
    302 sg:pub.10.1038/ng0193-31 schema:sameAs https://app.dimensions.ai/details/publication/pub.1018483390
    303 https://doi.org/10.1038/ng0193-31
    304 rdf:type schema:CreativeWork
    305 sg:pub.10.1038/nmeth.1419 schema:sameAs https://app.dimensions.ai/details/publication/pub.1042265948
    306 https://doi.org/10.1038/nmeth.1419
    307 rdf:type schema:CreativeWork
    308 sg:pub.10.1038/nmeth0105-63 schema:sameAs https://app.dimensions.ai/details/publication/pub.1051721477
    309 https://doi.org/10.1038/nmeth0105-63
    310 rdf:type schema:CreativeWork
    311 sg:pub.10.1038/nmeth1109 schema:sameAs https://app.dimensions.ai/details/publication/pub.1009647980
    312 https://doi.org/10.1038/nmeth1109
    313 rdf:type schema:CreativeWork
    314 sg:pub.10.1038/nmeth1110 schema:sameAs https://app.dimensions.ai/details/publication/pub.1045927978
    315 https://doi.org/10.1038/nmeth1110
    316 rdf:type schema:CreativeWork
    317 sg:pub.10.1038/nmeth1111 schema:sameAs https://app.dimensions.ai/details/publication/pub.1025042325
    318 https://doi.org/10.1038/nmeth1111
    319 rdf:type schema:CreativeWork
    320 sg:pub.10.1038/nrg1325 schema:sameAs https://app.dimensions.ai/details/publication/pub.1041884975
    321 https://doi.org/10.1038/nrg1325
    322 rdf:type schema:CreativeWork
    323 grid-institutes:grid.189967.8 schema:alternateName Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
    324 Graduate Program in Population Biology, Ecology and Evolution, Emory University, Atlanta, GA, USA
    325 schema:name Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
    326 Graduate Program in Population Biology, Ecology and Evolution, Emory University, Atlanta, GA, USA
    327 rdf:type schema:Organization
     



