The COG database: an updated version includes eukaryotes


Ontology type: schema:ScholarlyArticle
Open Access: True


Article Info

DATE

2003-12

AUTHORS

Roman L Tatusov, Natalie D Fedorova, John D Jackson, Aviva R Jacobs, Boris Kiryutin, Eugene V Koonin, Dmitri M Krylov, Raja Mazumder, Sergei L Mekhedov, Anastasia N Nikolskaya, B Sridhar Rao, Sergei Smirnov, Alexander V Sverdlov, Sona Vasudevan, Yuri I Wolf, Jodie J Yin, Darren A Natale

ABSTRACT

BACKGROUND: The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies.

RESULTS: We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or approximately 54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of approximately 20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (approximately 1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes.

CONCLUSION: The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.
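
The coverage figures quoted in the abstract can be checked with a few lines of arithmetic. The Python sketch below is illustrative only: the counts are taken from the abstract, while the function and constant names are assumptions introduced here and are not part of the COG or SciGraph tooling.

```python
# Counts quoted in the abstract (constant names are illustrative).
COG_PROTEINS = 138_458          # proteins assigned to COGs
COG_CLUSTERS = 4_873            # number of COGs
UNICELLULAR_PROTEINS = 185_505  # predicted proteins in the 66 unicellular genomes

KOG_PROTEINS = 59_838           # proteins assigned to KOGs
KOG_CLUSTERS = 4_852            # number of KOGs
EUKARYOTIC_PROTEINS = 110_655   # analyzed eukaryotic gene products


def coverage_percent(clustered: int, total: int) -> float:
    """Percentage of all predicted proteins that belong to some cluster."""
    return 100.0 * clustered / total


def mean_cluster_size(clustered: int, clusters: int) -> float:
    """Average number of proteins per cluster."""
    return clustered / clusters


cog_coverage = coverage_percent(COG_PROTEINS, UNICELLULAR_PROTEINS)
kog_coverage = coverage_percent(KOG_PROTEINS, EUKARYOTIC_PROTEINS)

print(f"COG coverage: {cog_coverage:.1f}%")
print(f"KOG coverage: {kog_coverage:.1f}%")
print(f"Mean proteins per COG: {mean_cluster_size(COG_PROTEINS, COG_CLUSTERS):.1f}")
print(f"Mean proteins per KOG: {mean_cluster_size(KOG_PROTEINS, KOG_CLUSTERS):.1f}")
```

The two coverage values come out at roughly 74.6% and 54.1%, which agrees with the rounded 75% and 54% stated in the abstract.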

PAGES

41

References to SciGraph publications

  • 2001-02. Initial sequencing and analysis of the human genome in NATURE
  • 2001-12. Constant relative rate of protein evolution and detection of functional diversification among bacterial, archaeal and eukaryotic proteins in GENOME BIOLOGY
  • 2002-10. Genome sequence of the human malaria parasite Plasmodium falciparum in NATURE
  • 2002-12. RIO: Analyzing proteomes by automated phylogenomics using resampled inference of orthologs in BMC BIOINFORMATICS
  • 2002-12. Initial sequencing and comparative analysis of the mouse genome in NATURE
  • 2002-11. Genome evolution in bacterial endosymbionts of insects in NATURE REVIEWS GENETICS
  • 2001-10. Genome sequence of Yersinia pestis, the causative agent of plague in NATURE
  • 2000-12. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana in NATURE
  • 2001-12. Genome trees constructed using five different approaches suggest new major bacterial clades in BMC EVOLUTIONARY BIOLOGY
  • 2002-02. The genome sequence of Schizosaccharomyces pombe in NATURE
  • 2001-11. Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculi in NATURE
  • 2003-12. Algorithms for computing parsimonious evolutionary scenarios for genome evolution, the last universal common ancestor and dominance of horizontal gene transfer in the evolution of prokaryotes in BMC EVOLUTIONARY BIOLOGY
  • 2000-10. Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs) in GENOME BIOLOGY
  • 2002-11. The origin and evolution of model organisms in NATURE REVIEWS GENETICS
  • 2001-10. Complete genome sequence of Salmonella enterica serovar Typhimurium LT2 in NATURE

Identifiers

    URI

    http://scigraph.springernature.com/pub.10.1186/1471-2105-4-41

    DOI

    http://dx.doi.org/10.1186/1471-2105-4-41

    DIMENSIONS

    https://app.dimensions.ai/details/publication/pub.1013163036

    PUBMED

    https://www.ncbi.nlm.nih.gov/pubmed/12969510



    JSON-LD is the canonical representation for SciGraph data.
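
    A record in this form can be read with any JSON parser. The Python sketch below is a minimal illustration, not SciGraph tooling: it assumes the record is a JSON array holding one object with "author" and "citation" keys, as in this page's dump, and it uses a small hand-made excerpt (SAMPLE) rather than the full record. The helper names author_names and unique_citation_ids are hypothetical. Because the same citation id can appear more than once in the array, the second helper keeps only the first occurrence of each id.

```python
import json

# Hand-made excerpt in the same shape as the record; not the full record.
SAMPLE = """
[
  {
    "author": [
      {"familyName": "Tatusov", "givenName": "Roman L", "type": "Person"},
      {"familyName": "Fedorova", "givenName": "Natalie D", "type": "Person"}
    ],
    "citation": [
      {"id": "https://doi.org/10.1093/nar/gkg087", "type": "CreativeWork"},
      {"id": "sg:pub.10.1038/35097083", "type": "CreativeWork"},
      {"id": "sg:pub.10.1038/35097083", "type": "CreativeWork"}
    ]
  }
]
"""


def author_names(record: dict) -> list:
    """Return 'given family' name strings in the order they are listed."""
    return [f'{a["givenName"]} {a["familyName"]}' for a in record.get("author", [])]


def unique_citation_ids(record: dict) -> list:
    """Return citation ids with repeats removed, keeping first-seen order."""
    seen = set()
    ids = []
    for citation in record.get("citation", []):
        cid = citation["id"]
        if cid not in seen:
            seen.add(cid)
            ids.append(cid)
    return ids


# The dump is an array with a single record object.
record = json.loads(SAMPLE)[0]
names = author_names(record)
citations = unique_citation_ids(record)

print(names)
print(citations)
```

    Keeping first-seen order, rather than converting the ids to a plain set, preserves the ordering of the citation array.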


    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Genetics", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Biological Sciences", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Animals", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Databases, Nucleic Acid", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Databases, Protein", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Eukaryotic Cells", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Evolution, Molecular", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Humans", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "National Institutes of Health (U.S.)", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Proteins", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Terminology as Topic", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "United States", 
            "type": "DefinedTerm"
          }
        ], 
        "author": [
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Tatusov", 
            "givenName": "Roman L", 
            "id": "sg:person.01360057351.65", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01360057351.65"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Fedorova", 
            "givenName": "Natalie D", 
            "id": "sg:person.01161731273.64", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01161731273.64"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Jackson", 
            "givenName": "John D", 
            "id": "sg:person.01372217634.12", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01372217634.12"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Jacobs", 
            "givenName": "Aviva R", 
            "id": "sg:person.0601003574.66", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0601003574.66"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Kiryutin", 
            "givenName": "Boris", 
            "id": "sg:person.0702773360.18", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0702773360.18"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Koonin", 
            "givenName": "Eugene V", 
            "id": "sg:person.01017015051.78", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01017015051.78"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Krylov", 
            "givenName": "Dmitri M", 
            "id": "sg:person.0763345374.05", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0763345374.05"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Georgetown University Medical Center", 
              "id": "https://www.grid.ac/institutes/grid.411667.3", 
              "name": [
                "Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, 20007, Washington, DC, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Mazumder", 
            "givenName": "Raja", 
            "id": "sg:person.01036614410.14", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01036614410.14"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Mekhedov", 
            "givenName": "Sergei L", 
            "id": "sg:person.014017517447.33", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014017517447.33"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Georgetown University Medical Center", 
              "id": "https://www.grid.ac/institutes/grid.411667.3", 
              "name": [
                "Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, 20007, Washington, DC, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Nikolskaya", 
            "givenName": "Anastasia N", 
            "id": "sg:person.01145707174.35", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01145707174.35"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Rao", 
            "givenName": "B Sridhar", 
            "id": "sg:person.01214022374.98", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01214022374.98"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Smirnov", 
            "givenName": "Sergei", 
            "id": "sg:person.014003770757.69", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014003770757.69"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Sverdlov", 
            "givenName": "Alexander V", 
            "id": "sg:person.01330250774.14", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01330250774.14"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Vasudevan", 
            "givenName": "Sona", 
            "id": "sg:person.0604644674.14", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0604644674.14"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Wolf", 
            "givenName": "Yuri I", 
            "id": "sg:person.0634453251.89", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0634453251.89"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information", 
              "id": "https://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Yin", 
            "givenName": "Jodie J", 
            "id": "sg:person.0721073274.50", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0721073274.50"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Georgetown University Medical Center", 
              "id": "https://www.grid.ac/institutes/grid.411667.3", 
              "name": [
                "Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, 20007, Washington, DC, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Natale", 
            "givenName": "Darren A", 
            "id": "sg:person.01055364702.08", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01055364702.08"
            ], 
            "type": "Person"
          }
        ], 
        "citation": [
          {
            "id": "https://doi.org/10.1093/embo-reports/kvf248", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1000333278"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1006/jmbi.2000.3550", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1000396794"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/s0168-9525(02)02793-2", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1002684451"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1093/nar/gkg087", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1005864982"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/gb-2000-1-5-research0009", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1007636928", 
              "https://doi.org/10.1186/gb-2000-1-5-research0009"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/35097083", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1007834691", 
              "https://doi.org/10.1038/35097083"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature724", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1009114411", 
              "https://doi.org/10.1038/nature724"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1093/nar/29.1.22", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1010981269"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1046/j.1365-2958.1997.4821861.x", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1012052760"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/s0076-6879(96)66035-2", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1018458717"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1126/science.287.5461.2185", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1019737566"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1146/annurev.micro.55.1.709", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1020062045"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/35106579", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1020615079", 
              "https://doi.org/10.1038/35106579"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1128/mmbr.65.1.44-79.2001", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1021785027"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1074/jbc.m004141200", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1022560741"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1128/jb.183.16.4823-4838.2001", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1025210625"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/s0065-3233(00)54010-8", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1026639024"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/s0168-9525(99)01704-7", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1027856251"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/s0168-9525(00)02005-9", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1028049243"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/35101614", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1028123916", 
              "https://doi.org/10.1038/35101614"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/s0092-8674(00)80867-3", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1029080476"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/s0959-440x(00)00098-1", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1029769599"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1093/nar/25.9.1665", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1030289871"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1073/pnas.032671499", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1030340865"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature01097", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1030736656", 
              "https://doi.org/10.1038/nature01097"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1093/nar/29.2.545", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1031098434"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1073/pnas.95.11.5849", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1031199342"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1093/bioinformatics/18.1.92", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1032953758"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1471-2148-1-8", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1035042918", 
              "https://doi.org/10.1186/1471-2148-1-8"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1073/pnas.95.11.5857", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1035682917"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1101/gr.87702", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1035714287"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1101/gr.174302", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1036137080"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/gb-2001-2-12-research0053", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1036924176", 
              "https://doi.org/10.1186/gb-2001-2-12-research0053"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1073/pnas.97.10.5334", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1039472008"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1126/science.278.5338.631", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1039646901"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nrg931", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1039727832", 
              "https://doi.org/10.1038/nrg931"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature01262", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1039854529", 
              "https://doi.org/10.1038/nature01262"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1471-2105-3-14", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1041450232", 
              "https://doi.org/10.1186/1471-2105-3-14"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nrg929", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1041641433", 
              "https://doi.org/10.1038/nrg929"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/35057062", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1042854081", 
              "https://doi.org/10.1038/35057062"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1073/pnas.141236298", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1043005018"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/35048692", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1044298669", 
              "https://doi.org/10.1038/35048692"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1126/science.1072104", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1045516288"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1006/jmbi.2000.5197", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1046271071"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1126/science.1080049", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1047114664"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1093/nar/25.17.3389", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1047265454"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1471-2148-3-2", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1047997844", 
              "https://doi.org/10.1186/1471-2148-3-2"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/s0968-0004(98)01334-6", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1048073612"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1101/gr.176501", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1048081914"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/s0092-8674(00)80284-6", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1051957131"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1101/gr.gr-1619r", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1060407554"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1101/gr.161901", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1060407554"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1101/gr.gr-1660r", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1060407559"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1101/gr.gr1653r", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1060407572"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1126/science.1076181", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1062446937"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1126/science.274.5287.546", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1062554574"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1126/science.278.5338.609", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1062558435"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1126/science.282.5396.2012", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1062563514"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.2307/2412448", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1069920881"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://app.dimensions.ai/details/publication/pub.1074493861", 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://app.dimensions.ai/details/publication/pub.1074727289", 
            "type": "CreativeWork"
          }
        ], 
        "datePublished": "2003-12", 
        "datePublishedReg": "2003-12-01", 
        "description": "BACKGROUND: The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies.\nRESULTS: We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or approximately 54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of approximately 20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (approximately 1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes.\nCONCLUSION: The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.", 
        "genre": "research_article", 
        "id": "sg:pub.10.1186/1471-2105-4-41", 
        "inLanguage": [
          "en"
        ], 
        "isAccessibleForFree": true, 
        "isPartOf": [
          {
            "id": "sg:journal.1023786", 
            "issn": [
              "1471-2105"
            ], 
            "name": "BMC Bioinformatics", 
            "type": "Periodical"
          }, 
          {
            "issueNumber": "1", 
            "type": "PublicationIssue"
          }, 
          {
            "type": "PublicationVolume", 
            "volumeNumber": "4"
          }
        ], 
        "name": "The COG database: an updated version includes eukaryotes", 
        "pagination": "41", 
        "productId": [
          {
            "name": "readcube_id", 
            "type": "PropertyValue", 
            "value": [
              "3eb45806c33c68dc73e9b4c666ce33786b62fc08b8cea4f4beea312d66593b18"
            ]
          }, 
          {
            "name": "pubmed_id", 
            "type": "PropertyValue", 
            "value": [
              "12969510"
            ]
          }, 
          {
            "name": "nlm_unique_id", 
            "type": "PropertyValue", 
            "value": [
              "100965194"
            ]
          }, 
          {
            "name": "doi", 
            "type": "PropertyValue", 
            "value": [
              "10.1186/1471-2105-4-41"
            ]
          }, 
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "pub.1013163036"
            ]
          }
        ], 
        "sameAs": [
          "https://doi.org/10.1186/1471-2105-4-41", 
          "https://app.dimensions.ai/details/publication/pub.1013163036"
        ], 
        "sdDataset": "articles", 
        "sdDatePublished": "2019-04-10T18:18", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8675_00000504.jsonl", 
        "type": "ScholarlyArticle", 
        "url": "http://link.springer.com/10.1186%2F1471-2105-4-41"
      }
    ]
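
    The "productId" entries in the JSON-LD record above each pair an identifier name with a one-element value list. The following is a minimal sketch, in Python, of how those pairs could be flattened into a simple lookup table. The function name `product_ids` is hypothetical, and the embedded fragment copies only two of the entries shown above rather than the full record.

```python
import json

# Fragment copied from the "productId" array of the record above
# (only two of the five entries, to keep the sketch short).
RECORD_FRAGMENT = """
{
  "productId": [
    {"name": "pubmed_id", "type": "PropertyValue", "value": ["12969510"]},
    {"name": "doi", "type": "PropertyValue", "value": ["10.1186/1471-2105-4-41"]}
  ]
}
"""


def product_ids(record):
    """Map each productId name to its first value (hypothetical helper)."""
    ids = {}
    for entry in record.get("productId", []):
        values = entry.get("value", [])
        if values:  # skip entries that carry no value at all
            ids[entry["name"]] = values[0]
    return ids


record = json.loads(RECORD_FRAGMENT)
ids = product_ids(record)
print(ids["doi"])
print(ids["pubmed_id"])
```

    Taking only the first value is an assumption that fits this record, where every value list has exactly one element; a record with several values per identifier would need the whole list kept.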
     


    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-4-41'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-4-41'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-4-41'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-4-41'
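
    The four curl commands above differ only in the Accept header, so the same content negotiation could be sketched in Python using the standard library alone. The names `FORMATS`, `build_request` and `fetch` are hypothetical, and the page itself does not confirm whether the endpoint still answers, so treat this as an illustration of the request shape rather than a tested client.

```python
from urllib.request import Request, urlopen

BASE_URL = "https://scigraph.springernature.com/"

# Media types taken from the curl examples above.
FORMATS = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(record_id, fmt="json-ld"):
    """Build (but do not send) a request for one record in one RDF format."""
    return Request(BASE_URL + record_id, headers={"Accept": FORMATS[fmt]})


def fetch(record_id, fmt="json-ld", timeout=30):
    """Send the request and return the response body as text.

    Assumes the response is UTF-8 encoded, which the page does not state.
    """
    with urlopen(build_request(record_id, fmt), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


req = build_request("pub.10.1186/1471-2105-4-41", "turtle")
print(req.full_url, req.get_header("Accept"))
```

    Building the request separately from sending it keeps the header logic checkable without network access; `fetch` is the only part that touches the network.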


     

    This table displays all metadata directly associated to this object as RDF triples.

    419 TRIPLES      21 PREDICATES      100 URIs      31 LITERALS      19 BLANK NODES
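
    In the table that follows, the author order is not stored as a plain list but as an RDF collection: each blank node carries an rdf:first (the author) and an rdf:rest (the next node), and the chain ends at rdf:nil. A minimal Python sketch of walking such a chain is given here. The function `walk_rdf_list` is hypothetical, and the two dictionaries hold only the last three nodes of the chain as they appear in the table (Wolf, Yin, Natale), not all seventeen.

```python
RDF_NIL = "rdf:nil"

# rdf:first and rdf:rest values for the final three list nodes,
# copied from the triple table (an excerpt, not the full chain).
FIRST = {
    "N31a2a5a9161c416899e56124dd11dcf9": "sg:person.0634453251.89",
    "Nb9bacbe98bd4437590f7a567414cf006": "sg:person.0721073274.50",
    "Nbba891f1f1fa4ac987a77d2ccb218472": "sg:person.01055364702.08",
}
REST = {
    "N31a2a5a9161c416899e56124dd11dcf9": "Nb9bacbe98bd4437590f7a567414cf006",
    "Nb9bacbe98bd4437590f7a567414cf006": "Nbba891f1f1fa4ac987a77d2ccb218472",
    "Nbba891f1f1fa4ac987a77d2ccb218472": RDF_NIL,
}


def walk_rdf_list(head, first, rest):
    """Follow rdf:rest links from `head`, collecting each rdf:first value."""
    items = []
    seen = set()
    node = head
    while node != RDF_NIL:
        if node in seen:  # guard against a malformed, cyclic list
            raise ValueError(f"cycle detected at {node}")
        seen.add(node)
        items.append(first[node])
        node = rest[node]
    return items


authors = walk_rdf_list("N31a2a5a9161c416899e56124dd11dcf9", FIRST, REST)
print(authors)
```

    Starting the walk at the node referenced by schema:author instead would, under the same assumptions, yield the full author order.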

    Subject Predicate Object
    1 sg:pub.10.1186/1471-2105-4-41 schema:about N17b1e6fc839e49079dab8400de95d59c
    2 N1c2a4ba6f96d4ad988fccd8425dfada5
    3 N205761a9eb444efd923c0d584aca74a6
    4 N310d0a59757041cfa4f60de894749f28
    5 N48e577dcb91e4c6eb8dd32e51c79b561
    6 N5070ff1a3d324eb9acf683d422c63463
    7 N64babf422fed4fc7bf774e04c2179363
    8 N66999c828019406894aaf501f3ae5a65
    9 Nd0ccf5aae36c42e6b6889fe70274a965
    10 Ne488c9aa62d54437af2ca36d82c1e7d8
    11 anzsrc-for:06
    12 anzsrc-for:0604
    13 schema:author Ncb1c97edb30642c0bd068ca44ccf3cbe
    14 schema:citation sg:pub.10.1038/35048692
    15 sg:pub.10.1038/35057062
    16 sg:pub.10.1038/35097083
    17 sg:pub.10.1038/35101614
    18 sg:pub.10.1038/35106579
    19 sg:pub.10.1038/nature01097
    20 sg:pub.10.1038/nature01262
    21 sg:pub.10.1038/nature724
    22 sg:pub.10.1038/nrg929
    23 sg:pub.10.1038/nrg931
    24 sg:pub.10.1186/1471-2105-3-14
    25 sg:pub.10.1186/1471-2148-1-8
    26 sg:pub.10.1186/1471-2148-3-2
    27 sg:pub.10.1186/gb-2000-1-5-research0009
    28 sg:pub.10.1186/gb-2001-2-12-research0053
    29 https://app.dimensions.ai/details/publication/pub.1074493861
    30 https://app.dimensions.ai/details/publication/pub.1074727289
    31 https://doi.org/10.1006/jmbi.2000.3550
    32 https://doi.org/10.1006/jmbi.2000.5197
    33 https://doi.org/10.1016/s0065-3233(00)54010-8
    34 https://doi.org/10.1016/s0076-6879(96)66035-2
    35 https://doi.org/10.1016/s0092-8674(00)80284-6
    36 https://doi.org/10.1016/s0092-8674(00)80867-3
    37 https://doi.org/10.1016/s0168-9525(00)02005-9
    38 https://doi.org/10.1016/s0168-9525(02)02793-2
    39 https://doi.org/10.1016/s0168-9525(99)01704-7
    40 https://doi.org/10.1016/s0959-440x(00)00098-1
    41 https://doi.org/10.1016/s0968-0004(98)01334-6
    42 https://doi.org/10.1046/j.1365-2958.1997.4821861.x
    43 https://doi.org/10.1073/pnas.032671499
    44 https://doi.org/10.1073/pnas.141236298
    45 https://doi.org/10.1073/pnas.95.11.5849
    46 https://doi.org/10.1073/pnas.95.11.5857
    47 https://doi.org/10.1073/pnas.97.10.5334
    48 https://doi.org/10.1074/jbc.m004141200
    49 https://doi.org/10.1093/bioinformatics/18.1.92
    50 https://doi.org/10.1093/embo-reports/kvf248
    51 https://doi.org/10.1093/nar/25.17.3389
    52 https://doi.org/10.1093/nar/25.9.1665
    53 https://doi.org/10.1093/nar/29.1.22
    54 https://doi.org/10.1093/nar/29.2.545
    55 https://doi.org/10.1093/nar/gkg087
    56 https://doi.org/10.1101/gr.161901
    57 https://doi.org/10.1101/gr.174302
    58 https://doi.org/10.1101/gr.176501
    59 https://doi.org/10.1101/gr.87702
    60 https://doi.org/10.1101/gr.gr-1619r
    61 https://doi.org/10.1101/gr.gr-1660r
    62 https://doi.org/10.1101/gr.gr1653r
    63 https://doi.org/10.1126/science.1072104
    64 https://doi.org/10.1126/science.1076181
    65 https://doi.org/10.1126/science.1080049
    66 https://doi.org/10.1126/science.274.5287.546
    67 https://doi.org/10.1126/science.278.5338.609
    68 https://doi.org/10.1126/science.278.5338.631
    69 https://doi.org/10.1126/science.282.5396.2012
    70 https://doi.org/10.1126/science.287.5461.2185
    71 https://doi.org/10.1128/jb.183.16.4823-4838.2001
    72 https://doi.org/10.1128/mmbr.65.1.44-79.2001
    73 https://doi.org/10.1146/annurev.micro.55.1.709
    74 https://doi.org/10.2307/2412448
    75 schema:datePublished 2003-12
    76 schema:datePublishedReg 2003-12-01
    77 schema:description BACKGROUND: The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. RESULTS: We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or approximately 54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of approximately 20% of the KOG set. 
This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (approximately 1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes. CONCLUSION: The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.
    78 schema:genre research_article
    79 schema:inLanguage en
    80 schema:isAccessibleForFree true
    81 schema:isPartOf Nab41eb669fdf43db9005e2926e7b5b25
    82 Nb7f0602f916e4ce7b9b78e6434c055db
    83 sg:journal.1023786
    84 schema:name The COG database: an updated version includes eukaryotes
    85 schema:pagination 41
    86 schema:productId N2073f8eedb0b4178bdabe17f3fc3a41a
    87 N26108b49d0be4414a961c1a2fff1fdc1
    88 N3cca942fd36f4fea872cd3e593d749c4
    89 N58ea06acfb79472fa5956d57adaef0d9
    90 N6914fdf77908429cb9dfb5e85542dca1
    91 schema:sameAs https://app.dimensions.ai/details/publication/pub.1013163036
    92 https://doi.org/10.1186/1471-2105-4-41
    93 schema:sdDatePublished 2019-04-10T18:18
    94 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    95 schema:sdPublisher N92f7949c40444602b1276b0569a655df
    96 schema:url http://link.springer.com/10.1186%2F1471-2105-4-41
    97 sgo:license sg:explorer/license/
    98 sgo:sdDataset articles
    99 rdf:type schema:ScholarlyArticle
    100 N17b1e6fc839e49079dab8400de95d59c schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    101 schema:name United States
    102 rdf:type schema:DefinedTerm
    103 N1c2a4ba6f96d4ad988fccd8425dfada5 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    104 schema:name Animals
    105 rdf:type schema:DefinedTerm
    106 N205761a9eb444efd923c0d584aca74a6 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    107 schema:name Proteins
    108 rdf:type schema:DefinedTerm
    109 N2073f8eedb0b4178bdabe17f3fc3a41a schema:name readcube_id
    110 schema:value 3eb45806c33c68dc73e9b4c666ce33786b62fc08b8cea4f4beea312d66593b18
    111 rdf:type schema:PropertyValue
    112 N20ab8f6533a74c26be2a0daf62bad758 rdf:first sg:person.0702773360.18
    113 rdf:rest Ne805e4c45a674fd59bc05543a61cc2c2
    114 N26108b49d0be4414a961c1a2fff1fdc1 schema:name doi
    115 schema:value 10.1186/1471-2105-4-41
    116 rdf:type schema:PropertyValue
    117 N310d0a59757041cfa4f60de894749f28 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    118 schema:name Databases, Protein
    119 rdf:type schema:DefinedTerm
    120 N31a2a5a9161c416899e56124dd11dcf9 rdf:first sg:person.0634453251.89
    121 rdf:rest Nb9bacbe98bd4437590f7a567414cf006
    122 N3cca942fd36f4fea872cd3e593d749c4 schema:name pubmed_id
    123 schema:value 12969510
    124 rdf:type schema:PropertyValue
    125 N4089e64aa5c94c2f94d14d4637784406 rdf:first sg:person.01036614410.14
    126 rdf:rest N9c2b501a68c644e7b31ceda9da9d3abd
    127 N48e577dcb91e4c6eb8dd32e51c79b561 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    128 schema:name Eukaryotic Cells
    129 rdf:type schema:DefinedTerm
    130 N5070ff1a3d324eb9acf683d422c63463 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    131 schema:name Databases, Nucleic Acid
    132 rdf:type schema:DefinedTerm
    133 N58ea06acfb79472fa5956d57adaef0d9 schema:name nlm_unique_id
    134 schema:value 100965194
    135 rdf:type schema:PropertyValue
    136 N60aae0c08c9b4cdea31bd3f68b6bb518 rdf:first sg:person.01330250774.14
    137 rdf:rest Nb2f6092bb6e24a08b0469336d854eff4
    138 N645384d4971c4613885c7d3886bd13fc rdf:first sg:person.014003770757.69
    139 rdf:rest N60aae0c08c9b4cdea31bd3f68b6bb518
    140 N64babf422fed4fc7bf774e04c2179363 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    141 schema:name National Institutes of Health (U.S.)
    142 rdf:type schema:DefinedTerm
    143 N64efe56a38404f4cb217541299b2b0a3 rdf:first sg:person.01161731273.64
    144 rdf:rest N6f1230e618a54665b11b0f91e71795b9
    145 N66999c828019406894aaf501f3ae5a65 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    146 schema:name Evolution, Molecular
    147 rdf:type schema:DefinedTerm
    148 N6914fdf77908429cb9dfb5e85542dca1 schema:name dimensions_id
    149 schema:value pub.1013163036
    150 rdf:type schema:PropertyValue
    151 N6f1230e618a54665b11b0f91e71795b9 rdf:first sg:person.01372217634.12
    152 rdf:rest N72e3f58c2807432099121486e814ef3c
    153 N72e3f58c2807432099121486e814ef3c rdf:first sg:person.0601003574.66
    154 rdf:rest N20ab8f6533a74c26be2a0daf62bad758
    155 N92f7949c40444602b1276b0569a655df schema:name Springer Nature - SN SciGraph project
    156 rdf:type schema:Organization
    157 N9c2b501a68c644e7b31ceda9da9d3abd rdf:first sg:person.014017517447.33
    158 rdf:rest Nb8d1f9beacc640258685cc21aa746990
    159 Nab41eb669fdf43db9005e2926e7b5b25 schema:volumeNumber 4
    160 rdf:type schema:PublicationVolume
    161 Nb2f6092bb6e24a08b0469336d854eff4 rdf:first sg:person.0604644674.14
    162 rdf:rest N31a2a5a9161c416899e56124dd11dcf9
    163 Nb7f0602f916e4ce7b9b78e6434c055db schema:issueNumber 1
    164 rdf:type schema:PublicationIssue
    165 Nb8d1f9beacc640258685cc21aa746990 rdf:first sg:person.01145707174.35
    166 rdf:rest Nfa316442fe414ace995cae3ff73032e2
    167 Nb9bacbe98bd4437590f7a567414cf006 rdf:first sg:person.0721073274.50
    168 rdf:rest Nbba891f1f1fa4ac987a77d2ccb218472
    169 Nbba891f1f1fa4ac987a77d2ccb218472 rdf:first sg:person.01055364702.08
    170 rdf:rest rdf:nil
    171 Ncb1c97edb30642c0bd068ca44ccf3cbe rdf:first sg:person.01360057351.65
    172 rdf:rest N64efe56a38404f4cb217541299b2b0a3
    173 Nd0ccf5aae36c42e6b6889fe70274a965 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    174 schema:name Humans
    175 rdf:type schema:DefinedTerm
    176 Ne488c9aa62d54437af2ca36d82c1e7d8 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    177 schema:name Terminology as Topic
    178 rdf:type schema:DefinedTerm
    179 Ne805e4c45a674fd59bc05543a61cc2c2 rdf:first sg:person.01017015051.78
    180 rdf:rest Ne8eb1b96ceac43d587601764e71f87a3
    181 Ne8eb1b96ceac43d587601764e71f87a3 rdf:first sg:person.0763345374.05
    182 rdf:rest N4089e64aa5c94c2f94d14d4637784406
    183 Nfa316442fe414ace995cae3ff73032e2 rdf:first sg:person.01214022374.98
    184 rdf:rest N645384d4971c4613885c7d3886bd13fc
    185 anzsrc-for:06 schema:inDefinedTermSet anzsrc-for:
    186 schema:name Biological Sciences
    187 rdf:type schema:DefinedTerm
    188 anzsrc-for:0604 schema:inDefinedTermSet anzsrc-for:
    189 schema:name Genetics
    190 rdf:type schema:DefinedTerm
    191 sg:journal.1023786 schema:issn 1471-2105
    192 schema:name BMC Bioinformatics
    193 rdf:type schema:Periodical
    194 sg:person.01017015051.78 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    195 schema:familyName Koonin
    196 schema:givenName Eugene V
    197 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01017015051.78
    198 rdf:type schema:Person
    199 sg:person.01036614410.14 schema:affiliation https://www.grid.ac/institutes/grid.411667.3
    200 schema:familyName Mazumder
    201 schema:givenName Raja
    202 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01036614410.14
    203 rdf:type schema:Person
    204 sg:person.01055364702.08 schema:affiliation https://www.grid.ac/institutes/grid.411667.3
    205 schema:familyName Natale
    206 schema:givenName Darren A
    207 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01055364702.08
    208 rdf:type schema:Person
    209 sg:person.01145707174.35 schema:affiliation https://www.grid.ac/institutes/grid.411667.3
    210 schema:familyName Nikolskaya
    211 schema:givenName Anastasia N
    212 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01145707174.35
    213 rdf:type schema:Person
    214 sg:person.01161731273.64 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    215 schema:familyName Fedorova
    216 schema:givenName Natalie D
    217 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01161731273.64
    218 rdf:type schema:Person
    219 sg:person.01214022374.98 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    220 schema:familyName Rao
    221 schema:givenName B Sridhar
    222 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01214022374.98
    223 rdf:type schema:Person
    224 sg:person.01330250774.14 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    225 schema:familyName Sverdlov
    226 schema:givenName Alexander V
    227 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01330250774.14
    228 rdf:type schema:Person
    229 sg:person.01360057351.65 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    230 schema:familyName Tatusov
    231 schema:givenName Roman L
    232 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01360057351.65
    233 rdf:type schema:Person
    234 sg:person.01372217634.12 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    235 schema:familyName Jackson
    236 schema:givenName John D
    237 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01372217634.12
    238 rdf:type schema:Person
    239 sg:person.014003770757.69 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    240 schema:familyName Smirnov
    241 schema:givenName Sergei
    242 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014003770757.69
    243 rdf:type schema:Person
    244 sg:person.014017517447.33 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    245 schema:familyName Mekhedov
    246 schema:givenName Sergei L
    247 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014017517447.33
    248 rdf:type schema:Person
    249 sg:person.0601003574.66 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    250 schema:familyName Jacobs
    251 schema:givenName Aviva R
    252 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0601003574.66
    253 rdf:type schema:Person
    254 sg:person.0604644674.14 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    255 schema:familyName Vasudevan
    256 schema:givenName Sona
    257 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0604644674.14
    258 rdf:type schema:Person
    259 sg:person.0634453251.89 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    260 schema:familyName Wolf
    261 schema:givenName Yuri I
    262 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0634453251.89
    263 rdf:type schema:Person
    264 sg:person.0702773360.18 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    265 schema:familyName Kiryutin
    266 schema:givenName Boris
    267 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0702773360.18
    268 rdf:type schema:Person
    269 sg:person.0721073274.50 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    270 schema:familyName Yin
    271 schema:givenName Jodie J
    272 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0721073274.50
    273 rdf:type schema:Person
    274 sg:person.0763345374.05 schema:affiliation https://www.grid.ac/institutes/grid.419234.9
    275 schema:familyName Krylov
    276 schema:givenName Dmitri M
    277 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0763345374.05
    278 rdf:type schema:Person
    279 sg:pub.10.1038/35048692 schema:sameAs https://app.dimensions.ai/details/publication/pub.1044298669
    280 https://doi.org/10.1038/35048692
    281 rdf:type schema:CreativeWork
    282 sg:pub.10.1038/35057062 schema:sameAs https://app.dimensions.ai/details/publication/pub.1042854081
    283 https://doi.org/10.1038/35057062
    284 rdf:type schema:CreativeWork
    285 sg:pub.10.1038/35097083 schema:sameAs https://app.dimensions.ai/details/publication/pub.1007834691
    286 https://doi.org/10.1038/35097083
    287 rdf:type schema:CreativeWork
    288 sg:pub.10.1038/35101614 schema:sameAs https://app.dimensions.ai/details/publication/pub.1028123916
    289 https://doi.org/10.1038/35101614
    290 rdf:type schema:CreativeWork
    291 sg:pub.10.1038/35106579 schema:sameAs https://app.dimensions.ai/details/publication/pub.1020615079
    292 https://doi.org/10.1038/35106579
    293 rdf:type schema:CreativeWork
    294 sg:pub.10.1038/nature01097 schema:sameAs https://app.dimensions.ai/details/publication/pub.1030736656
    295 https://doi.org/10.1038/nature01097
    296 rdf:type schema:CreativeWork
    297 sg:pub.10.1038/nature01262 schema:sameAs https://app.dimensions.ai/details/publication/pub.1039854529
    298 https://doi.org/10.1038/nature01262
    299 rdf:type schema:CreativeWork
    300 sg:pub.10.1038/nature724 schema:sameAs https://app.dimensions.ai/details/publication/pub.1009114411
    301 https://doi.org/10.1038/nature724
    302 rdf:type schema:CreativeWork
    303 sg:pub.10.1038/nrg929 schema:sameAs https://app.dimensions.ai/details/publication/pub.1041641433
    304 https://doi.org/10.1038/nrg929
    305 rdf:type schema:CreativeWork
    306 sg:pub.10.1038/nrg931 schema:sameAs https://app.dimensions.ai/details/publication/pub.1039727832
    307 https://doi.org/10.1038/nrg931
    308 rdf:type schema:CreativeWork
    309 sg:pub.10.1186/1471-2105-3-14 schema:sameAs https://app.dimensions.ai/details/publication/pub.1041450232
    310 https://doi.org/10.1186/1471-2105-3-14
    311 rdf:type schema:CreativeWork
    312 sg:pub.10.1186/1471-2148-1-8 schema:sameAs https://app.dimensions.ai/details/publication/pub.1035042918
    313 https://doi.org/10.1186/1471-2148-1-8
    314 rdf:type schema:CreativeWork
    315 sg:pub.10.1186/1471-2148-3-2 schema:sameAs https://app.dimensions.ai/details/publication/pub.1047997844
    316 https://doi.org/10.1186/1471-2148-3-2
    317 rdf:type schema:CreativeWork
    318 sg:pub.10.1186/gb-2000-1-5-research0009 schema:sameAs https://app.dimensions.ai/details/publication/pub.1007636928
    319 https://doi.org/10.1186/gb-2000-1-5-research0009
    320 rdf:type schema:CreativeWork
    321 sg:pub.10.1186/gb-2001-2-12-research0053 schema:sameAs https://app.dimensions.ai/details/publication/pub.1036924176
    322 https://doi.org/10.1186/gb-2001-2-12-research0053
    323 rdf:type schema:CreativeWork
    324 https://app.dimensions.ai/details/publication/pub.1074493861 schema:CreativeWork
    325 https://app.dimensions.ai/details/publication/pub.1074727289 schema:CreativeWork
    326 https://doi.org/10.1006/jmbi.2000.3550 schema:sameAs https://app.dimensions.ai/details/publication/pub.1000396794
    327 rdf:type schema:CreativeWork
    328 https://doi.org/10.1006/jmbi.2000.5197 schema:sameAs https://app.dimensions.ai/details/publication/pub.1046271071
    329 rdf:type schema:CreativeWork
    330 https://doi.org/10.1016/s0065-3233(00)54010-8 schema:sameAs https://app.dimensions.ai/details/publication/pub.1026639024
    331 rdf:type schema:CreativeWork
    332 https://doi.org/10.1016/s0076-6879(96)66035-2 schema:sameAs https://app.dimensions.ai/details/publication/pub.1018458717
    333 rdf:type schema:CreativeWork
    334 https://doi.org/10.1016/s0092-8674(00)80284-6 schema:sameAs https://app.dimensions.ai/details/publication/pub.1051957131
    335 rdf:type schema:CreativeWork
    336 https://doi.org/10.1016/s0092-8674(00)80867-3 schema:sameAs https://app.dimensions.ai/details/publication/pub.1029080476
    337 rdf:type schema:CreativeWork
    338 https://doi.org/10.1016/s0168-9525(00)02005-9 schema:sameAs https://app.dimensions.ai/details/publication/pub.1028049243
    339 rdf:type schema:CreativeWork
    340 https://doi.org/10.1016/s0168-9525(02)02793-2 schema:sameAs https://app.dimensions.ai/details/publication/pub.1002684451
    341 rdf:type schema:CreativeWork
    342 https://doi.org/10.1016/s0168-9525(99)01704-7 schema:sameAs https://app.dimensions.ai/details/publication/pub.1027856251
    343 rdf:type schema:CreativeWork
    344 https://doi.org/10.1016/s0959-440x(00)00098-1 schema:sameAs https://app.dimensions.ai/details/publication/pub.1029769599
    345 rdf:type schema:CreativeWork
    346 https://doi.org/10.1016/s0968-0004(98)01334-6 schema:sameAs https://app.dimensions.ai/details/publication/pub.1048073612
    347 rdf:type schema:CreativeWork
    348 https://doi.org/10.1046/j.1365-2958.1997.4821861.x schema:sameAs https://app.dimensions.ai/details/publication/pub.1012052760
    349 rdf:type schema:CreativeWork
    350 https://doi.org/10.1073/pnas.032671499 schema:sameAs https://app.dimensions.ai/details/publication/pub.1030340865
    351 rdf:type schema:CreativeWork
    352 https://doi.org/10.1073/pnas.141236298 schema:sameAs https://app.dimensions.ai/details/publication/pub.1043005018
    353 rdf:type schema:CreativeWork
    354 https://doi.org/10.1073/pnas.95.11.5849 schema:sameAs https://app.dimensions.ai/details/publication/pub.1031199342
    355 rdf:type schema:CreativeWork
    356 https://doi.org/10.1073/pnas.95.11.5857 schema:sameAs https://app.dimensions.ai/details/publication/pub.1035682917
    357 rdf:type schema:CreativeWork
    358 https://doi.org/10.1073/pnas.97.10.5334 schema:sameAs https://app.dimensions.ai/details/publication/pub.1039472008
    359 rdf:type schema:CreativeWork
    360 https://doi.org/10.1074/jbc.m004141200 schema:sameAs https://app.dimensions.ai/details/publication/pub.1022560741
    361 rdf:type schema:CreativeWork
    362 https://doi.org/10.1093/bioinformatics/18.1.92 schema:sameAs https://app.dimensions.ai/details/publication/pub.1032953758
    363 rdf:type schema:CreativeWork
    364 https://doi.org/10.1093/embo-reports/kvf248 schema:sameAs https://app.dimensions.ai/details/publication/pub.1000333278
    365 rdf:type schema:CreativeWork
    366 https://doi.org/10.1093/nar/25.17.3389 schema:sameAs https://app.dimensions.ai/details/publication/pub.1047265454
    367 rdf:type schema:CreativeWork
    368 https://doi.org/10.1093/nar/25.9.1665 schema:sameAs https://app.dimensions.ai/details/publication/pub.1030289871
    369 rdf:type schema:CreativeWork
    370 https://doi.org/10.1093/nar/29.1.22 schema:sameAs https://app.dimensions.ai/details/publication/pub.1010981269
    371 rdf:type schema:CreativeWork
    372 https://doi.org/10.1093/nar/29.2.545 schema:sameAs https://app.dimensions.ai/details/publication/pub.1031098434
    373 rdf:type schema:CreativeWork
    374 https://doi.org/10.1093/nar/gkg087 schema:sameAs https://app.dimensions.ai/details/publication/pub.1005864982
    375 rdf:type schema:CreativeWork
    376 https://doi.org/10.1101/gr.161901 schema:sameAs https://app.dimensions.ai/details/publication/pub.1060407554
    377 rdf:type schema:CreativeWork
    378 https://doi.org/10.1101/gr.174302 schema:sameAs https://app.dimensions.ai/details/publication/pub.1036137080
    379 rdf:type schema:CreativeWork
    380 https://doi.org/10.1101/gr.176501 schema:sameAs https://app.dimensions.ai/details/publication/pub.1048081914
    381 rdf:type schema:CreativeWork
    382 https://doi.org/10.1101/gr.87702 schema:sameAs https://app.dimensions.ai/details/publication/pub.1035714287
    383 rdf:type schema:CreativeWork
    384 https://doi.org/10.1101/gr.gr-1619r schema:sameAs https://app.dimensions.ai/details/publication/pub.1060407554
    385 rdf:type schema:CreativeWork
    386 https://doi.org/10.1101/gr.gr-1660r schema:sameAs https://app.dimensions.ai/details/publication/pub.1060407559
    387 rdf:type schema:CreativeWork
    388 https://doi.org/10.1101/gr.gr1653r schema:sameAs https://app.dimensions.ai/details/publication/pub.1060407572
    389 rdf:type schema:CreativeWork
    390 https://doi.org/10.1126/science.1072104 schema:sameAs https://app.dimensions.ai/details/publication/pub.1045516288
    391 rdf:type schema:CreativeWork
    392 https://doi.org/10.1126/science.1076181 schema:sameAs https://app.dimensions.ai/details/publication/pub.1062446937
    393 rdf:type schema:CreativeWork
    394 https://doi.org/10.1126/science.1080049 schema:sameAs https://app.dimensions.ai/details/publication/pub.1047114664
    395 rdf:type schema:CreativeWork
    396 https://doi.org/10.1126/science.274.5287.546 schema:sameAs https://app.dimensions.ai/details/publication/pub.1062554574
    397 rdf:type schema:CreativeWork
    398 https://doi.org/10.1126/science.278.5338.609 schema:sameAs https://app.dimensions.ai/details/publication/pub.1062558435
    399 rdf:type schema:CreativeWork
    400 https://doi.org/10.1126/science.278.5338.631 schema:sameAs https://app.dimensions.ai/details/publication/pub.1039646901
    401 rdf:type schema:CreativeWork
    402 https://doi.org/10.1126/science.282.5396.2012 schema:sameAs https://app.dimensions.ai/details/publication/pub.1062563514
    403 rdf:type schema:CreativeWork
    404 https://doi.org/10.1126/science.287.5461.2185 schema:sameAs https://app.dimensions.ai/details/publication/pub.1019737566
    405 rdf:type schema:CreativeWork
    406 https://doi.org/10.1128/jb.183.16.4823-4838.2001 schema:sameAs https://app.dimensions.ai/details/publication/pub.1025210625
    407 rdf:type schema:CreativeWork
    408 https://doi.org/10.1128/mmbr.65.1.44-79.2001 schema:sameAs https://app.dimensions.ai/details/publication/pub.1021785027
    409 rdf:type schema:CreativeWork
    410 https://doi.org/10.1146/annurev.micro.55.1.709 schema:sameAs https://app.dimensions.ai/details/publication/pub.1020062045
    411 rdf:type schema:CreativeWork
    412 https://doi.org/10.2307/2412448 schema:sameAs https://app.dimensions.ai/details/publication/pub.1069920881
    413 rdf:type schema:CreativeWork
    414 https://www.grid.ac/institutes/grid.411667.3 schema:alternateName Georgetown University Medical Center
    415 schema:name Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, 20007, Washington, DC, USA
    416 rdf:type schema:Organization
    417 https://www.grid.ac/institutes/grid.419234.9 schema:alternateName National Center for Biotechnology Information
    418 schema:name National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
    419 rdf:type schema:Organization
     



