The COG database: an updated version includes eukaryotes


Ontology type: schema:ScholarlyArticle

Open Access: True


Article Info

DATE

2003-09-11

AUTHORS

Roman L Tatusov, Natalie D Fedorova, John D Jackson, Aviva R Jacobs, Boris Kiryutin, Eugene V Koonin, Dmitri M Krylov, Raja Mazumder, Sergei L Mekhedov, Anastasia N Nikolskaya, B Sridhar Rao, Sergei Smirnov, Alexander V Sverdlov, Sona Vasudevan, Yuri I Wolf, Jodie J Yin, Darren A Natale

ABSTRACT

Background: The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies.

Results: We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (~1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes.

Conclusion: The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.
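The coverage percentages quoted in the abstract can be checked directly from the protein counts it gives. The snippet below is only an illustrative sketch: the variable names and the `coverage` helper are assumptions introduced here, not part of the COG/KOG resources.

```python
# Protein counts as quoted in the abstract.
cog_proteins, cog_total = 138_458, 185_505   # proteins in COGs / all proteins in 66 genomes
kog_proteins, kog_total = 59_838, 110_655    # proteins in KOGs / all analyzed eukaryotic gene products


def coverage(included: int, total: int) -> float:
    """Return the percentage of proteins that were assigned to a cluster."""
    return 100.0 * included / total


cog_cov = coverage(cog_proteins, cog_total)
kog_cov = coverage(kog_proteins, kog_total)

print(f"COG coverage: {cog_cov:.1f}%")  # rounds to the ~75% stated in the abstract
print(f"KOG coverage: {kog_cov:.1f}%")  # rounds to the ~54% stated in the abstract
```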

PAGES

41

References to SciGraph publications

  • 2001-02-15. Initial sequencing and analysis of the human genome in NATURE
  • 2001-11-20. Constant relative rate of protein evolution and detection of functional diversification among bacterial, archaeal and eukaryotic proteins in GENOME BIOLOGY
  • 2002-10. Genome sequence of the human malaria parasite Plasmodium falciparum in NATURE
  • 2002-05-16. RIO: Analyzing proteomes by automated phylogenomics using resampled inference of orthologs in BMC BIOINFORMATICS
  • 2002-12. Initial sequencing and comparative analysis of the mouse genome in NATURE
  • 2002-11. Genome evolution in bacterial endosymbionts of insects in NATURE REVIEWS GENETICS
  • 2001-10. Genome sequence of Yersinia pestis, the causative agent of plague in NATURE
  • 2000-12. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana in NATURE
  • 2001-10-23. Genome trees constructed using five different approaches suggest new major bacterial clades in BMC ECOLOGY AND EVOLUTION
  • 2002-02. The genome sequence of Schizosaccharomyces pombe in NATURE
  • 2001-11. Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculi in NATURE
  • 2003-01-06. Algorithms for computing parsimonious evolutionary scenarios for genome evolution, the last universal common ancestor and dominance of horizontal gene transfer in the evolution of prokaryotes in BMC ECOLOGY AND EVOLUTION
  • 2000-11-06. Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs) in GENOME BIOLOGY
  • 2002-11. The origin and evolution of model organisms in NATURE REVIEWS GENETICS
  • 2001-10-25. Complete genome sequence of Salmonella enterica serovar Typhimurium LT2 in NATURE
Identifiers

    URI

    http://scigraph.springernature.com/pub.10.1186/1471-2105-4-41

    DOI

    http://dx.doi.org/10.1186/1471-2105-4-41

    DIMENSIONS

    https://app.dimensions.ai/details/publication/pub.1013163036

    PUBMED

    https://www.ncbi.nlm.nih.gov/pubmed/12969510



    JSON-LD is the canonical representation for SciGraph data.


    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Biological Sciences", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Genetics", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Animals", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Databases, Nucleic Acid", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Databases, Protein", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Eukaryotic Cells", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Evolution, Molecular", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Humans", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "National Institutes of Health (U.S.)", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Proteins", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Terminology as Topic", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "United States", 
            "type": "DefinedTerm"
          }
        ], 
        "author": [
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Tatusov", 
            "givenName": "Roman L", 
            "id": "sg:person.01360057351.65", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01360057351.65"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Fedorova", 
            "givenName": "Natalie D", 
            "id": "sg:person.01161731273.64", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01161731273.64"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Jackson", 
            "givenName": "John D", 
            "id": "sg:person.01372217634.12", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01372217634.12"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Jacobs", 
            "givenName": "Aviva R", 
            "id": "sg:person.0601003574.66", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0601003574.66"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Kiryutin", 
            "givenName": "Boris", 
            "id": "sg:person.0702773360.18", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0702773360.18"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Koonin", 
            "givenName": "Eugene V", 
            "id": "sg:person.01017015051.78", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01017015051.78"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Krylov", 
            "givenName": "Dmitri M", 
            "id": "sg:person.0763345374.05", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0763345374.05"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, 20007, Washington, DC, USA", 
              "id": "http://www.grid.ac/institutes/grid.411667.3", 
              "name": [
                "Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, 20007, Washington, DC, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Mazumder", 
            "givenName": "Raja", 
            "id": "sg:person.01036614410.14", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01036614410.14"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Mekhedov", 
            "givenName": "Sergei L", 
            "id": "sg:person.014017517447.33", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014017517447.33"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, 20007, Washington, DC, USA", 
              "id": "http://www.grid.ac/institutes/grid.411667.3", 
              "name": [
                "Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, 20007, Washington, DC, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Nikolskaya", 
            "givenName": "Anastasia N", 
            "id": "sg:person.01145707174.35", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01145707174.35"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Rao", 
            "givenName": "B Sridhar", 
            "id": "sg:person.01214022374.98", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01214022374.98"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Smirnov", 
            "givenName": "Sergei", 
            "id": "sg:person.014003770757.69", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014003770757.69"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Sverdlov", 
            "givenName": "Alexander V", 
            "id": "sg:person.01330250774.14", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01330250774.14"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Vasudevan", 
            "givenName": "Sona", 
            "id": "sg:person.0733604225.37", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0733604225.37"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Wolf", 
            "givenName": "Yuri I", 
            "id": "sg:person.0634453251.89", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0634453251.89"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA", 
              "id": "http://www.grid.ac/institutes/grid.419234.9", 
              "name": [
                "National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Yin", 
            "givenName": "Jodie J", 
            "id": "sg:person.0721073274.50", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0721073274.50"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, 20007, Washington, DC, USA", 
              "id": "http://www.grid.ac/institutes/grid.411667.3", 
              "name": [
                "Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, 20007, Washington, DC, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Natale", 
            "givenName": "Darren A", 
            "id": "sg:person.01055364702.08", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01055364702.08"
            ], 
            "type": "Person"
          }
        ], 
        "citation": [
          {
            "id": "sg:pub.10.1186/1471-2148-3-2", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1047997844", 
              "https://doi.org/10.1186/1471-2148-3-2"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/35048692", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1044298669", 
              "https://doi.org/10.1038/35048692"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature01097", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1030736656", 
              "https://doi.org/10.1038/nature01097"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/35057062", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1042854081", 
              "https://doi.org/10.1038/35057062"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1471-2148-1-8", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1035042918", 
              "https://doi.org/10.1186/1471-2148-1-8"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature724", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1009114411", 
              "https://doi.org/10.1038/nature724"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/1471-2105-3-14", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1041450232", 
              "https://doi.org/10.1186/1471-2105-3-14"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nrg931", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1039727832", 
              "https://doi.org/10.1038/nrg931"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/35106579", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1020615079", 
              "https://doi.org/10.1038/35106579"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/gb-2001-2-12-research0053", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1036924176", 
              "https://doi.org/10.1186/gb-2001-2-12-research0053"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1186/gb-2000-1-5-research0009", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1007636928", 
              "https://doi.org/10.1186/gb-2000-1-5-research0009"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/35101614", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1028123916", 
              "https://doi.org/10.1038/35101614"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nrg929", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1041641433", 
              "https://doi.org/10.1038/nrg929"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/35097083", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1007834691", 
              "https://doi.org/10.1038/35097083"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/nature01262", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1039854529", 
              "https://doi.org/10.1038/nature01262"
            ], 
            "type": "CreativeWork"
          }
        ], 
        "datePublished": "2003-09-11", 
        "datePublishedReg": "2003-09-11", 
        "description": "BackgroundThe availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies.ResultsWe describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after euk aryotic o rthologous g roups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The euk aryotic o rthologous g roups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. 
This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (~1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes.ConclusionThe updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.", 
        "genre": "article", 
        "id": "sg:pub.10.1186/1471-2105-4-41", 
        "inLanguage": "en", 
        "isAccessibleForFree": true, 
        "isPartOf": [
          {
            "id": "sg:journal.1023786", 
            "issn": [
              "1471-2105"
            ], 
            "name": "BMC Bioinformatics", 
            "publisher": "Springer Nature", 
            "type": "Periodical"
          }, 
          {
            "issueNumber": "1", 
            "type": "PublicationIssue"
          }, 
          {
            "type": "PublicationVolume", 
            "volumeNumber": "4"
          }
        ], 
        "keywords": [
          "eukaryotic genomes", 
          "functional annotation", 
          "evolutionary studies", 
          "microsporidian parasite Encephalitozoon cuniculi", 
          "large-scale evolutionary studies", 
          "clusters of orthologs", 
          "genomes of prokaryotes", 
          "orthologous protein sets", 
          "greater evolutionary stability", 
          "complete genome sequence", 
          "orthologous relationships", 
          "complex eukaryotes", 
          "unicellular eukaryotes", 
          "orthologous groups", 
          "Arabidopsis thaliana", 
          "phyletic patterns", 
          "comparative genomics", 
          "eukaryotic genes", 
          "unicellular organisms", 
          "prokaryotic genomes", 
          "genome sequence", 
          "eukaryotes", 
          "protein sets", 
          "COG database", 
          "gene products", 
          "conserved portion", 
          "genome", 
          "evolutionary classification", 
          "evolutionary stability", 
          "prokaryotes", 
          "KOG", 
          "genes", 
          "protein", 
          "orthologs", 
          "Encephalitozoon cuniculi", 
          "KOGs", 
          "annotation", 
          "thaliana", 
          "clade", 
          "genomics", 
          "major update", 
          "useful platform", 
          "fungi", 
          "COG", 
          "organisms", 
          "species", 
          "plants", 
          "small fraction", 
          "sequence", 
          "clusters", 
          "cuniculi", 
          "small number", 
          "portion", 
          "animals", 
          "substantial increase", 
          "availability", 
          "patterns", 
          "collection", 
          "study", 
          "set", 
          "delineation", 
          "addition", 
          "fraction", 
          "number", 
          "products", 
          "ResultsWe", 
          "system", 
          "increase", 
          "part", 
          "relationship", 
          "consist", 
          "differences", 
          "database", 
          "relative compactness", 
          "stability", 
          "opportunities", 
          "platform", 
          "core", 
          "group", 
          "coverage", 
          "classification system", 
          "natural framework", 
          "construction", 
          "update", 
          "classification", 
          "demand", 
          "roup", 
          "ConclusionThe", 
          "construction of clusters", 
          "examination", 
          "compactness", 
          "version", 
          "framework"
        ], 
        "name": "The COG database: an updated version includes eukaryotes", 
        "pagination": "41", 
        "productId": [
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "pub.1013163036"
            ]
          }, 
          {
            "name": "doi", 
            "type": "PropertyValue", 
            "value": [
              "10.1186/1471-2105-4-41"
            ]
          }, 
          {
            "name": "pubmed_id", 
            "type": "PropertyValue", 
            "value": [
              "12969510"
            ]
          }
        ], 
        "sameAs": [
          "https://doi.org/10.1186/1471-2105-4-41", 
          "https://app.dimensions.ai/details/publication/pub.1013163036"
        ], 
        "sdDataset": "articles", 
        "sdDatePublished": "2022-05-20T07:22", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-springernature-scigraph/baseset/20220519/entities/gbq_results/article/article_371.jsonl", 
        "type": "ScholarlyArticle", 
        "url": "https://doi.org/10.1186/1471-2105-4-41"
      }
    ]
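As a rough illustration of how a record shaped like the JSON-LD above might be consumed, the following Python sketch collects the `productId` entries into a dictionary. The sample is trimmed to fields visible in the record; the helper name `product_ids` is hypothetical and is not part of any SciGraph tooling.

```python
import json

# Trimmed sample that reuses only fields visible in the record above.
SAMPLE = """
[
  {
    "name": "The COG database: an updated version includes eukaryotes",
    "pagination": "41",
    "productId": [
      {"name": "dimensions_id", "type": "PropertyValue", "value": ["pub.1013163036"]},
      {"name": "doi", "type": "PropertyValue", "value": ["10.1186/1471-2105-4-41"]},
      {"name": "pubmed_id", "type": "PropertyValue", "value": ["12969510"]}
    ],
    "type": "ScholarlyArticle",
    "url": "https://doi.org/10.1186/1471-2105-4-41"
  }
]
"""


def product_ids(record):
    """Map each productId name to its first value (hypothetical helper)."""
    ids = {}
    for entry in record.get("productId", []):
        values = entry.get("value", [])
        if values:  # skip entries that carry no value
            ids[entry["name"]] = values[0]
    return ids


records = json.loads(SAMPLE)  # the record is wrapped in a one-element array
ids = product_ids(records[0])
print(ids["doi"])
```

Taking only the first element of each `value` list is a simplification that holds for this record, where every identifier has exactly one value.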
     


    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular linked data format that is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-4-41'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-4-41'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-4-41'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-4-41'
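The four curl calls above differ only in the `Accept` header, so the same content negotiation could be sketched in Python with the standard library. The URL and media types are copied from the commands above; the helper names `build_request` and `fetch` are hypothetical, and the sketch assumes the endpoint still responds as this page describes.

```python
from urllib.request import Request, urlopen

# URL and media types copied from the curl commands above.
RECORD_URL = "https://scigraph.springernature.com/pub.10.1186/1471-2105-4-41"
MEDIA_TYPES = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(fmt, url=RECORD_URL):
    """Build a GET request for one RDF serialization (hypothetical helper)."""
    if fmt not in MEDIA_TYPES:
        raise ValueError(f"unknown format: {fmt!r}")
    return Request(url, headers={"Accept": MEDIA_TYPES[fmt]})


def fetch(fmt, url=RECORD_URL, timeout=30):
    """Send the request and return the body as text; needs network access."""
    with urlopen(build_request(fmt, url), timeout=timeout) as response:
        return response.read().decode("utf-8")


# Example call (not executed here because it needs network access):
#     turtle_text = fetch("turtle")
```

Keeping request construction separate from the network call makes the header logic easy to check without contacting the server.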


     

    This table displays all metadata directly associated with this object as RDF triples.

    369 TRIPLES      22 PREDICATES      144 URIs      121 LITERALS      17 BLANK NODES

    Subject Predicate Object
    1 sg:pub.10.1186/1471-2105-4-41 schema:about N0dad50dbab7e4d5ea10b91873e39354a
    2 N0e3fad2b52344f88a36f0be13f1f401e
    3 N0f85c1aa7965482a982b16e87d969221
    4 N227fe745fa8f4bb6830642ed18053ccd
    5 N4dcd75639bc7410ea882a6dfe1f5da1e
    6 N5ab8c105b1044ab191deb73e24428c04
    7 N5d8fe054ddd444cd956c97170c1cacd1
    8 Nb1cf3778e0954051b67631540319b6ca
    9 Nb5f608e03f2a4b0f816a1fcfab5490c5
    10 Ne884547f4e9f4f959b12459f36856bc4
    11 anzsrc-for:06
    12 anzsrc-for:0604
    13 schema:author N3fc520ebf8834fe79e4f7895e4e09a24
    14 schema:citation sg:pub.10.1038/35048692
    15 sg:pub.10.1038/35057062
    16 sg:pub.10.1038/35097083
    17 sg:pub.10.1038/35101614
    18 sg:pub.10.1038/35106579
    19 sg:pub.10.1038/nature01097
    20 sg:pub.10.1038/nature01262
    21 sg:pub.10.1038/nature724
    22 sg:pub.10.1038/nrg929
    23 sg:pub.10.1038/nrg931
    24 sg:pub.10.1186/1471-2105-3-14
    25 sg:pub.10.1186/1471-2148-1-8
    26 sg:pub.10.1186/1471-2148-3-2
    27 sg:pub.10.1186/gb-2000-1-5-research0009
    28 sg:pub.10.1186/gb-2001-2-12-research0053
    29 schema:datePublished 2003-09-11
    30 schema:datePublishedReg 2003-09-11
    31 schema:description Background: The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. Results: We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (~1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes. Conclusion: The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.
    32 schema:genre article
    33 schema:inLanguage en
    34 schema:isAccessibleForFree true
    35 schema:isPartOf N46c2c7e087b04d3b95c68ec5d57dde1c
    36 Ne576d88e8e2b4abb956d3bf847770c09
    37 sg:journal.1023786
    38 schema:keywords Arabidopsis thaliana
    39 COG
    40 COG database
    41 ConclusionThe
    42 Encephalitozoon cuniculi
    43 KOG
    44 KOGs
    45 ResultsWe
    46 addition
    47 animals
    48 annotation
    49 availability
    50 clade
    51 classification
    52 classification system
    53 clusters
    54 clusters of orthologs
    55 collection
    56 compactness
    57 comparative genomics
    58 complete genome sequence
    59 complex eukaryotes
    60 conserved portion
    61 consist
    62 construction
    63 construction of clusters
    64 core
    65 coverage
    66 cuniculi
    67 database
    68 delineation
    69 demand
    70 differences
    71 eukaryotes
    72 eukaryotic genes
    73 eukaryotic genomes
    74 evolutionary classification
    75 evolutionary stability
    76 evolutionary studies
    77 examination
    78 fraction
    79 framework
    80 functional annotation
    81 fungi
    82 gene products
    83 genes
    84 genome
    85 genome sequence
    86 genomes of prokaryotes
    87 genomics
    88 greater evolutionary stability
    89 group
    90 increase
    91 large-scale evolutionary studies
    92 major update
    93 microsporidian parasite Encephalitozoon cuniculi
    94 natural framework
    95 number
    96 opportunities
    97 organisms
    98 orthologous groups
    99 orthologous protein sets
    100 orthologous relationships
    101 orthologs
    102 part
    103 patterns
    104 phyletic patterns
    105 plants
    106 platform
    107 portion
    108 products
    109 prokaryotes
    110 prokaryotic genomes
    111 protein
    112 protein sets
    113 relationship
    114 relative compactness
    115 roup
    116 sequence
    117 set
    118 small fraction
    119 small number
    120 species
    121 stability
    122 study
    123 substantial increase
    124 system
    125 thaliana
    126 unicellular eukaryotes
    127 unicellular organisms
    128 update
    129 useful platform
    130 version
    131 schema:name The COG database: an updated version includes eukaryotes
    132 schema:pagination 41
    133 schema:productId N70221f3bb55640708ec9f42e3efda29c
    134 N8d0d3a5a287f429f911cc24fcd52c83d
    135 Nd1328da805014a8ca609904a77e9a5e4
    136 schema:sameAs https://app.dimensions.ai/details/publication/pub.1013163036
    137 https://doi.org/10.1186/1471-2105-4-41
    138 schema:sdDatePublished 2022-05-20T07:22
    139 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    140 schema:sdPublisher Nf552f049a4594076a2aeaa4df44e65de
    141 schema:url https://doi.org/10.1186/1471-2105-4-41
    142 sgo:license sg:explorer/license/
    143 sgo:sdDataset articles
    144 rdf:type schema:ScholarlyArticle
    145 N0dad50dbab7e4d5ea10b91873e39354a schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    146 schema:name Animals
    147 rdf:type schema:DefinedTerm
    148 N0e3fad2b52344f88a36f0be13f1f401e schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    149 schema:name Databases, Nucleic Acid
    150 rdf:type schema:DefinedTerm
    151 N0f85c1aa7965482a982b16e87d969221 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    152 schema:name Humans
    153 rdf:type schema:DefinedTerm
    154 N1b2b43da3436471da08bf4a4d179a9e8 rdf:first sg:person.01036614410.14
    155 rdf:rest N8ff69ac3601e4ff09eee943a4a2f660f
    156 N21393dd4876543d9875a1e83d8369b23 rdf:first sg:person.0721073274.50
    157 rdf:rest Nad4a367d69bf484b952d051931e00893
    158 N227fe745fa8f4bb6830642ed18053ccd schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    159 schema:name Proteins
    160 rdf:type schema:DefinedTerm
    161 N26d5e1f3de6d467fabb4be3a22fc320b rdf:first sg:person.01330250774.14
    162 rdf:rest N612f00521c0944728cf442dfdf77b6fe
    163 N3fc520ebf8834fe79e4f7895e4e09a24 rdf:first sg:person.01360057351.65
    164 rdf:rest Nb8cb6b83a90045d88a722e4639998210
    165 N46c2c7e087b04d3b95c68ec5d57dde1c schema:issueNumber 1
    166 rdf:type schema:PublicationIssue
    167 N4dcd75639bc7410ea882a6dfe1f5da1e schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    168 schema:name National Institutes of Health (U.S.)
    169 rdf:type schema:DefinedTerm
    170 N5ab8c105b1044ab191deb73e24428c04 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    171 schema:name Evolution, Molecular
    172 rdf:type schema:DefinedTerm
    173 N5d8fe054ddd444cd956c97170c1cacd1 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    174 schema:name United States
    175 rdf:type schema:DefinedTerm
    176 N612f00521c0944728cf442dfdf77b6fe rdf:first sg:person.0733604225.37
    177 rdf:rest Naf94af51b71a46fa82859b16c8add6d0
    178 N70221f3bb55640708ec9f42e3efda29c schema:name dimensions_id
    179 schema:value pub.1013163036
    180 rdf:type schema:PropertyValue
    181 N8d0d3a5a287f429f911cc24fcd52c83d schema:name pubmed_id
    182 schema:value 12969510
    183 rdf:type schema:PropertyValue
    184 N8e40bdba10954beb9352d20959cc947b rdf:first sg:person.0601003574.66
    185 rdf:rest Nade5db3b8f7c41d5a5e9db5c3979a9f2
    186 N8ff69ac3601e4ff09eee943a4a2f660f rdf:first sg:person.014017517447.33
    187 rdf:rest Na8f70da72b8c4102857d65a48d2776d3
    188 Na3cc0fcb8d0f4e23837e0fefba5b67d0 rdf:first sg:person.0763345374.05
    189 rdf:rest N1b2b43da3436471da08bf4a4d179a9e8
    190 Na8f70da72b8c4102857d65a48d2776d3 rdf:first sg:person.01145707174.35
    191 rdf:rest Ne064631340804a039c761a56f5f32eb3
    192 Naceabb89240a491290391ed89f27360a rdf:first sg:person.014003770757.69
    193 rdf:rest N26d5e1f3de6d467fabb4be3a22fc320b
    194 Nad4a367d69bf484b952d051931e00893 rdf:first sg:person.01055364702.08
    195 rdf:rest rdf:nil
    196 Nade5db3b8f7c41d5a5e9db5c3979a9f2 rdf:first sg:person.0702773360.18
    197 rdf:rest Nef7a86eb68e14d2db205429a2e6d132e
    198 Naf94af51b71a46fa82859b16c8add6d0 rdf:first sg:person.0634453251.89
    199 rdf:rest N21393dd4876543d9875a1e83d8369b23
    200 Nb1cf3778e0954051b67631540319b6ca schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    201 schema:name Terminology as Topic
    202 rdf:type schema:DefinedTerm
    203 Nb5f608e03f2a4b0f816a1fcfab5490c5 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    204 schema:name Eukaryotic Cells
    205 rdf:type schema:DefinedTerm
    206 Nb8cb6b83a90045d88a722e4639998210 rdf:first sg:person.01161731273.64
    207 rdf:rest Nd7ca3101f9284cceb881059fef32207a
    208 Nd1328da805014a8ca609904a77e9a5e4 schema:name doi
    209 schema:value 10.1186/1471-2105-4-41
    210 rdf:type schema:PropertyValue
    211 Nd7ca3101f9284cceb881059fef32207a rdf:first sg:person.01372217634.12
    212 rdf:rest N8e40bdba10954beb9352d20959cc947b
    213 Ne064631340804a039c761a56f5f32eb3 rdf:first sg:person.01214022374.98
    214 rdf:rest Naceabb89240a491290391ed89f27360a
    215 Ne576d88e8e2b4abb956d3bf847770c09 schema:volumeNumber 4
    216 rdf:type schema:PublicationVolume
    217 Ne884547f4e9f4f959b12459f36856bc4 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    218 schema:name Databases, Protein
    219 rdf:type schema:DefinedTerm
    220 Nef7a86eb68e14d2db205429a2e6d132e rdf:first sg:person.01017015051.78
    221 rdf:rest Na3cc0fcb8d0f4e23837e0fefba5b67d0
    222 Nf552f049a4594076a2aeaa4df44e65de schema:name Springer Nature - SN SciGraph project
    223 rdf:type schema:Organization
    224 anzsrc-for:06 schema:inDefinedTermSet anzsrc-for:
    225 schema:name Biological Sciences
    226 rdf:type schema:DefinedTerm
    227 anzsrc-for:0604 schema:inDefinedTermSet anzsrc-for:
    228 schema:name Genetics
    229 rdf:type schema:DefinedTerm
    230 sg:journal.1023786 schema:issn 1471-2105
    231 schema:name BMC Bioinformatics
    232 schema:publisher Springer Nature
    233 rdf:type schema:Periodical
    234 sg:person.01017015051.78 schema:affiliation grid-institutes:grid.419234.9
    235 schema:familyName Koonin
    236 schema:givenName Eugene V
    237 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01017015051.78
    238 rdf:type schema:Person
    239 sg:person.01036614410.14 schema:affiliation grid-institutes:grid.411667.3
    240 schema:familyName Mazumder
    241 schema:givenName Raja
    242 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01036614410.14
    243 rdf:type schema:Person
    244 sg:person.01055364702.08 schema:affiliation grid-institutes:grid.411667.3
    245 schema:familyName Natale
    246 schema:givenName Darren A
    247 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01055364702.08
    248 rdf:type schema:Person
    249 sg:person.01145707174.35 schema:affiliation grid-institutes:grid.411667.3
    250 schema:familyName Nikolskaya
    251 schema:givenName Anastasia N
    252 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01145707174.35
    253 rdf:type schema:Person
    254 sg:person.01161731273.64 schema:affiliation grid-institutes:grid.419234.9
    255 schema:familyName Fedorova
    256 schema:givenName Natalie D
    257 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01161731273.64
    258 rdf:type schema:Person
    259 sg:person.01214022374.98 schema:affiliation grid-institutes:grid.419234.9
    260 schema:familyName Rao
    261 schema:givenName B Sridhar
    262 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01214022374.98
    263 rdf:type schema:Person
    264 sg:person.01330250774.14 schema:affiliation grid-institutes:grid.419234.9
    265 schema:familyName Sverdlov
    266 schema:givenName Alexander V
    267 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01330250774.14
    268 rdf:type schema:Person
    269 sg:person.01360057351.65 schema:affiliation grid-institutes:grid.419234.9
    270 schema:familyName Tatusov
    271 schema:givenName Roman L
    272 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01360057351.65
    273 rdf:type schema:Person
    274 sg:person.01372217634.12 schema:affiliation grid-institutes:grid.419234.9
    275 schema:familyName Jackson
    276 schema:givenName John D
    277 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01372217634.12
    278 rdf:type schema:Person
    279 sg:person.014003770757.69 schema:affiliation grid-institutes:grid.419234.9
    280 schema:familyName Smirnov
    281 schema:givenName Sergei
    282 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014003770757.69
    283 rdf:type schema:Person
    284 sg:person.014017517447.33 schema:affiliation grid-institutes:grid.419234.9
    285 schema:familyName Mekhedov
    286 schema:givenName Sergei L
    287 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014017517447.33
    288 rdf:type schema:Person
    289 sg:person.0601003574.66 schema:affiliation grid-institutes:grid.419234.9
    290 schema:familyName Jacobs
    291 schema:givenName Aviva R
    292 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0601003574.66
    293 rdf:type schema:Person
    294 sg:person.0634453251.89 schema:affiliation grid-institutes:grid.419234.9
    295 schema:familyName Wolf
    296 schema:givenName Yuri I
    297 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0634453251.89
    298 rdf:type schema:Person
    299 sg:person.0702773360.18 schema:affiliation grid-institutes:grid.419234.9
    300 schema:familyName Kiryutin
    301 schema:givenName Boris
    302 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0702773360.18
    303 rdf:type schema:Person
    304 sg:person.0721073274.50 schema:affiliation grid-institutes:grid.419234.9
    305 schema:familyName Yin
    306 schema:givenName Jodie J
    307 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0721073274.50
    308 rdf:type schema:Person
    309 sg:person.0733604225.37 schema:affiliation grid-institutes:grid.419234.9
    310 schema:familyName Vasudevan
    311 schema:givenName Sona
    312 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0733604225.37
    313 rdf:type schema:Person
    314 sg:person.0763345374.05 schema:affiliation grid-institutes:grid.419234.9
    315 schema:familyName Krylov
    316 schema:givenName Dmitri M
    317 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0763345374.05
    318 rdf:type schema:Person
    319 sg:pub.10.1038/35048692 schema:sameAs https://app.dimensions.ai/details/publication/pub.1044298669
    320 https://doi.org/10.1038/35048692
    321 rdf:type schema:CreativeWork
    322 sg:pub.10.1038/35057062 schema:sameAs https://app.dimensions.ai/details/publication/pub.1042854081
    323 https://doi.org/10.1038/35057062
    324 rdf:type schema:CreativeWork
    325 sg:pub.10.1038/35097083 schema:sameAs https://app.dimensions.ai/details/publication/pub.1007834691
    326 https://doi.org/10.1038/35097083
    327 rdf:type schema:CreativeWork
    328 sg:pub.10.1038/35101614 schema:sameAs https://app.dimensions.ai/details/publication/pub.1028123916
    329 https://doi.org/10.1038/35101614
    330 rdf:type schema:CreativeWork
    331 sg:pub.10.1038/35106579 schema:sameAs https://app.dimensions.ai/details/publication/pub.1020615079
    332 https://doi.org/10.1038/35106579
    333 rdf:type schema:CreativeWork
    334 sg:pub.10.1038/nature01097 schema:sameAs https://app.dimensions.ai/details/publication/pub.1030736656
    335 https://doi.org/10.1038/nature01097
    336 rdf:type schema:CreativeWork
    337 sg:pub.10.1038/nature01262 schema:sameAs https://app.dimensions.ai/details/publication/pub.1039854529
    338 https://doi.org/10.1038/nature01262
    339 rdf:type schema:CreativeWork
    340 sg:pub.10.1038/nature724 schema:sameAs https://app.dimensions.ai/details/publication/pub.1009114411
    341 https://doi.org/10.1038/nature724
    342 rdf:type schema:CreativeWork
    343 sg:pub.10.1038/nrg929 schema:sameAs https://app.dimensions.ai/details/publication/pub.1041641433
    344 https://doi.org/10.1038/nrg929
    345 rdf:type schema:CreativeWork
    346 sg:pub.10.1038/nrg931 schema:sameAs https://app.dimensions.ai/details/publication/pub.1039727832
    347 https://doi.org/10.1038/nrg931
    348 rdf:type schema:CreativeWork
    349 sg:pub.10.1186/1471-2105-3-14 schema:sameAs https://app.dimensions.ai/details/publication/pub.1041450232
    350 https://doi.org/10.1186/1471-2105-3-14
    351 rdf:type schema:CreativeWork
    352 sg:pub.10.1186/1471-2148-1-8 schema:sameAs https://app.dimensions.ai/details/publication/pub.1035042918
    353 https://doi.org/10.1186/1471-2148-1-8
    354 rdf:type schema:CreativeWork
    355 sg:pub.10.1186/1471-2148-3-2 schema:sameAs https://app.dimensions.ai/details/publication/pub.1047997844
    356 https://doi.org/10.1186/1471-2148-3-2
    357 rdf:type schema:CreativeWork
    358 sg:pub.10.1186/gb-2000-1-5-research0009 schema:sameAs https://app.dimensions.ai/details/publication/pub.1007636928
    359 https://doi.org/10.1186/gb-2000-1-5-research0009
    360 rdf:type schema:CreativeWork
    361 sg:pub.10.1186/gb-2001-2-12-research0053 schema:sameAs https://app.dimensions.ai/details/publication/pub.1036924176
    362 https://doi.org/10.1186/gb-2001-2-12-research0053
    363 rdf:type schema:CreativeWork
    364 grid-institutes:grid.411667.3 schema:alternateName Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, 20007, Washington, DC, USA
    365 schema:name Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, 20007, Washington, DC, USA
    366 rdf:type schema:Organization
    367 grid-institutes:grid.419234.9 schema:alternateName National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
    368 schema:name National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
    369 rdf:type schema:Organization
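In the table above, the `schema:author` value is an RDF collection: a chain of blank nodes in which each node's `rdf:first` holds one author and `rdf:rest` points to the next node, ending at `rdf:nil`. A minimal Python sketch of walking such a chain follows. The function name `walk_rdf_list` and the two-dictionary representation are assumptions made for illustration; the data are the last three cells of the author list, copied from the table.

```python
RDF_NIL = "rdf:nil"


def walk_rdf_list(head, first, rest):
    """Return the items of an RDF collection, in order, starting at `head`."""
    items = []
    seen = set()
    node = head
    while node != RDF_NIL:
        if node in seen:  # guard against malformed, cyclic lists
            raise ValueError(f"cycle detected at {node}")
        seen.add(node)
        items.append(first[node])
        node = rest[node]
    return items


# Last three cells of the author list, copied from the triples above.
first = {
    "Naf94af51b71a46fa82859b16c8add6d0": "sg:person.0634453251.89",
    "N21393dd4876543d9875a1e83d8369b23": "sg:person.0721073274.50",
    "Nad4a367d69bf484b952d051931e00893": "sg:person.01055364702.08",
}
rest = {
    "Naf94af51b71a46fa82859b16c8add6d0": "N21393dd4876543d9875a1e83d8369b23",
    "N21393dd4876543d9875a1e83d8369b23": "Nad4a367d69bf484b952d051931e00893",
    "Nad4a367d69bf484b952d051931e00893": RDF_NIL,
}

tail_authors = walk_rdf_list("Naf94af51b71a46fa82859b16c8add6d0", first, rest)
print(tail_authors)
```

Starting instead from the node referenced by `schema:author` (row 13) and supplying all 17 cells would yield the full author order in the same way.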
     



