Ontology type: schema:ScholarlyArticle Open Access: True
2013-02
AUTHORSArtem Sokolov, Christopher Funk, Kiley Graim, Karin Verspoor, Asa Ben-Hur
ABSTRACTCombining heterogeneous sources of data is essential for accurate prediction of protein function. The task is complicated by the fact that while sequence-based features can be readily compared across species, most other data are species-specific. In this paper, we present a multi-view extension to GOstruct, a structured-output framework for function annotation of proteins. The extended framework can learn from disparate data sources, with each data source provided to the framework in the form of a kernel. Our empirical results demonstrate that the multi-view framework is able to utilize all available information, yielding better performance than sequence-based models trained across species and models trained from collections of data within a given species. This version of GOstruct participated in the recent Critical Assessment of Functional Annotations (CAFA) challenge; since then we have significantly improved the natural language processing component of the method, which now provides performance that is on par with that provided by sequence information. The GOstruct framework is available for download at http://strut.sourceforge.net. More... »
PAGESs10
http://scigraph.springernature.com/pub.10.1186/1471-2105-14-s3-s10
DOIhttp://dx.doi.org/10.1186/1471-2105-14-s3-s10
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1020835861
PUBMEDhttps://www.ncbi.nlm.nih.gov/pubmed/23514123
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0806",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information Systems",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information and Computing Sciences",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Algorithms",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Animals",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Computational Biology",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Gene Expression",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Mice",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Molecular Sequence Annotation",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Protein Interaction Mapping",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Proteins",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Software",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Vocabulary, Controlled",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "University of California, Santa Cruz",
"id": "https://www.grid.ac/institutes/grid.205975.c",
"name": [
"Department of Biomolecular Engineering, University of California Santa Cruz, 95064, Santa Cruz, California, USA"
],
"type": "Organization"
},
"familyName": "Sokolov",
"givenName": "Artem",
"id": "sg:person.01101321731.95",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01101321731.95"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "University of Colorado Anschutz Medical Campus",
"id": "https://www.grid.ac/institutes/grid.430503.1",
"name": [
"Computational Bioscience Program, University of Colorado School of Medicine, 80045, Aurora, Colorado, USA"
],
"type": "Organization"
},
"familyName": "Funk",
"givenName": "Christopher",
"id": "sg:person.01142564764.38",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01142564764.38"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "University of California, Santa Cruz",
"id": "https://www.grid.ac/institutes/grid.205975.c",
"name": [
"Department of Biomolecular Engineering, University of California Santa Cruz, 95064, Santa Cruz, California, USA"
],
"type": "Organization"
},
"familyName": "Graim",
"givenName": "Kiley",
"id": "sg:person.01243051406.16",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01243051406.16"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Data61",
"id": "https://www.grid.ac/institutes/grid.425461.0",
"name": [
"Computational Bioscience Program, University of Colorado School of Medicine, 80045, Aurora, Colorado, USA",
"National ICT Australia, Victoria Research Lab, 3010, Melbourne, Australia"
],
"type": "Organization"
},
"familyName": "Verspoor",
"givenName": "Karin",
"id": "sg:person.01372713104.04",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01372713104.04"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Colorado State University",
"id": "https://www.grid.ac/institutes/grid.47894.36",
"name": [
"Department of Computer Science, Colorado State University, 80523, Fort Collins, Colorado, USA"
],
"type": "Organization"
},
"familyName": "Ben-Hur",
"givenName": "Asa",
"id": "sg:person.01242755504.30",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01242755504.30"
],
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1186/1471-2105-6-s1-s21",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1000074431",
"https://doi.org/10.1186/1471-2105-6-s1-s21"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1752-0509-4-43",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1005678588",
"https://doi.org/10.1186/1752-0509-4-43"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btp122",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1005864951"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1002/prot.23029",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1005949120"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-6-s1-s18",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1006919385",
"https://doi.org/10.1186/1471-2105-6-s1-s18"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2008-9-s1-s3",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1006959314",
"https://doi.org/10.1186/gb-2008-9-s1-s3"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/s0022-2836(05)80360-2",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1013618994"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1006/jmbi.2000.4315",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1018016237"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-5-178",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1018640803",
"https://doi.org/10.1186/1471-2105-5-178"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1002/1097-0134(20001001)41:1<98::aid-prot120>3.0.co;2-s",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1019688976"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/ng0498-313",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1021131638",
"https://doi.org/10.1038/ng0498-313"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-12-s8-s2",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1022575918",
"https://doi.org/10.1186/1471-2105-12-s8-s2"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1145/1102351.1102464",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1022920538"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2008-9-s1-s4",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1024435781",
"https://doi.org/10.1186/gb-2008-9-s1-s4"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btk048",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1025615135"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gkg095",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1027045924"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/bth921",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031648236"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1753-6561-2-s4-s2",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1032045755",
"https://doi.org/10.1186/1753-6561-2-s4-s2"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gkn760",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1032318578"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1145/2147805.2147820",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1032813119"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-6-s1-s16",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1038229739",
"https://doi.org/10.1186/1471-2105-6-s1-s16"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2008-9-s1-s2",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1039025062",
"https://doi.org/10.1186/gb-2008-9-s1-s2"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2009-10-2-207",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1040399255",
"https://doi.org/10.1186/gb-2009-10-2-207"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-6-s1-s22",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1042479871",
"https://doi.org/10.1186/1471-2105-6-s1-s22"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/75556",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1044135237",
"https://doi.org/10.1038/75556"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/75556",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1044135237",
"https://doi.org/10.1038/75556"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2008-9-s1-s6",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1044176432",
"https://doi.org/10.1186/gb-2008-9-s1-s6"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s00018-003-3114-8",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1045087383",
"https://doi.org/10.1007/s00018-003-3114-8"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1145/279943.279962",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1045398430"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gkg582",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1047342895"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gkg555",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1047366540"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1002/prot.20903",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1052562570"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1002/prot.20903",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1052562570"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-13-207",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1052987047",
"https://doi.org/10.1186/1471-2105-13-207"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gkr440",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1053252157"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1142/s0219720010004744",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1063004958"
],
"type": "CreativeWork"
},
{
"id": "https://app.dimensions.ai/details/publication/pub.1074854609",
"type": "CreativeWork"
},
{
"id": "https://app.dimensions.ai/details/publication/pub.1076803194",
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1142/9789812704856_0029",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1096052467"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1142/9781860947292_0007",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1096052853"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.7551/mitpress/7443.001.0001",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1111386243"
],
"type": "CreativeWork"
}
],
"datePublished": "2013-02",
"datePublishedReg": "2013-02-01",
"description": "Combining heterogeneous sources of data is essential for accurate prediction of protein function. The task is complicated by the fact that while sequence-based features can be readily compared across species, most other data are species-specific. In this paper, we present a multi-view extension to GOstruct, a structured-output framework for function annotation of proteins. The extended framework can learn from disparate data sources, with each data source provided to the framework in the form of a kernel. Our empirical results demonstrate that the multi-view framework is able to utilize all available information, yielding better performance than sequence-based models trained across species and models trained from collections of data within a given species. This version of GOstruct participated in the recent Critical Assessment of Functional Annotations (CAFA) challenge; since then we have significantly improved the natural language processing component of the method, which now provides performance that is on par with that provided by sequence information. The GOstruct framework is available for download at http://strut.sourceforge.net.",
"genre": "research_article",
"id": "sg:pub.10.1186/1471-2105-14-s3-s10",
"inLanguage": [
"en"
],
"isAccessibleForFree": true,
"isFundedItemOf": [
{
"id": "sg:grant.3111397",
"type": "MonetaryGrant"
},
{
"id": "sg:grant.2681199",
"type": "MonetaryGrant"
},
{
"id": "sg:grant.3111370",
"type": "MonetaryGrant"
}
],
"isPartOf": [
{
"id": "sg:journal.1023786",
"issn": [
"1471-2105"
],
"name": "BMC Bioinformatics",
"type": "Periodical"
},
{
"issueNumber": "Suppl 3",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "14"
}
],
"name": "Combining heterogeneous data sources for accurate functional annotation of proteins",
"pagination": "s10",
"productId": [
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"634668d20faa1fac2f28657eaf9133af28ea3f242d72b6d0e4bba1601172a681"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"23514123"
]
},
{
"name": "nlm_unique_id",
"type": "PropertyValue",
"value": [
"100965194"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1186/1471-2105-14-s3-s10"
]
},
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1020835861"
]
}
],
"sameAs": [
"https://doi.org/10.1186/1471-2105-14-s3-s10",
"https://app.dimensions.ai/details/publication/pub.1020835861"
],
"sdDataset": "articles",
"sdDatePublished": "2019-04-11T08:57",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000325_0000000325/records_100809_00000000.jsonl",
"type": "ScholarlyArticle",
"url": "http://link.springer.com/10.1186/1471-2105-14-S3-S10"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-14-s3-s10'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-14-s3-s10'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-14-s3-s10'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-14-s3-s10'
This table displays all metadata directly associated to this object as RDF triples.
284 TRIPLES
21 PREDICATES
78 URIs
31 LITERALS
19 BLANK NODES