Ontology type: schema:ScholarlyArticle Open Access: True
2006-03
AUTHORSAsa Ben-Hur, William Stafford Noble
ABSTRACTThe protein-protein interaction networks of even well-studied model organisms are sketchy at best, highlighting the continued need for computational methods to help direct experimentalists in the search for novel interactions. This need has prompted the development of a number of methods for predicting protein-protein interactions based on various sources of data and methodologies. The common method for choosing negative examples for training a predictor of protein-protein interactions is based on annotations of cellular localization, and the observation that pairs of proteins that have different localization patterns are unlikely to interact. While this method leads to high quality sets of non-interacting proteins, we find that this choice can lead to biased estimates of prediction accuracy, because the constraints placed on the distribution of the negative examples makes the task easier. The effects of this bias are demonstrated in the context of both sequence-based and non-sequence based features used for predicting protein-protein interactions. More... »
PAGESs2
http://scigraph.springernature.com/pub.10.1186/1471-2105-7-s1-s2
DOIhttp://dx.doi.org/10.1186/1471-2105-7-s1-s2
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1038206691
PUBMEDhttps://www.ncbi.nlm.nih.gov/pubmed/16723005
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0601",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Biochemistry and Cell Biology",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Biological Sciences",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Algorithms",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Binding Sites",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Computational Biology",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Databases, Protein",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Molecular Conformation",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Oligonucleotide Array Sequence Analysis",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Phosphorylation",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Protein Folding",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Protein Interaction Mapping",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Proteins",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Proteomics",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "ROC Curve",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Sequence Alignment",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Software",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "University of Colorado Boulder",
"id": "https://www.grid.ac/institutes/grid.266190.a",
"name": [
"Department of Computer Science, Colorado State University, Fort Collins, CO, USA",
"Department of Computer Science, University of Colorado, Boulder, CO, USA"
],
"type": "Organization"
},
"familyName": "Ben-Hur",
"givenName": "Asa",
"id": "sg:person.01242755504.30",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01242755504.30"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "University of Washington",
"id": "https://www.grid.ac/institutes/grid.34477.33",
"name": [
"Department of Genome Sciences, University of Washington, Seattle, WA, USA",
"Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA"
],
"type": "Organization"
},
"familyName": "Noble",
"givenName": "William Stafford",
"id": "sg:person.01334532172.13",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01334532172.13"
],
"type": "Person"
}
],
"citation": [
{
"id": "https://doi.org/10.1074/mcp.m100037-mcp200",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1000521469"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-5-38",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1002506753",
"https://doi.org/10.1186/1471-2105-5-38"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gkg466",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1007007211"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/bth483",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1007650574"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/29.1.242",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1009009357"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1126/science.285.5428.751",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1013411081"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btg1002",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1013915575"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-5-154",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1014463760",
"https://doi.org/10.1186/1471-2105-5-154"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/30.1.303",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1014503362"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nature750",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1017837373",
"https://doi.org/10.1038/nature750"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nature750",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1017837373",
"https://doi.org/10.1038/nature750"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gki060",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1020293550"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1006/jmbi.2001.4920",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1020884192"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1002/prot.10074",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1026276784"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/s0022-2836(03)00239-0",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1030571302"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/s0022-2836(03)00239-0",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1030571302"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1101/gr.153002",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031289896"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/28.1.37",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031366016"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.mib.2004.08.012",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1032188145"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1073/pnas.102102699",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1034359388"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1145/130385.130401",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1036379424"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/s0022-2836(03)00114-1",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1036437267"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btg153",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1039614576"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/75556",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1044135237",
"https://doi.org/10.1038/75556"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/75556",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1044135237",
"https://doi.org/10.1038/75556"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1091/mbc.11.12.4241",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1048274369"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1126/science.1087361",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1048921070"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btg352",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1049200636"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/bti1016",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1050590164"
],
"type": "CreativeWork"
},
{
"id": "https://app.dimensions.ai/details/publication/pub.1075024926",
"type": "CreativeWork"
},
{
"id": "https://app.dimensions.ai/details/publication/pub.1077020040",
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1142/9789812702456_0050",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1096063152"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1142/9789812799623_0053",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1096080294"
],
"type": "CreativeWork"
}
],
"datePublished": "2006-03",
"datePublishedReg": "2006-03-01",
"description": "The protein-protein interaction networks of even well-studied model organisms are sketchy at best, highlighting the continued need for computational methods to help direct experimentalists in the search for novel interactions. This need has prompted the development of a number of methods for predicting protein-protein interactions based on various sources of data and methodologies. The common method for choosing negative examples for training a predictor of protein-protein interactions is based on annotations of cellular localization, and the observation that pairs of proteins that have different localization patterns are unlikely to interact. While this method leads to high quality sets of non-interacting proteins, we find that this choice can lead to biased estimates of prediction accuracy, because the constraints placed on the distribution of the negative examples makes the task easier. The effects of this bias are demonstrated in the context of both sequence-based and non-sequence based features used for predicting protein-protein interactions.",
"genre": "research_article",
"id": "sg:pub.10.1186/1471-2105-7-s1-s2",
"inLanguage": [
"en"
],
"isAccessibleForFree": true,
"isFundedItemOf": [
{
"id": "sg:grant.2439911",
"type": "MonetaryGrant"
},
{
"id": "sg:grant.3034927",
"type": "MonetaryGrant"
},
{
"id": "sg:grant.2632019",
"type": "MonetaryGrant"
}
],
"isPartOf": [
{
"id": "sg:journal.1023786",
"issn": [
"1471-2105"
],
"name": "BMC Bioinformatics",
"type": "Periodical"
},
{
"issueNumber": "Suppl 1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "7"
}
],
"name": "Choosing negative examples for the prediction of protein-protein interactions",
"pagination": "s2",
"productId": [
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"b60135c4ea0994ca7295ec8e428360620f1738a2eabe26d61baf5d3909f83964"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"16723005"
]
},
{
"name": "nlm_unique_id",
"type": "PropertyValue",
"value": [
"100965194"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1186/1471-2105-7-s1-s2"
]
},
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1038206691"
]
}
],
"sameAs": [
"https://doi.org/10.1186/1471-2105-7-s1-s2",
"https://app.dimensions.ai/details/publication/pub.1038206691"
],
"sdDataset": "articles",
"sdDatePublished": "2019-04-11T00:14",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8695_00000506.jsonl",
"type": "ScholarlyArticle",
"url": "http://link.springer.com/10.1186%2F1471-2105-7-S1-S2"
}
]
Download the RDF metadata as:Â json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-7-s1-s2'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-7-s1-s2'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-7-s1-s2'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-7-s1-s2'
This table displays all metadata directly associated to this object as RDF triples.
234 TRIPLES
21 PREDICATES
73 URIs
35 LITERALS
23 BLANK NODES