Ontology type: schema:ScholarlyArticle Open Access: True
2011-12
AUTHORSMin Song, Hwanjo Yu, Wook-Shin Han
ABSTRACTBACKGROUND: Protein-protein interaction (PPI) extraction has been a focal point of many biomedical research and database curation tools. Both Active Learning and Semi-supervised SVMs have recently been applied to extract PPI automatically. In this paper, we explore combining the AL with the SSL to improve the performance of the PPI task. METHODS: We propose a novel PPI extraction technique called PPISpotter by combining Deterministic Annealing-based SSL and an AL technique to extract protein-protein interaction. In addition, we extract a comprehensive set of features from MEDLINE records by Natural Language Processing (NLP) techniques, which further improve the SVM classifiers. In our feature selection technique, syntactic, semantic, and lexical properties of text are incorporated into feature selection that boosts the system performance significantly. RESULTS: By conducting experiments with three different PPI corpuses, we show that PPISpotter is superior to the other techniques incorporated into semi-supervised SVMs such as Random Sampling, Clustering, and Transductive SVMs by precision, recall, and F-measure. CONCLUSIONS: Our system is a novel, state-of-the-art technique for efficiently extracting protein-protein interaction pairs. More... »
PAGESs4
http://scigraph.springernature.com/pub.10.1186/1471-2105-12-s12-s4
DOIhttp://dx.doi.org/10.1186/1471-2105-12-s12-s4
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1033337271
PUBMEDhttps://www.ncbi.nlm.nih.gov/pubmed/22168401
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Artificial Intelligence and Image Processing",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information and Computing Sciences",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Artificial Intelligence",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Humans",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "MEDLINE",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Natural Language Processing",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Protein Interaction Maps",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Proteins",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Semantics",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "United States",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "New Jersey Institute of Technology",
"id": "https://www.grid.ac/institutes/grid.260896.3",
"name": [
"Information Systems Department, New Jersey Institute of Technology, University Heights, Newark, New Jersey, USA"
],
"type": "Organization"
},
"familyName": "Song",
"givenName": "Min",
"id": "sg:person.01316361675.44",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01316361675.44"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Pohang University of Science and Technology",
"id": "https://www.grid.ac/institutes/grid.49100.3c",
"name": [
"Department of Computer Science & Engineering, POSTECH, Pohang, South Korea"
],
"type": "Organization"
},
"familyName": "Yu",
"givenName": "Hwanjo",
"id": "sg:person.01367471656.65",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01367471656.65"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Kyungpook National University",
"id": "https://www.grid.ac/institutes/grid.258803.4",
"name": [
"School of IT Engineering, Kyungpook National University, Daegu, South Korea"
],
"type": "Organization"
},
"familyName": "Han",
"givenName": "Wook-Shin",
"id": "sg:person.0576255616.66",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0576255616.66"
],
"type": "Person"
}
],
"citation": [
{
"id": "https://doi.org/10.1016/j.artmed.2004.07.016",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1001290304"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/11428848_124",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1002675923",
"https://doi.org/10.1007/11428848_124"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/11428848_124",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1002675923",
"https://doi.org/10.1007/11428848_124"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/ng0501-21",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1005323022",
"https://doi.org/10.1038/ng0501-21"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/ng0501-21",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1005323022",
"https://doi.org/10.1038/ng0501-21"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.jbi.2009.08.013",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1009364731"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/gkn281",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1014757195"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.artmed.2007.07.004",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1019562027"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1080/10556780108805809",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1019913819"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.eswa.2009.01.043",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1020773899"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/bth451",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1025108394"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/17.2.155",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1031746444"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1586/ehm.11.47",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1033360956"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1006/jbin.2001.1029",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1035901831"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/17.suppl_1.s74",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1037764340"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2008-9-s2-s2",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1041411233",
"https://doi.org/10.1186/gb-2008-9-s2-s2"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1023/a:1009715923555",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1042048349",
"https://doi.org/10.1023/a:1009715923555"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btg279",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1043052914"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1145/1143844.1143950",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1043465848"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btn631",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1047133108"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.specom.2004.08.002",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1049252648"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-8-50",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1050932195",
"https://doi.org/10.1186/1471-2105-8-50"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/ietisy/e89-d.8.2464",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1059672048"
],
"type": "CreativeWork"
},
{
"id": "https://app.dimensions.ai/details/publication/pub.1074631088",
"type": "CreativeWork"
},
{
"id": "https://app.dimensions.ai/details/publication/pub.1075024907",
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.21236/ada256365",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1091603112"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.3115/1567594.1567598",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1099140167"
],
"type": "CreativeWork"
}
],
"datePublished": "2011-12",
"datePublishedReg": "2011-12-01",
"description": "BACKGROUND: Protein-protein interaction (PPI) extraction has been a focal point of many biomedical research and database curation tools. Both Active Learning and Semi-supervised SVMs have recently been applied to extract PPI automatically. In this paper, we explore combining the AL with the SSL to improve the performance of the PPI task.\nMETHODS: We propose a novel PPI extraction technique called PPISpotter by combining Deterministic Annealing-based SSL and an AL technique to extract protein-protein interaction. In addition, we extract a comprehensive set of features from MEDLINE records by Natural Language Processing (NLP) techniques, which further improve the SVM classifiers. In our feature selection technique, syntactic, semantic, and lexical properties of text are incorporated into feature selection that boosts the system performance significantly.\nRESULTS: By conducting experiments with three different PPI corpuses, we show that PPISpotter is superior to the other techniques incorporated into semi-supervised SVMs such as Random Sampling, Clustering, and Transductive SVMs by precision, recall, and F-measure.\nCONCLUSIONS: Our system is a novel, state-of-the-art technique for efficiently extracting protein-protein interaction pairs.",
"genre": "research_article",
"id": "sg:pub.10.1186/1471-2105-12-s12-s4",
"inLanguage": [
"en"
],
"isAccessibleForFree": true,
"isFundedItemOf": [
{
"id": "sg:grant.7445710",
"type": "MonetaryGrant"
},
{
"id": "sg:grant.3106693",
"type": "MonetaryGrant"
}
],
"isPartOf": [
{
"id": "sg:journal.1023786",
"issn": [
"1471-2105"
],
"name": "BMC Bioinformatics",
"type": "Periodical"
},
{
"issueNumber": "Suppl 12",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "12"
}
],
"name": "Combining active learning and semi-supervised learning techniques to extract protein interaction sentences",
"pagination": "s4",
"productId": [
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"9ed02413d24944c469b65d8e64ad4c426735a79443f04934d502ec9d39a8279b"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"22168401"
]
},
{
"name": "nlm_unique_id",
"type": "PropertyValue",
"value": [
"100965194"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1186/1471-2105-12-s12-s4"
]
},
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1033337271"
]
}
],
"sameAs": [
"https://doi.org/10.1186/1471-2105-12-s12-s4",
"https://app.dimensions.ai/details/publication/pub.1033337271"
],
"sdDataset": "articles",
"sdDatePublished": "2019-04-10T19:03",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8678_00000489.jsonl",
"type": "ScholarlyArticle",
"url": "http://link.springer.com/10.1186/1471-2105-12-S12-S4"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-12-s12-s4'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-12-s12-s4'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-12-s12-s4'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-12-s12-s4'
This table displays all metadata directly associated to this object as RDF triples.
202 TRIPLES
21 PREDICATES
62 URIs
29 LITERALS
17 BLANK NODES