Ontology type: schema:Chapter
2004
AUTHORSJavier Fernández , Elena Montañés , Irene Díaz , José Ranilla , Elías F. Combarro
ABSTRACTTerm selection is one of the main tasks in Information Retrieval and Text Categorization. It has been traditionally carried out by statistical methods based on the frequency of appearance of the words in the documents. In this paper it is presented a method for extracting relevant words of a document by taking into account their linguistic information. These relevant words are obtained by a Machine Learning algorithm which takes manually selected words as training set. With the lexica obtained by this technique Text Categorization is performed by using Support Vector Machines. The results are compared with one of the most used method for term selection (based just on statistical information) and it is found the new method performs better and has the additional advantage of automatically selecting the filtering level. More... »
PAGES253-262
Database and Expert Systems Applications
ISBN
978-3-540-22936-0
978-3-540-30075-5
http://scigraph.springernature.com/pub.10.1007/978-3-540-30075-5_25
DOIhttp://dx.doi.org/10.1007/978-3-540-30075-5_25
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1030815242
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Artificial Intelligence and Image Processing",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information and Computing Sciences",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "University of Oviedo",
"id": "https://www.grid.ac/institutes/grid.10863.3c",
"name": [
"Artificial Intelligence Center, University of Oviedo, Spain"
],
"type": "Organization"
},
"familyName": "Fern\u00e1ndez",
"givenName": "Javier",
"id": "sg:person.011220142546.36",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011220142546.36"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "University of Oviedo",
"id": "https://www.grid.ac/institutes/grid.10863.3c",
"name": [
"Artificial Intelligence Center, University of Oviedo, Spain"
],
"type": "Organization"
},
"familyName": "Monta\u00f1\u00e9s",
"givenName": "Elena",
"id": "sg:person.011600442422.98",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011600442422.98"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "University of Oviedo",
"id": "https://www.grid.ac/institutes/grid.10863.3c",
"name": [
"Artificial Intelligence Center, University of Oviedo, Spain"
],
"type": "Organization"
},
"familyName": "D\u00edaz",
"givenName": "Irene",
"id": "sg:person.010242453671.42",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010242453671.42"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "University of Oviedo",
"id": "https://www.grid.ac/institutes/grid.10863.3c",
"name": [
"Artificial Intelligence Center, University of Oviedo, Spain"
],
"type": "Organization"
},
"familyName": "Ranilla",
"givenName": "Jos\u00e9",
"id": "sg:person.011017130042.09",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011017130042.09"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "University of Oviedo",
"id": "https://www.grid.ac/institutes/grid.10863.3c",
"name": [
"Artificial Intelligence Center, University of Oviedo, Spain"
],
"type": "Organization"
},
"familyName": "Combarro",
"givenName": "El\u00edas F.",
"id": "sg:person.014120426453.50",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014120426453.50"
],
"type": "Person"
}
],
"citation": [
{
"id": "https://doi.org/10.1016/b978-0-08-050058-4.50007-3",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1001305396"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/978-3-540-24687-9_96",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1005975224",
"https://doi.org/10.1007/978-3-540-24687-9_96"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/978-3-540-24687-9_96",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1005975224",
"https://doi.org/10.1007/978-3-540-24687-9_96"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1006/ijhc.2002.1002",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1007076379"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/0306-4573(81)90029-7",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1009977786"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/0306-4573(81)90029-7",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1009977786"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/3-540-45486-1_4",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1015602649",
"https://doi.org/10.1007/3-540-45486-1_4"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/3-540-45486-1_4",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1015602649",
"https://doi.org/10.1007/3-540-45486-1_4"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1023/a:1009976227802",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1049582902",
"https://doi.org/10.1023/a:1009976227802"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/bfb0026683",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1051853845",
"https://doi.org/10.1007/bfb0026683"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.21236/ada273556",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1091558316"
],
"type": "CreativeWork"
}
],
"datePublished": "2004",
"datePublishedReg": "2004-01-01",
"description": "Term selection is one of the main tasks in Information Retrieval and Text Categorization. It has been traditionally carried out by statistical methods based on the frequency of appearance of the words in the documents. In this paper it is presented a method for extracting relevant words of a document by taking into account their linguistic information. These relevant words are obtained by a Machine Learning algorithm which takes manually selected words as training set. With the lexica obtained by this technique Text Categorization is performed by using Support Vector Machines. The results are compared with one of the most used method for term selection (based just on statistical information) and it is found the new method performs better and has the additional advantage of automatically selecting the filtering level.",
"editor": [
{
"familyName": "Galindo",
"givenName": "Fernando",
"type": "Person"
},
{
"familyName": "Takizawa",
"givenName": "Makoto",
"type": "Person"
},
{
"familyName": "Traunm\u00fcller",
"givenName": "Roland",
"type": "Person"
}
],
"genre": "chapter",
"id": "sg:pub.10.1007/978-3-540-30075-5_25",
"inLanguage": [
"en"
],
"isAccessibleForFree": false,
"isPartOf": {
"isbn": [
"978-3-540-22936-0",
"978-3-540-30075-5"
],
"name": "Database and Expert Systems Applications",
"type": "Book"
},
"name": "Text Categorization by a Machine-Learning-Based Term Selection",
"pagination": "253-262",
"productId": [
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1030815242"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1007/978-3-540-30075-5_25"
]
},
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"92ccec62292efc860f1a2a61c944e969902100218c861aaf29a25681770af094"
]
}
],
"publisher": {
"location": "Berlin, Heidelberg",
"name": "Springer Berlin Heidelberg",
"type": "Organisation"
},
"sameAs": [
"https://doi.org/10.1007/978-3-540-30075-5_25",
"https://app.dimensions.ai/details/publication/pub.1030815242"
],
"sdDataset": "chapters",
"sdDatePublished": "2019-04-16T08:39",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000365_0000000365/records_71712_00000001.jsonl",
"type": "Chapter",
"url": "https://link.springer.com/10.1007%2F978-3-540-30075-5_25"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-540-30075-5_25'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-540-30075-5_25'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-540-30075-5_25'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-540-30075-5_25'
This table displays all metadata directly associated to this object as RDF triples.
131 TRIPLES
23 PREDICATES
35 URIs
20 LITERALS
8 BLANK NODES