Words as Rules: Feature Selection in Text Categorization


Ontology type: schema:Chapter      Open Access: True


Chapter Info

DATE

2004

AUTHORS

E. Montañés , E. F. Combarro , I. Díaz , J. Ranilla , J. R. Quevedo

ABSTRACT

In Text Categorization problems usually there is a lot of noisy and irrelevant information present. In this paper we propose to apply some measures taken from the Machine Learning environment for Feature Selection. The classifier used is Support Vector Machines. The experiments over two different corpora show that some of the new measures perform better than the traditional Information Theory measures.

PAGES

666-669

Book

TITLE

Computational Science - ICCS 2004

ISBN

978-3-540-22114-2
978-3-540-24685-5

Author Affiliations

Artificial Intelligence Center, University of Oviedo, Spain (all five authors)

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/978-3-540-24685-5_115

DOI

http://dx.doi.org/10.1007/978-3-540-24685-5_115

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1007109975



JSON-LD is the canonical representation for SciGraph data.


[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Artificial Intelligence and Image Processing", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "University of Oviedo", 
          "id": "https://www.grid.ac/institutes/grid.10863.3c", 
          "name": [
            "Artificial Intelligence Center, University of Oviedo, Spain"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Monta\u00f1\u00e9s", 
        "givenName": "E.", 
        "id": "sg:person.011600442422.98", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011600442422.98"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Oviedo", 
          "id": "https://www.grid.ac/institutes/grid.10863.3c", 
          "name": [
            "Artificial Intelligence Center, University of Oviedo, Spain"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Combarro", 
        "givenName": "E. F.", 
        "id": "sg:person.014120426453.50", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014120426453.50"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Oviedo", 
          "id": "https://www.grid.ac/institutes/grid.10863.3c", 
          "name": [
            "Artificial Intelligence Center, University of Oviedo, Spain"
          ], 
          "type": "Organization"
        }, 
        "familyName": "D\u00edaz", 
        "givenName": "I.", 
        "id": "sg:person.010242453671.42", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010242453671.42"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Oviedo", 
          "id": "https://www.grid.ac/institutes/grid.10863.3c", 
          "name": [
            "Artificial Intelligence Center, University of Oviedo, Spain"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Ranilla", 
        "givenName": "J.", 
        "id": "sg:person.011017130042.09", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011017130042.09"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Oviedo", 
          "id": "https://www.grid.ac/institutes/grid.10863.3c", 
          "name": [
            "Artificial Intelligence Center, University of Oviedo, Spain"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Quevedo", 
        "givenName": "J. R.", 
        "id": "sg:person.01070600721.84", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01070600721.84"
        ], 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "sg:pub.10.1007/3-540-45268-0_6", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1001734615", 
          "https://doi.org/10.1007/3-540-45268-0_6"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/bf03037227", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1005084660", 
          "https://doi.org/10.1007/bf03037227"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1145/183422.183423", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1021178021"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1145/505282.505283", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1023316280"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/3-540-44869-1_94", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1026387934", 
          "https://doi.org/10.1007/3-540-44869-1_94"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1002/asi.10409", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1048193736"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/bfb0026683", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1051853845", 
          "https://doi.org/10.1007/bfb0026683"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2004", 
    "datePublishedReg": "2004-01-01", 
    "description": "In Text Categorization problems usually there is a lot of noisy and irrelevant information present. In this paper we propose to apply some measures taken from the Machine Learning environment for Feature Selection. The classifier used is Support Vector Machines. The experiments over two different corpora show that some of the new measures perform better than the traditional Information Theory measures.", 
    "editor": [
      {
        "familyName": "Bubak", 
        "givenName": "Marian", 
        "type": "Person"
      }, 
      {
        "familyName": "van Albada", 
        "givenName": "Geert Dick", 
        "type": "Person"
      }, 
      {
        "familyName": "Sloot", 
        "givenName": "Peter M. A.", 
        "type": "Person"
      }, 
      {
        "familyName": "Dongarra", 
        "givenName": "Jack", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/978-3-540-24685-5_115", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": true, 
    "isPartOf": {
      "isbn": [
        "978-3-540-22114-2", 
        "978-3-540-24685-5"
      ], 
      "name": "Computational Science - ICCS 2004", 
      "type": "Book"
    }, 
    "name": "Words as Rules: Feature Selection in Text Categorization", 
    "pagination": "666-669", 
    "productId": [
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1007109975"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/978-3-540-24685-5_115"
        ]
      }, 
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "21de762d36e68eb77d164e766171a62627c441e68d6f7cc903360ecc7b00287c"
        ]
      }
    ], 
    "publisher": {
      "location": "Berlin, Heidelberg", 
      "name": "Springer Berlin Heidelberg", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/978-3-540-24685-5_115", 
      "https://app.dimensions.ai/details/publication/pub.1007109975"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2019-04-16T08:26", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000363_0000000363/records_70058_00000000.jsonl", 
    "type": "Chapter", 
    "url": "https://link.springer.com/10.1007%2F978-3-540-24685-5_115"
  }
]
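
The record above is ordinary JSON, so the main bibliographic fields can be read with a standard JSON parser. The following is a minimal sketch in Python, not part of the SciGraph tooling: `RECORD_JSON` is a trimmed, hand-copied excerpt of the record, and `summarize` is a hypothetical helper name introduced only for illustration.

```python
import json

# Trimmed excerpt of the JSON-LD record shown above (assumption: only the
# fields used below are kept; the full record has more authors and fields).
RECORD_JSON = r"""
[
  {
    "name": "Words as Rules: Feature Selection in Text Categorization",
    "pagination": "666-669",
    "author": [
      {"familyName": "Monta\u00f1\u00e9s", "givenName": "E."},
      {"familyName": "Combarro", "givenName": "E. F."}
    ],
    "productId": [
      {"name": "dimensions_id", "value": ["pub.1007109975"]},
      {"name": "doi", "value": ["10.1007/978-3-540-24685-5_115"]}
    ]
  }
]
"""


def summarize(record_text):
    """Extract title, authors, DOI and pages from a SciGraph-style record."""
    # The record is a one-element JSON array wrapping the publication object.
    record = json.loads(record_text)[0]
    authors = [
        f'{person["givenName"]} {person["familyName"]}'
        for person in record.get("author", [])
    ]
    # productId is a list of name/value pairs; each value is itself a list.
    identifiers = {p["name"]: p["value"][0] for p in record.get("productId", [])}
    return {
        "title": record["name"],
        "authors": authors,
        "doi": identifiers.get("doi"),
        "pages": record.get("pagination"),
    }


if __name__ == "__main__":
    print(summarize(RECORD_JSON))
```

Note that the `\u00f1` and `\u00e9` escapes in the family name are decoded by the JSON parser, so no extra handling is needed to recover "Montañés".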
 


HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-540-24685-5_115'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-540-24685-5_115'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-540-24685-5_115'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-540-24685-5_115'
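
The four curl commands differ only in the `Accept` header, which is plain HTTP content negotiation. As a rough sketch, the same requests could be built with Python's standard library. The names `ACCEPT`, `build_request` and `fetch` are hypothetical, and whether the endpoint still responds is not verified here.

```python
from urllib.request import Request, urlopen

# Record URL copied from the curl commands above.
RECORD_URL = "https://scigraph.springernature.com/pub.10.1007/978-3-540-24685-5_115"

# Format name -> Accept header value, as listed in the curl commands above.
ACCEPT = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(fmt, url=RECORD_URL):
    """Return a Request asking for `url` in the given RDF serialization."""
    if fmt not in ACCEPT:
        raise ValueError(f"unknown format: {fmt!r}")
    return Request(url, headers={"Accept": ACCEPT[fmt]})


def fetch(fmt, url=RECORD_URL, timeout=30):
    """Send the request and return the body as text (needs network access)."""
    with urlopen(build_request(fmt, url), timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch("turtle")` would correspond to the third curl command. Building the request separately from sending it keeps the header logic checkable without network access.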


 

This table displays all metadata directly associated with this object as RDF triples.

133 TRIPLES      23 PREDICATES      34 URIs      20 LITERALS      8 BLANK NODES

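Row Subject Predicate Object
(A row that omits the subject, or the subject and predicate, continues those of the row above it.)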
1 sg:pub.10.1007/978-3-540-24685-5_115 schema:about anzsrc-for:08
2 anzsrc-for:0801
3 schema:author N74c7226c64884edb807978a925799723
4 schema:citation sg:pub.10.1007/3-540-44869-1_94
5 sg:pub.10.1007/3-540-45268-0_6
6 sg:pub.10.1007/bf03037227
7 sg:pub.10.1007/bfb0026683
8 https://doi.org/10.1002/asi.10409
9 https://doi.org/10.1145/183422.183423
10 https://doi.org/10.1145/505282.505283
11 schema:datePublished 2004
12 schema:datePublishedReg 2004-01-01
13 schema:description In Text Categorization problems usually there is a lot of noisy and irrelevant information present. In this paper we propose to apply some measures taken from the Machine Learning environment for Feature Selection. The classifier used is Support Vector Machines. The experiments over two different corpora show that some of the new measures perform better than the traditional Information Theory measures.
14 schema:editor Nc41582ba1c224f37a818a5438656f76e
15 schema:genre chapter
16 schema:inLanguage en
17 schema:isAccessibleForFree true
18 schema:isPartOf N9dc92e8e316d47d4ac9cc2bf48d6980a
19 schema:name Words as Rules: Feature Selection in Text Categorization
20 schema:pagination 666-669
21 schema:productId N44233e9db5344ed4bfd59380822cd856
22 Nb23bad79c32648f8a9d78743dd4cf8e9
23 Ncd1a5b3107b84c34bc95fe83536fb4cf
24 schema:publisher N725e78aa9a4e4e56b7f089e03fea901d
25 schema:sameAs https://app.dimensions.ai/details/publication/pub.1007109975
26 https://doi.org/10.1007/978-3-540-24685-5_115
27 schema:sdDatePublished 2019-04-16T08:26
28 schema:sdLicense https://scigraph.springernature.com/explorer/license/
29 schema:sdPublisher N6d971856b3a4485e8c1d2d0cf13be489
30 schema:url https://link.springer.com/10.1007%2F978-3-540-24685-5_115
31 sgo:license sg:explorer/license/
32 sgo:sdDataset chapters
33 rdf:type schema:Chapter
34 N0aa1aa68edb645aab33b0cc4168f21de schema:familyName Bubak
35 schema:givenName Marian
36 rdf:type schema:Person
37 N0f2a4e5a085c4ac2b72b2f9515339388 rdf:first sg:person.01070600721.84
38 rdf:rest rdf:nil
39 N22d5141577164e359cbd2872627fed22 rdf:first sg:person.011017130042.09
40 rdf:rest N0f2a4e5a085c4ac2b72b2f9515339388
41 N294cbd2b0b3c44c29c83e73b14f9aa8f schema:familyName van Albada
42 schema:givenName Geert Dick
43 rdf:type schema:Person
44 N44233e9db5344ed4bfd59380822cd856 schema:name readcube_id
45 schema:value 21de762d36e68eb77d164e766171a62627c441e68d6f7cc903360ecc7b00287c
46 rdf:type schema:PropertyValue
47 N46d13abd4b724278a84b0d41b769287f schema:familyName Dongarra
48 schema:givenName Jack
49 rdf:type schema:Person
50 N6d971856b3a4485e8c1d2d0cf13be489 schema:name Springer Nature - SN SciGraph project
51 rdf:type schema:Organization
52 N725e78aa9a4e4e56b7f089e03fea901d schema:location Berlin, Heidelberg
53 schema:name Springer Berlin Heidelberg
54 rdf:type schema:Organisation
55 N74c7226c64884edb807978a925799723 rdf:first sg:person.011600442422.98
56 rdf:rest N887e2e4077ef42fea2d7551e46c357a9
57 N887e2e4077ef42fea2d7551e46c357a9 rdf:first sg:person.014120426453.50
58 rdf:rest Nd723af2c9b7647c18ea6fe248f15be72
59 N9dc92e8e316d47d4ac9cc2bf48d6980a schema:isbn 978-3-540-22114-2
60 978-3-540-24685-5
61 schema:name Computational Science - ICCS 2004
62 rdf:type schema:Book
63 Nb23bad79c32648f8a9d78743dd4cf8e9 schema:name dimensions_id
64 schema:value pub.1007109975
65 rdf:type schema:PropertyValue
66 Nbcbc1ea083ee452db7a88d64cd3940bb schema:familyName Sloot
67 schema:givenName Peter M. A.
68 rdf:type schema:Person
69 Nc113112a2d374ac8961bf34b513966d7 rdf:first Nbcbc1ea083ee452db7a88d64cd3940bb
70 rdf:rest Ne03228283ed045e0a27ac02e89551701
71 Nc41582ba1c224f37a818a5438656f76e rdf:first N0aa1aa68edb645aab33b0cc4168f21de
72 rdf:rest Ncecd8c983bd54786a526f44fbfee0a8f
73 Ncd1a5b3107b84c34bc95fe83536fb4cf schema:name doi
74 schema:value 10.1007/978-3-540-24685-5_115
75 rdf:type schema:PropertyValue
76 Ncecd8c983bd54786a526f44fbfee0a8f rdf:first N294cbd2b0b3c44c29c83e73b14f9aa8f
77 rdf:rest Nc113112a2d374ac8961bf34b513966d7
78 Nd723af2c9b7647c18ea6fe248f15be72 rdf:first sg:person.010242453671.42
79 rdf:rest N22d5141577164e359cbd2872627fed22
80 Ne03228283ed045e0a27ac02e89551701 rdf:first N46d13abd4b724278a84b0d41b769287f
81 rdf:rest rdf:nil
82 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
83 schema:name Information and Computing Sciences
84 rdf:type schema:DefinedTerm
85 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
86 schema:name Artificial Intelligence and Image Processing
87 rdf:type schema:DefinedTerm
88 sg:person.010242453671.42 schema:affiliation https://www.grid.ac/institutes/grid.10863.3c
89 schema:familyName Díaz
90 schema:givenName I.
91 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010242453671.42
92 rdf:type schema:Person
93 sg:person.01070600721.84 schema:affiliation https://www.grid.ac/institutes/grid.10863.3c
94 schema:familyName Quevedo
95 schema:givenName J. R.
96 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01070600721.84
97 rdf:type schema:Person
98 sg:person.011017130042.09 schema:affiliation https://www.grid.ac/institutes/grid.10863.3c
99 schema:familyName Ranilla
100 schema:givenName J.
101 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011017130042.09
102 rdf:type schema:Person
103 sg:person.011600442422.98 schema:affiliation https://www.grid.ac/institutes/grid.10863.3c
104 schema:familyName Montañés
105 schema:givenName E.
106 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011600442422.98
107 rdf:type schema:Person
108 sg:person.014120426453.50 schema:affiliation https://www.grid.ac/institutes/grid.10863.3c
109 schema:familyName Combarro
110 schema:givenName E. F.
111 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014120426453.50
112 rdf:type schema:Person
113 sg:pub.10.1007/3-540-44869-1_94 schema:sameAs https://app.dimensions.ai/details/publication/pub.1026387934
114 https://doi.org/10.1007/3-540-44869-1_94
115 rdf:type schema:CreativeWork
116 sg:pub.10.1007/3-540-45268-0_6 schema:sameAs https://app.dimensions.ai/details/publication/pub.1001734615
117 https://doi.org/10.1007/3-540-45268-0_6
118 rdf:type schema:CreativeWork
119 sg:pub.10.1007/bf03037227 schema:sameAs https://app.dimensions.ai/details/publication/pub.1005084660
120 https://doi.org/10.1007/bf03037227
121 rdf:type schema:CreativeWork
122 sg:pub.10.1007/bfb0026683 schema:sameAs https://app.dimensions.ai/details/publication/pub.1051853845
123 https://doi.org/10.1007/bfb0026683
124 rdf:type schema:CreativeWork
125 https://doi.org/10.1002/asi.10409 schema:sameAs https://app.dimensions.ai/details/publication/pub.1048193736
126 rdf:type schema:CreativeWork
127 https://doi.org/10.1145/183422.183423 schema:sameAs https://app.dimensions.ai/details/publication/pub.1021178021
128 rdf:type schema:CreativeWork
129 https://doi.org/10.1145/505282.505283 schema:sameAs https://app.dimensions.ai/details/publication/pub.1023316280
130 rdf:type schema:CreativeWork
131 https://www.grid.ac/institutes/grid.10863.3c schema:alternateName University of Oviedo
132 schema:name Artificial Intelligence Center, University of Oviedo, Spain
133 rdf:type schema:Organization
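
In the table above, the ordered author list is stored as an RDF collection: `schema:author` points at a blank node, and each node carries one `rdf:first` (the item) and one `rdf:rest` (the next node, or `rdf:nil` at the end). The sketch below shows one way to recover the order by following that chain. The triples are copied by hand from rows 37-40, 55-58 and 78-79, and `walk_list` is a hypothetical helper, not part of any SciGraph API.

```python
FIRST, REST, NIL = "rdf:first", "rdf:rest", "rdf:nil"

# (subject, predicate, object) triples for the author list, copied from the
# table above and deliberately left in table order, not list order.
TRIPLES = [
    ("N0f2a4e5a085c4ac2b72b2f9515339388", FIRST, "sg:person.01070600721.84"),
    ("N0f2a4e5a085c4ac2b72b2f9515339388", REST, NIL),
    ("N22d5141577164e359cbd2872627fed22", FIRST, "sg:person.011017130042.09"),
    ("N22d5141577164e359cbd2872627fed22", REST, "N0f2a4e5a085c4ac2b72b2f9515339388"),
    ("N74c7226c64884edb807978a925799723", FIRST, "sg:person.011600442422.98"),
    ("N74c7226c64884edb807978a925799723", REST, "N887e2e4077ef42fea2d7551e46c357a9"),
    ("N887e2e4077ef42fea2d7551e46c357a9", FIRST, "sg:person.014120426453.50"),
    ("N887e2e4077ef42fea2d7551e46c357a9", REST, "Nd723af2c9b7647c18ea6fe248f15be72"),
    ("Nd723af2c9b7647c18ea6fe248f15be72", FIRST, "sg:person.010242453671.42"),
    ("Nd723af2c9b7647c18ea6fe248f15be72", REST, "N22d5141577164e359cbd2872627fed22"),
]

# Head of the list: the object of schema:author in row 3 of the table.
AUTHOR_LIST_HEAD = "N74c7226c64884edb807978a925799723"


def walk_list(head, triples):
    """Return the items of an RDF collection in order, starting at `head`."""
    first = {s: o for s, p, o in triples if p == FIRST}
    rest = {s: o for s, p, o in triples if p == REST}
    items, node, seen = [], head, set()
    while node != NIL:
        if node in seen:
            # Guard against malformed data that loops back on itself.
            raise ValueError(f"cycle detected at {node}")
        seen.add(node)
        items.append(first[node])
        node = rest[node]
    return items


if __name__ == "__main__":
    for person in walk_list(AUTHOR_LIST_HEAD, TRIPLES):
        print(person)
```

Followed from the head node, the chain yields the person identifiers for Montañés, Combarro, Díaz, Ranilla and Quevedo, which matches the author order given at the top of the record.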