Data Mining of Association Rules and the Process of Knowledge Discovery in Databases View Full Text


Ontology type: schema:Chapter     


Chapter Info

DATE

2002-08-21

AUTHORS

Jochen Hipp , Ulrich Güntzer , Gholamreza Nakhaeizadeh

ABSTRACT

In this paper we deal with association rule mining in the context of a complex, interactive and iterative knowledge discovery process. After a general introduction covering the basics of association rule mining and of the knowledge discovery process in databases we draw the attention to the problematic aspects concerning the integration of both. Actually, we come to the conclusion that with regard to human involvement and interactivity the current situation is far from being satisfying. In our paper we tackle this problem on three sides: First of all there is the algorithmic complexity. Although today’s algorithms efficiently prune the immense search space the achieved run times do not allow true interactivity. Nevertheless we present a rule caching schema that significantly reduces the number of mining runs. This schema helps to gain interactivity even in the presence of extreme run times of the mining algorithms. Second, today the mining data is typically stored in a relational database management system. We present an efficient integration with modern database systems which is one of the key factors in practical mining applications. Third, interesting rules must be picked from the set of generated rules. This might be quite costly because the generated rule sets normally are quite large whereas the percentage of useful rules is typically only a very small fraction. We enhance the traditional association rule mining framework in order to cope with this situation. More... »

PAGES

15-36

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/3-540-46131-0_2

DOI

http://dx.doi.org/10.1007/3-540-46131-0_2

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1011455514


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Artificial Intelligence and Image Processing", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0806", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information Systems", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "DaimlerChrysler AG, Research & Technology, 89081, Ulm, Germany", 
          "id": "http://www.grid.ac/institutes/grid.5433.1", 
          "name": [
            "DaimlerChrysler AG, Research & Technology, 89081, Ulm, Germany"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Hipp", 
        "givenName": "Jochen", 
        "id": "sg:person.014732763563.11", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014732763563.11"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Wilhelm Schickard-Institute, University of T\u00fcbingen, 72076, T\u00fcbingen, Germany", 
          "id": "http://www.grid.ac/institutes/grid.10392.39", 
          "name": [
            "Wilhelm Schickard-Institute, University of T\u00fcbingen, 72076, T\u00fcbingen, Germany"
          ], 
          "type": "Organization"
        }, 
        "familyName": "G\u00fcntzer", 
        "givenName": "Ulrich", 
        "id": "sg:person.013324511711.75", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013324511711.75"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "DaimlerChrysler AG, Research & Technology, 89081, Ulm, Germany", 
          "id": "http://www.grid.ac/institutes/grid.5433.1", 
          "name": [
            "DaimlerChrysler AG, Research & Technology, 89081, Ulm, Germany"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Nakhaeizadeh", 
        "givenName": "Gholamreza", 
        "id": "sg:person.07467643565.95", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07467643565.95"
        ], 
        "type": "Person"
      }
    ], 
    "datePublished": "2002-08-21", 
    "datePublishedReg": "2002-08-21", 
    "description": "In this paper we deal with association rule mining in the context of a complex, interactive and iterative knowledge discovery process. After a general introduction covering the basics of association rule mining and of the knowledge discovery process in databases we draw the attention to the problematic aspects concerning the integration of both. Actually, we come to the conclusion that with regard to human involvement and interactivity the current situation is far from being satisfying. In our paper we tackle this problem on three sides: First of all there is the algorithmic complexity. Although today\u2019s algorithms efficiently prune the immense search space the achieved run times do not allow true interactivity. Nevertheless we present a rule caching schema that significantly reduces the number of mining runs. This schema helps to gain interactivity even in the presence of extreme run times of the mining algorithms. Second, today the mining data is typically stored in a relational database management system. We present an efficient integration with modern database systems which is one of the key factors in practical mining applications. Third, interesting rules must be picked from the set of generated rules. This might be quite costly because the generated rule sets normally are quite large whereas the percentage of useful rules is typically only a very small fraction. We enhance the traditional association rule mining framework in order to cope with this situation.", 
    "editor": [
      {
        "familyName": "Perner", 
        "givenName": "Petra", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/3-540-46131-0_2", 
    "inLanguage": "en", 
    "isAccessibleForFree": false, 
    "isPartOf": {
      "isbn": [
        "978-3-540-44116-8", 
        "978-3-540-46131-9"
      ], 
      "name": "Advances in Data Mining", 
      "type": "Book"
    }, 
    "keywords": [
      "knowledge discovery process", 
      "association rule mining", 
      "rule mining", 
      "association rule mining framework", 
      "relational database management system", 
      "immense search space", 
      "modern database systems", 
      "database management system", 
      "run time", 
      "discovery process", 
      "true interactivity", 
      "mining runs", 
      "mining algorithms", 
      "mining framework", 
      "knowledge discovery", 
      "today\u2019s algorithms", 
      "database systems", 
      "mining applications", 
      "data mining", 
      "association rules", 
      "interesting rules", 
      "rule sets", 
      "mining data", 
      "search space", 
      "algorithmic complexity", 
      "human involvement", 
      "management system", 
      "useful rules", 
      "practical mining applications", 
      "efficient integration", 
      "mining", 
      "algorithm", 
      "interactivity", 
      "schema", 
      "rules", 
      "database", 
      "set", 
      "integration", 
      "current situation", 
      "complexity", 
      "system", 
      "framework", 
      "situation", 
      "applications", 
      "general introduction", 
      "basics", 
      "process", 
      "space", 
      "today", 
      "key factor", 
      "time", 
      "context", 
      "discovery", 
      "order", 
      "data", 
      "problematic aspects", 
      "aspects", 
      "number", 
      "small fraction", 
      "run", 
      "attention", 
      "introduction", 
      "regard", 
      "side", 
      "factors", 
      "conclusion", 
      "presence", 
      "percentage", 
      "involvement", 
      "fraction", 
      "paper", 
      "problem", 
      "iterative knowledge discovery process", 
      "extreme run times", 
      "traditional association rule mining framework", 
      "rule mining framework"
    ], 
    "name": "Data Mining of Association Rules and the Process of Knowledge Discovery in Databases", 
    "pagination": "15-36", 
    "productId": [
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1011455514"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/3-540-46131-0_2"
        ]
      }
    ], 
    "publisher": {
      "name": "Springer Nature", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/3-540-46131-0_2", 
      "https://app.dimensions.ai/details/publication/pub.1011455514"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2022-01-01T19:25", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-springernature-scigraph/baseset/20220101/entities/gbq_results/chapter/chapter_431.jsonl", 
    "type": "Chapter", 
    "url": "https://doi.org/10.1007/3-540-46131-0_2"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/3-540-46131-0_2'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/3-540-46131-0_2'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/3-540-46131-0_2'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/3-540-46131-0_2'


 

This table displays all metadata directly associated to this object as RDF triples.

157 TRIPLES      23 PREDICATES      102 URIs      94 LITERALS      7 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/3-540-46131-0_2 schema:about anzsrc-for:08
2 anzsrc-for:0801
3 anzsrc-for:0806
4 schema:author N039dcea33c854206a6ab5d749e731816
5 schema:datePublished 2002-08-21
6 schema:datePublishedReg 2002-08-21
7 schema:description In this paper we deal with association rule mining in the context of a complex, interactive and iterative knowledge discovery process. After a general introduction covering the basics of association rule mining and of the knowledge discovery process in databases we draw the attention to the problematic aspects concerning the integration of both. Actually, we come to the conclusion that with regard to human involvement and interactivity the current situation is far from being satisfying. In our paper we tackle this problem on three sides: First of all there is the algorithmic complexity. Although today’s algorithms efficiently prune the immense search space the achieved run times do not allow true interactivity. Nevertheless we present a rule caching schema that significantly reduces the number of mining runs. This schema helps to gain interactivity even in the presence of extreme run times of the mining algorithms. Second, today the mining data is typically stored in a relational database management system. We present an efficient integration with modern database systems which is one of the key factors in practical mining applications. Third, interesting rules must be picked from the set of generated rules. This might be quite costly because the generated rule sets normally are quite large whereas the percentage of useful rules is typically only a very small fraction. We enhance the traditional association rule mining framework in order to cope with this situation.
8 schema:editor Nc305f7f3d12341c5906c10dad6325cd0
9 schema:genre chapter
10 schema:inLanguage en
11 schema:isAccessibleForFree false
12 schema:isPartOf N29956f6eb7db44cc8cbac48f33f79bc5
13 schema:keywords algorithm
14 algorithmic complexity
15 applications
16 aspects
17 association rule mining
18 association rule mining framework
19 association rules
20 attention
21 basics
22 complexity
23 conclusion
24 context
25 current situation
26 data
27 data mining
28 database
29 database management system
30 database systems
31 discovery
32 discovery process
33 efficient integration
34 extreme run times
35 factors
36 fraction
37 framework
38 general introduction
39 human involvement
40 immense search space
41 integration
42 interactivity
43 interesting rules
44 introduction
45 involvement
46 iterative knowledge discovery process
47 key factor
48 knowledge discovery
49 knowledge discovery process
50 management system
51 mining
52 mining algorithms
53 mining applications
54 mining data
55 mining framework
56 mining runs
57 modern database systems
58 number
59 order
60 paper
61 percentage
62 practical mining applications
63 presence
64 problem
65 problematic aspects
66 process
67 regard
68 relational database management system
69 rule mining
70 rule mining framework
71 rule sets
72 rules
73 run
74 run time
75 schema
76 search space
77 set
78 side
79 situation
80 small fraction
81 space
82 system
83 time
84 today
85 today’s algorithms
86 traditional association rule mining framework
87 true interactivity
88 useful rules
89 schema:name Data Mining of Association Rules and the Process of Knowledge Discovery in Databases
90 schema:pagination 15-36
91 schema:productId N33ec19ecfa934128a3205523e46bde1a
92 N81ea77d503204417aa881c8b260d8147
93 schema:publisher N4225451ab06d4431b30d3eed86df8328
94 schema:sameAs https://app.dimensions.ai/details/publication/pub.1011455514
95 https://doi.org/10.1007/3-540-46131-0_2
96 schema:sdDatePublished 2022-01-01T19:25
97 schema:sdLicense https://scigraph.springernature.com/explorer/license/
98 schema:sdPublisher N83573c4e0bc148688e9d300219788557
99 schema:url https://doi.org/10.1007/3-540-46131-0_2
100 sgo:license sg:explorer/license/
101 sgo:sdDataset chapters
102 rdf:type schema:Chapter
103 N039dcea33c854206a6ab5d749e731816 rdf:first sg:person.014732763563.11
104 rdf:rest N81c6f05ee0ee456b8899e73ecf550aeb
105 N29956f6eb7db44cc8cbac48f33f79bc5 schema:isbn 978-3-540-44116-8
106 978-3-540-46131-9
107 schema:name Advances in Data Mining
108 rdf:type schema:Book
109 N33ec19ecfa934128a3205523e46bde1a schema:name dimensions_id
110 schema:value pub.1011455514
111 rdf:type schema:PropertyValue
112 N4225451ab06d4431b30d3eed86df8328 schema:name Springer Nature
113 rdf:type schema:Organisation
114 N4b11f4053e734fe4846e67fc28e75b56 schema:familyName Perner
115 schema:givenName Petra
116 rdf:type schema:Person
117 N81c6f05ee0ee456b8899e73ecf550aeb rdf:first sg:person.013324511711.75
118 rdf:rest Nd23d4fde05db413b8e7c495e733c9c80
119 N81ea77d503204417aa881c8b260d8147 schema:name doi
120 schema:value 10.1007/3-540-46131-0_2
121 rdf:type schema:PropertyValue
122 N83573c4e0bc148688e9d300219788557 schema:name Springer Nature - SN SciGraph project
123 rdf:type schema:Organization
124 Nc305f7f3d12341c5906c10dad6325cd0 rdf:first N4b11f4053e734fe4846e67fc28e75b56
125 rdf:rest rdf:nil
126 Nd23d4fde05db413b8e7c495e733c9c80 rdf:first sg:person.07467643565.95
127 rdf:rest rdf:nil
128 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
129 schema:name Information and Computing Sciences
130 rdf:type schema:DefinedTerm
131 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
132 schema:name Artificial Intelligence and Image Processing
133 rdf:type schema:DefinedTerm
134 anzsrc-for:0806 schema:inDefinedTermSet anzsrc-for:
135 schema:name Information Systems
136 rdf:type schema:DefinedTerm
137 sg:person.013324511711.75 schema:affiliation grid-institutes:grid.10392.39
138 schema:familyName Güntzer
139 schema:givenName Ulrich
140 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013324511711.75
141 rdf:type schema:Person
142 sg:person.014732763563.11 schema:affiliation grid-institutes:grid.5433.1
143 schema:familyName Hipp
144 schema:givenName Jochen
145 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014732763563.11
146 rdf:type schema:Person
147 sg:person.07467643565.95 schema:affiliation grid-institutes:grid.5433.1
148 schema:familyName Nakhaeizadeh
149 schema:givenName Gholamreza
150 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07467643565.95
151 rdf:type schema:Person
152 grid-institutes:grid.10392.39 schema:alternateName Wilhelm Schickard-Institute, University of Tübingen, 72076, Tübingen, Germany
153 schema:name Wilhelm Schickard-Institute, University of Tübingen, 72076, Tübingen, Germany
154 rdf:type schema:Organization
155 grid-institutes:grid.5433.1 schema:alternateName DaimlerChrysler AG, Research & Technology, 89081, Ulm, Germany
156 schema:name DaimlerChrysler AG, Research & Technology, 89081, Ulm, Germany
157 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...