2015
AUTHORSMamadou S. Camara , Djasrabe Naguingar , Alassane Bah
ABSTRACTData Mining (DM) projects are implemented by following the knowledge discovery process. Several techniques for detecting and handling data quality problems such as missing data, outliers, inconsistent data or time-variant data, can be found in the literature of DM and Data Warehousing (DW). Tasks that are related to the quality of data are mostly in the Data Understanding and in the Data Preparation phases of the DM process. The main limitation in the application of the data quality management techniques is the complexity caused by a lack of anticipation in the detection and resolution of the problems. A DM process model designed for the prior management of data quality is proposed in this work. In this model, the DM process is defined in relation to the Software Engineering (SE) process; the two processes are combined in parallel. The main contribution of this DM process is the anticipation and the automation of all activities necessary to remove data quality problems. More... »
PAGES299-307
New Trends in Networking, Computing, E-learning, Systems Sciences, and Engineering
ISBN
978-3-319-06763-6
978-3-319-06764-3
http://scigraph.springernature.com/pub.10.1007/978-3-319-06764-3_37
DOIhttp://dx.doi.org/10.1007/978-3-319-06764-3_37
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1048771278
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0806",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information Systems",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information and Computing Sciences",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "Cheikh Anta Diop University",
"id": "https://www.grid.ac/institutes/grid.8191.1",
"name": [
"Laboratoire d\u2019Informatique, R\u00e9seaux et T\u00e9l\u00e9coms (LIRT), Ecole Sup\u00e9rieure Polytechnique, Universit\u00e9 Cheikh Anta Diop de Dakar, BP 5085\u00a0dakar-fann, Dakar, Senegal"
],
"type": "Organization"
},
"familyName": "Camara",
"givenName": "Mamadou S.",
"id": "sg:person.07364533341.41",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07364533341.41"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Cheikh Anta Diop University",
"id": "https://www.grid.ac/institutes/grid.8191.1",
"name": [
"Laboratoire d\u2019Imagerie M\u00e9dicale et de BioInformatique (LIMBI), Ecole Sup\u00e9rieure Polytechnique, Universit\u00e9 Cheikh Anta Diop de Dakar, BP 5085\u00a0dakar-fann, Dakar, Senegal"
],
"type": "Organization"
},
"familyName": "Naguingar",
"givenName": "Djasrabe",
"type": "Person"
},
{
"affiliation": {
"alternateName": "Cheikh Anta Diop University",
"id": "https://www.grid.ac/institutes/grid.8191.1",
"name": [
"UMI 209, UMMISCO - UCAD, Ecole Sup\u00e9rieure Polytechnique, Universit\u00e9 Cheikh Anta Diop de Dakar, BP 15915\u00a0Dakar-Fann, Senegal"
],
"type": "Organization"
},
"familyName": "Bah",
"givenName": "Alassane",
"id": "sg:person.0720633131.58",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0720633131.58"
],
"type": "Person"
}
],
"citation": [
{
"id": "https://doi.org/10.1016/j.datak.2007.06.020",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1001547036"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.is.2008.04.003",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1003686169"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.apm.2012.11.015",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1003872549"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1145/303976.303983",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1009681559"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.eswa.2012.02.044",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1019974423"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.datak.2012.04.002",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1020495800"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1145/331499.331504",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1026347712"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.jom.2005.03.001",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1027127788"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1145/1147376.1147391",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1027278261"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/bf02925480",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1028202005",
"https://doi.org/10.1007/bf02925480"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.datak.2009.08.008",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1032807948"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.patrec.2011.07.002",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1035943831"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.artint.2008.07.004",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1046048643"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/978-94-015-3994-4",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1053674345",
"https://doi.org/10.1007/978-94-015-3994-4"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/978-94-015-3994-4",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1053674345",
"https://doi.org/10.1007/978-94-015-3994-4"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1063/1.2995737",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1057891654"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.4018/978-1-59904-387-6",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1096031270"
],
"type": "CreativeWork"
}
],
"datePublished": "2015",
"datePublishedReg": "2015-01-01",
"description": "Data Mining (DM) projects are implemented by following the knowledge discovery process. Several techniques for detecting and handling data quality problems such as missing data, outliers, inconsistent data or time-variant data, can be found in the literature of DM and Data Warehousing (DW). Tasks that are related to the quality of data are mostly in the Data Understanding and in the Data Preparation phases of the DM process. The main limitation in the application of the data quality management techniques is the complexity caused by a lack of anticipation in the detection and resolution of the problems. A DM process model designed for the prior management of data quality is proposed in this work. In this model, the DM process is defined in relation to the Software Engineering (SE) process; the two processes are combined in parallel. The main contribution of this DM process is the anticipation and the automation of all activities necessary to remove data quality problems.",
"editor": [
{
"familyName": "Elleithy",
"givenName": "Khaled",
"type": "Person"
},
{
"familyName": "Sobh",
"givenName": "Tarek",
"type": "Person"
}
],
"genre": "chapter",
"id": "sg:pub.10.1007/978-3-319-06764-3_37",
"inLanguage": [
"en"
],
"isAccessibleForFree": false,
"isPartOf": {
"isbn": [
"978-3-319-06763-6",
"978-3-319-06764-3"
],
"name": "New Trends in Networking, Computing, E-learning, Systems Sciences, and Engineering",
"type": "Book"
},
"name": "Prior Data Quality Management in Data Mining Process",
"pagination": "299-307",
"productId": [
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1007/978-3-319-06764-3_37"
]
},
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"e8564aa86890eb48b4b90cd8e9000ff4870f0c4f79bf22bc6bef9fbb7cb49189"
]
},
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1048771278"
]
}
],
"publisher": {
"location": "Cham",
"name": "Springer International Publishing",
"type": "Organisation"
},
"sameAs": [
"https://doi.org/10.1007/978-3-319-06764-3_37",
"https://app.dimensions.ai/details/publication/pub.1048771278"
],
"sdDataset": "chapters",
"sdDatePublished": "2019-04-15T20:08",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8687_00000273.jsonl",
"type": "Chapter",
"url": "http://link.springer.com/10.1007/978-3-319-06764-3_37"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-06764-3_37'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-06764-3_37'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-06764-3_37'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-06764-3_37'
This table displays all metadata directly associated to this object as RDF triples.
135 TRIPLES
23 PREDICATES
43 URIs
20 LITERALS
8 BLANK NODES