Nested Schema Mappings for Integrating JSON View Full Text


Ontology type: schema:Chapter     


Chapter Info

DATE

2018-09-26

AUTHORS

Rihan Hai , Christoph Quix , David Kensche

ABSTRACT

JSON has become one of the most popular data formats. Yet studies on JSON data integration (DI) are scarce. In this work, we study one of the key DI tasks, nested mapping generation in the context of integrating heterogeneous JSON based data sources. We propose a novel mapping representation, namely bucket forest mappings that models the nested mappings in an efficient and native manner. We show experimentally the practicality of our approach over six real world data sets. Moreover, via intensive experiments over synthetic scenarios we demonstrate that our approach scales well to the increasing metadata complexity of DI scenarios. More... »

PAGES

397-405

References to SciGraph publications

  • 2010-04. Schema mapping and query translation in heterogeneous P2P XML databases in THE VLDB JOURNAL
  • 2005-08. Canonical forms for labelled trees and their applications in frequent subtree mining in KNOWLEDGE AND INFORMATION SYSTEMS
  • 2018. Query Rewriting for Heterogeneous Data Lakes in ADVANCES IN DATABASES AND INFORMATION SYSTEMS
  • Book

    TITLE

    Conceptual Modeling

    ISBN

    978-3-030-00846-8
    978-3-030-00847-5

    Identifiers

    URI

    http://scigraph.springernature.com/pub.10.1007/978-3-030-00847-5_28

    DOI

    http://dx.doi.org/10.1007/978-3-030-00847-5_28

    DIMENSIONS

    https://app.dimensions.ai/details/publication/pub.1107244401


    Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
    Incoming Citations Browse incoming citations for this publication using opencitations.net

    JSON-LD is the canonical representation for SciGraph data.

    TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0909", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Geomatic Engineering", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/09", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Engineering", 
            "type": "DefinedTerm"
          }
        ], 
        "author": [
          {
            "affiliation": {
              "alternateName": "RWTH Aachen University", 
              "id": "https://www.grid.ac/institutes/grid.1957.a", 
              "name": [
                "RWTH Aachen University, Aachen, Germany"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Hai", 
            "givenName": "Rihan", 
            "id": "sg:person.010301723336.84", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010301723336.84"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Fraunhofer Institute for Applied Information Technology", 
              "id": "https://www.grid.ac/institutes/grid.469870.4", 
              "name": [
                "RWTH Aachen University, Aachen, Germany", 
                "Fraunhofer Institute for Applied Information Technology FIT, Sankt Augustin, Germany"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Quix", 
            "givenName": "Christoph", 
            "id": "sg:person.014024640471.57", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014024640471.57"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "name": [
                "SAP Innovation Center Network, Potsdam, Germany"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Kensche", 
            "givenName": "David", 
            "id": "sg:person.014030065313.09", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014030065313.09"
            ], 
            "type": "Person"
          }
        ], 
        "citation": [
          {
            "id": "sg:pub.10.1007/s10115-004-0180-7", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1004647913", 
              "https://doi.org/10.1007/s10115-004-0180-7"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1007/s00778-009-0159-9", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1007466787", 
              "https://doi.org/10.1007/s00778-009-0159-9"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1007/s00778-009-0159-9", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1007466787", 
              "https://doi.org/10.1007/s00778-009-0159-9"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1007/s00778-009-0159-9", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1007466787", 
              "https://doi.org/10.1007/s00778-009-0159-9"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1145/2882903.2899389", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1007701235"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1145/1514894.1514903", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1015900557"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1145/1007568.1007611", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1022610279"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/0022-0000(86)90058-9", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1038969702"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/j.datak.2009.02.006", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1041891708"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.14778/2777598.2777601", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1067368674"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.14778/2850583.2850586", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1067368842"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1007/978-3-319-98398-1_3", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1105897382", 
              "https://doi.org/10.1007/978-3-319-98398-1_3"
            ], 
            "type": "CreativeWork"
          }
        ], 
        "datePublished": "2018-09-26", 
        "datePublishedReg": "2018-09-26", 
        "description": "JSON has become one of the most popular data formats. Yet studies on JSON data integration (DI) are scarce. In this work, we study one of the key DI tasks, nested mapping generation in the context of integrating heterogeneous JSON based data sources. We propose a novel mapping representation, namely bucket forest mappings that models the nested mappings in an efficient and native manner. We show experimentally the practicality of our approach over six real world data sets. Moreover, via intensive experiments over synthetic scenarios we demonstrate that our approach scales well to the increasing metadata complexity of DI scenarios.", 
        "editor": [
          {
            "familyName": "Trujillo", 
            "givenName": "Juan C.", 
            "type": "Person"
          }, 
          {
            "familyName": "Davis", 
            "givenName": "Karen C.", 
            "type": "Person"
          }, 
          {
            "familyName": "Du", 
            "givenName": "Xiaoyong", 
            "type": "Person"
          }, 
          {
            "familyName": "Li", 
            "givenName": "Zhanhuai", 
            "type": "Person"
          }, 
          {
            "familyName": "Ling", 
            "givenName": "Tok Wang", 
            "type": "Person"
          }, 
          {
            "familyName": "Li", 
            "givenName": "Guoliang", 
            "type": "Person"
          }, 
          {
            "familyName": "Lee", 
            "givenName": "Mong Li", 
            "type": "Person"
          }
        ], 
        "genre": "chapter", 
        "id": "sg:pub.10.1007/978-3-030-00847-5_28", 
        "inLanguage": [
          "en"
        ], 
        "isAccessibleForFree": false, 
        "isPartOf": {
          "isbn": [
            "978-3-030-00846-8", 
            "978-3-030-00847-5"
          ], 
          "name": "Conceptual Modeling", 
          "type": "Book"
        }, 
        "name": "Nested Schema Mappings for Integrating JSON", 
        "pagination": "397-405", 
        "productId": [
          {
            "name": "doi", 
            "type": "PropertyValue", 
            "value": [
              "10.1007/978-3-030-00847-5_28"
            ]
          }, 
          {
            "name": "readcube_id", 
            "type": "PropertyValue", 
            "value": [
              "0d1b8f297badac4c2c3b66edabe74138ee363f043d839ca3f48f0bbfd3608adc"
            ]
          }, 
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "pub.1107244401"
            ]
          }
        ], 
        "publisher": {
          "location": "Cham", 
          "name": "Springer International Publishing", 
          "type": "Organisation"
        }, 
        "sameAs": [
          "https://doi.org/10.1007/978-3-030-00847-5_28", 
          "https://app.dimensions.ai/details/publication/pub.1107244401"
        ], 
        "sdDataset": "chapters", 
        "sdDatePublished": "2019-04-16T04:39", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000321_0000000321/records_74918_00000000.jsonl", 
        "type": "Chapter", 
        "url": "https://link.springer.com/10.1007%2F978-3-030-00847-5_28"
      }
    ]
     

    Download the RDF metadata as:  json-ld nt turtle xml License info

    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-030-00847-5_28'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-030-00847-5_28'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-030-00847-5_28'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-030-00847-5_28'


     

    This table displays all metadata directly associated to this object as RDF triples.

    148 TRIPLES      23 PREDICATES      36 URIs      19 LITERALS      8 BLANK NODES

    Subject Predicate Object
    1 sg:pub.10.1007/978-3-030-00847-5_28 schema:about anzsrc-for:09
    2 anzsrc-for:0909
    3 schema:author N8da27b6668c34fb3bc468d669d7c142c
    4 schema:citation sg:pub.10.1007/978-3-319-98398-1_3
    5 sg:pub.10.1007/s00778-009-0159-9
    6 sg:pub.10.1007/s10115-004-0180-7
    7 https://doi.org/10.1016/0022-0000(86)90058-9
    8 https://doi.org/10.1016/j.datak.2009.02.006
    9 https://doi.org/10.1145/1007568.1007611
    10 https://doi.org/10.1145/1514894.1514903
    11 https://doi.org/10.1145/2882903.2899389
    12 https://doi.org/10.14778/2777598.2777601
    13 https://doi.org/10.14778/2850583.2850586
    14 schema:datePublished 2018-09-26
    15 schema:datePublishedReg 2018-09-26
    16 schema:description JSON has become one of the most popular data formats. Yet studies on JSON data integration (DI) are scarce. In this work, we study one of the key DI tasks, nested mapping generation in the context of integrating heterogeneous JSON based data sources. We propose a novel mapping representation, namely bucket forest mappings that models the nested mappings in an efficient and native manner. We show experimentally the practicality of our approach over six real world data sets. Moreover, via intensive experiments over synthetic scenarios we demonstrate that our approach scales well to the increasing metadata complexity of DI scenarios.
    17 schema:editor Nb74be215e5cf4307be7c1e88034292e5
    18 schema:genre chapter
    19 schema:inLanguage en
    20 schema:isAccessibleForFree false
    21 schema:isPartOf Nbabf40ebbcfc4cf19f1ce98fe46072fd
    22 schema:name Nested Schema Mappings for Integrating JSON
    23 schema:pagination 397-405
    24 schema:productId N7ae9f6c9a30441f1903fa635b0458920
    25 N9be0cd52fcb54c388310c719d01eee49
    26 Nd76feaee542e415c97b9677f70e57e94
    27 schema:publisher N8da4cc0fde43478cacb0f52c926d89f4
    28 schema:sameAs https://app.dimensions.ai/details/publication/pub.1107244401
    29 https://doi.org/10.1007/978-3-030-00847-5_28
    30 schema:sdDatePublished 2019-04-16T04:39
    31 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    32 schema:sdPublisher N78d5a45768b94fb0a18faa64f5d2da1a
    33 schema:url https://link.springer.com/10.1007%2F978-3-030-00847-5_28
    34 sgo:license sg:explorer/license/
    35 sgo:sdDataset chapters
    36 rdf:type schema:Chapter
    37 N163d1199321e4963bad404af55474cf1 rdf:first N9b5a979a48094ca38ef1756b7135941b
    38 rdf:rest N85bfa05803c3450f80f2207f55b70123
    39 N410684fad5b94ad8aff4f292fbecfd13 rdf:first sg:person.014024640471.57
    40 rdf:rest Na53150ddfd204e07a053efd914190f02
    41 N4d8f1ae2719a4ddca419827563990355 schema:name SAP Innovation Center Network, Potsdam, Germany
    42 rdf:type schema:Organization
    43 N529a308bc16e42aa8dbf4d0e32b2bf4b rdf:first Nb46a022bc3434cb68ef51dd27401383f
    44 rdf:rest Neab54a5043bd43ae806d367dbe9fabe4
    45 N5fcf8bb542b948a68b82358a0bae2827 rdf:first Ndd32c33365d94ccfb3acaf8f0fafe5f5
    46 rdf:rest N529a308bc16e42aa8dbf4d0e32b2bf4b
    47 N77d6c3c6ec4b494eb39248c2f194bf16 schema:familyName Li
    48 schema:givenName Zhanhuai
    49 rdf:type schema:Person
    50 N78d5a45768b94fb0a18faa64f5d2da1a schema:name Springer Nature - SN SciGraph project
    51 rdf:type schema:Organization
    52 N7ae9f6c9a30441f1903fa635b0458920 schema:name dimensions_id
    53 schema:value pub.1107244401
    54 rdf:type schema:PropertyValue
    55 N85bfa05803c3450f80f2207f55b70123 rdf:first Nef8778f763a34327b7736bb851219d85
    56 rdf:rest N8995f4f2c0214ccbb4037b500166684f
    57 N8995f4f2c0214ccbb4037b500166684f rdf:first N77d6c3c6ec4b494eb39248c2f194bf16
    58 rdf:rest N5fcf8bb542b948a68b82358a0bae2827
    59 N8da27b6668c34fb3bc468d669d7c142c rdf:first sg:person.010301723336.84
    60 rdf:rest N410684fad5b94ad8aff4f292fbecfd13
    61 N8da4cc0fde43478cacb0f52c926d89f4 schema:location Cham
    62 schema:name Springer International Publishing
    63 rdf:type schema:Organisation
    64 N9b5a979a48094ca38ef1756b7135941b schema:familyName Davis
    65 schema:givenName Karen C.
    66 rdf:type schema:Person
    67 N9be0cd52fcb54c388310c719d01eee49 schema:name doi
    68 schema:value 10.1007/978-3-030-00847-5_28
    69 rdf:type schema:PropertyValue
    70 Na53150ddfd204e07a053efd914190f02 rdf:first sg:person.014030065313.09
    71 rdf:rest rdf:nil
    72 Nb33eb90626c34ba0994b4490691a3ec6 schema:familyName Trujillo
    73 schema:givenName Juan C.
    74 rdf:type schema:Person
    75 Nb46a022bc3434cb68ef51dd27401383f schema:familyName Li
    76 schema:givenName Guoliang
    77 rdf:type schema:Person
    78 Nb74be215e5cf4307be7c1e88034292e5 rdf:first Nb33eb90626c34ba0994b4490691a3ec6
    79 rdf:rest N163d1199321e4963bad404af55474cf1
    80 Nbabf40ebbcfc4cf19f1ce98fe46072fd schema:isbn 978-3-030-00846-8
    81 978-3-030-00847-5
    82 schema:name Conceptual Modeling
    83 rdf:type schema:Book
    84 Nc1314d4d2c7f4e9984adccc3f4b79b7d schema:familyName Lee
    85 schema:givenName Mong Li
    86 rdf:type schema:Person
    87 Nd76feaee542e415c97b9677f70e57e94 schema:name readcube_id
    88 schema:value 0d1b8f297badac4c2c3b66edabe74138ee363f043d839ca3f48f0bbfd3608adc
    89 rdf:type schema:PropertyValue
    90 Ndd32c33365d94ccfb3acaf8f0fafe5f5 schema:familyName Ling
    91 schema:givenName Tok Wang
    92 rdf:type schema:Person
    93 Neab54a5043bd43ae806d367dbe9fabe4 rdf:first Nc1314d4d2c7f4e9984adccc3f4b79b7d
    94 rdf:rest rdf:nil
    95 Nef8778f763a34327b7736bb851219d85 schema:familyName Du
    96 schema:givenName Xiaoyong
    97 rdf:type schema:Person
    98 anzsrc-for:09 schema:inDefinedTermSet anzsrc-for:
    99 schema:name Engineering
    100 rdf:type schema:DefinedTerm
    101 anzsrc-for:0909 schema:inDefinedTermSet anzsrc-for:
    102 schema:name Geomatic Engineering
    103 rdf:type schema:DefinedTerm
    104 sg:person.010301723336.84 schema:affiliation https://www.grid.ac/institutes/grid.1957.a
    105 schema:familyName Hai
    106 schema:givenName Rihan
    107 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010301723336.84
    108 rdf:type schema:Person
    109 sg:person.014024640471.57 schema:affiliation https://www.grid.ac/institutes/grid.469870.4
    110 schema:familyName Quix
    111 schema:givenName Christoph
    112 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014024640471.57
    113 rdf:type schema:Person
    114 sg:person.014030065313.09 schema:affiliation N4d8f1ae2719a4ddca419827563990355
    115 schema:familyName Kensche
    116 schema:givenName David
    117 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014030065313.09
    118 rdf:type schema:Person
    119 sg:pub.10.1007/978-3-319-98398-1_3 schema:sameAs https://app.dimensions.ai/details/publication/pub.1105897382
    120 https://doi.org/10.1007/978-3-319-98398-1_3
    121 rdf:type schema:CreativeWork
    122 sg:pub.10.1007/s00778-009-0159-9 schema:sameAs https://app.dimensions.ai/details/publication/pub.1007466787
    123 https://doi.org/10.1007/s00778-009-0159-9
    124 rdf:type schema:CreativeWork
    125 sg:pub.10.1007/s10115-004-0180-7 schema:sameAs https://app.dimensions.ai/details/publication/pub.1004647913
    126 https://doi.org/10.1007/s10115-004-0180-7
    127 rdf:type schema:CreativeWork
    128 https://doi.org/10.1016/0022-0000(86)90058-9 schema:sameAs https://app.dimensions.ai/details/publication/pub.1038969702
    129 rdf:type schema:CreativeWork
    130 https://doi.org/10.1016/j.datak.2009.02.006 schema:sameAs https://app.dimensions.ai/details/publication/pub.1041891708
    131 rdf:type schema:CreativeWork
    132 https://doi.org/10.1145/1007568.1007611 schema:sameAs https://app.dimensions.ai/details/publication/pub.1022610279
    133 rdf:type schema:CreativeWork
    134 https://doi.org/10.1145/1514894.1514903 schema:sameAs https://app.dimensions.ai/details/publication/pub.1015900557
    135 rdf:type schema:CreativeWork
    136 https://doi.org/10.1145/2882903.2899389 schema:sameAs https://app.dimensions.ai/details/publication/pub.1007701235
    137 rdf:type schema:CreativeWork
    138 https://doi.org/10.14778/2777598.2777601 schema:sameAs https://app.dimensions.ai/details/publication/pub.1067368674
    139 rdf:type schema:CreativeWork
    140 https://doi.org/10.14778/2850583.2850586 schema:sameAs https://app.dimensions.ai/details/publication/pub.1067368842
    141 rdf:type schema:CreativeWork
    142 https://www.grid.ac/institutes/grid.1957.a schema:alternateName RWTH Aachen University
    143 schema:name RWTH Aachen University, Aachen, Germany
    144 rdf:type schema:Organization
    145 https://www.grid.ac/institutes/grid.469870.4 schema:alternateName Fraunhofer Institute for Applied Information Technology
    146 schema:name Fraunhofer Institute for Applied Information Technology FIT, Sankt Augustin, Germany
    147 RWTH Aachen University, Aachen, Germany
    148 rdf:type schema:Organization
     




    Preview window. Press ESC to close (or click here)


    ...