Distributed RDF Query Answering with Dynamic Data Exchange


Ontology type: schema:Chapter      Open Access: True


Chapter Info

DATE

2016-09-23

AUTHORS

Anthony Potter , Boris Motik , Yavor Nenov , Ian Horrocks

ABSTRACT

Evaluating joins over RDF data stored in a shared-nothing server cluster is key to processing truly large RDF datasets. To the best of our knowledge, the existing approaches use a variant of the data exchange operator that is inserted into the query plan statically (i.e., at query compile time) to shuffle data between servers. We argue that such approaches often miss opportunities for local computation, and we present a novel solution to distributed query answering that consists of two main components. First, we present a query answering algorithm based on dynamic data exchange, which exploits data locality to maximise the amount of computation on a single server. Second, we present a partitioning algorithm for RDF data based on graph partitioning whose aim is to increase data locality. We have implemented our approach in the RDFox system, and our performance evaluation suggests that our techniques outperform the state of the art by up to an order of magnitude in terms of query evaluation times, network communication, and memory use.

PAGES

480-497

Book

TITLE

The Semantic Web – ISWC 2016

ISBN

978-3-319-46522-7
978-3-319-46523-4

Author Affiliations

University of Oxford, Oxford, UK (all four authors)

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/978-3-319-46523-4_29

DOI

http://dx.doi.org/10.1007/978-3-319-46523-4_29

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1084848381



JSON-LD is the canonical representation for SciGraph data.


[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0806", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information Systems", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "University of Oxford, Oxford, UK", 
          "id": "http://www.grid.ac/institutes/grid.4991.5", 
          "name": [
            "University of Oxford, Oxford, UK"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Potter", 
        "givenName": "Anthony", 
        "id": "sg:person.014264077713.58", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014264077713.58"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Oxford, Oxford, UK", 
          "id": "http://www.grid.ac/institutes/grid.4991.5", 
          "name": [
            "University of Oxford, Oxford, UK"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Motik", 
        "givenName": "Boris", 
        "id": "sg:person.07401076267.36", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07401076267.36"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Oxford, Oxford, UK", 
          "id": "http://www.grid.ac/institutes/grid.4991.5", 
          "name": [
            "University of Oxford, Oxford, UK"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Nenov", 
        "givenName": "Yavor", 
        "id": "sg:person.010530051131.00", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010530051131.00"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Oxford, Oxford, UK", 
          "id": "http://www.grid.ac/institutes/grid.4991.5", 
          "name": [
            "University of Oxford, Oxford, UK"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Horrocks", 
        "givenName": "Ian", 
        "id": "sg:person.013100561643.19", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013100561643.19"
        ], 
        "type": "Person"
      }
    ], 
    "datePublished": "2016-09-23", 
    "datePublishedReg": "2016-09-23", 
    "description": "Evaluating joins over RDF data stored in a shared-nothing server cluster is key to processing truly large RDF datasets. To the best of our knowledge, the existing approaches use a variant of the data exchange operator that is inserted into the query plan statically (i.e., at query compile time) to shuffle data between servers. We argue that such approaches often miss opportunities for local computation, and we present a novel solution to distributed query answering that consists of two main components. First, we present a query answering algorithm based on dynamic data exchange, which exploits data locality to maximise the amount of computation on a single server. Second, we present a partitioning algorithm for RDF data based on graph partitioning whose aim is to increase data locality. We have implemented our approach in the RDFox system, and our performance evaluation suggests that our techniques outperform the state of the art by up\u00a0to an order of magnitude in terms of query evaluation times, network communication, and memory use.", 
    "editor": [
      {
        "familyName": "Groth", 
        "givenName": "Paul", 
        "type": "Person"
      }, 
      {
        "familyName": "Simperl", 
        "givenName": "Elena", 
        "type": "Person"
      }, 
      {
        "familyName": "Gray", 
        "givenName": "Alasdair", 
        "type": "Person"
      }, 
      {
        "familyName": "Sabou", 
        "givenName": "Marta", 
        "type": "Person"
      }, 
      {
        "familyName": "Kr\u00f6tzsch", 
        "givenName": "Markus", 
        "type": "Person"
      }, 
      {
        "familyName": "Lecue", 
        "givenName": "Freddy", 
        "type": "Person"
      }, 
      {
        "familyName": "Fl\u00f6ck", 
        "givenName": "Fabian", 
        "type": "Person"
      }, 
      {
        "familyName": "Gil", 
        "givenName": "Yolanda", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/978-3-319-46523-4_29", 
    "inLanguage": "en", 
    "isAccessibleForFree": true, 
    "isPartOf": {
      "isbn": [
        "978-3-319-46522-7", 
        "978-3-319-46523-4"
      ], 
      "name": "The Semantic Web \u2013 ISWC 2016", 
      "type": "Book"
    }, 
    "keywords": [
      "dynamic data exchange", 
      "RDF data", 
      "data locality", 
      "data exchange", 
      "large RDF datasets", 
      "query evaluation time", 
      "query answering algorithm", 
      "amount of computation", 
      "RDF queries", 
      "RDF datasets", 
      "query plans", 
      "server cluster", 
      "network communication", 
      "local computation", 
      "partitioning algorithm", 
      "memory use", 
      "single server", 
      "novel solution", 
      "performance evaluation", 
      "queries", 
      "server", 
      "algorithm", 
      "such approaches", 
      "evaluation time", 
      "computation", 
      "datasets", 
      "graph", 
      "communication", 
      "main components", 
      "exchange operator", 
      "operators", 
      "data", 
      "art", 
      "system", 
      "localities", 
      "technique", 
      "orders of magnitude", 
      "solution", 
      "clusters", 
      "knowledge", 
      "exchange", 
      "order", 
      "plan", 
      "terms", 
      "evaluation", 
      "opportunities", 
      "time", 
      "use", 
      "components", 
      "amount", 
      "state", 
      "variants", 
      "aim", 
      "magnitude", 
      "approach", 
      "data exchange operator", 
      "answering algorithm", 
      "RDFox system"
    ], 
    "name": "Distributed RDF Query Answering with Dynamic Data Exchange", 
    "pagination": "480-497", 
    "productId": [
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1084848381"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/978-3-319-46523-4_29"
        ]
      }
    ], 
    "publisher": {
      "name": "Springer Nature", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/978-3-319-46523-4_29", 
      "https://app.dimensions.ai/details/publication/pub.1084848381"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2022-01-01T19:22", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-springernature-scigraph/baseset/20220101/entities/gbq_results/chapter/chapter_381.jsonl", 
    "type": "Chapter", 
    "url": "https://doi.org/10.1007/978-3-319-46523-4_29"
  }
]
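As a minimal sketch of how the JSON-LD record above can be consumed, the following Python snippet parses a trimmed copy of it with the standard `json` module and pulls out the title, author names, page range, and DOI. The `RECORD_JSON` constant and the `summarize` helper are hypothetical names introduced for illustration; only the field names and values come from the record itself.

```python
import json

# Trimmed copy of the record above: only the fields read below are kept.
RECORD_JSON = """
[
  {
    "name": "Distributed RDF Query Answering with Dynamic Data Exchange",
    "pagination": "480-497",
    "author": [
      {"familyName": "Potter", "givenName": "Anthony", "type": "Person"},
      {"familyName": "Motik", "givenName": "Boris", "type": "Person"},
      {"familyName": "Nenov", "givenName": "Yavor", "type": "Person"},
      {"familyName": "Horrocks", "givenName": "Ian", "type": "Person"}
    ],
    "productId": [
      {"name": "dimensions_id", "type": "PropertyValue",
       "value": ["pub.1084848381"]},
      {"name": "doi", "type": "PropertyValue",
       "value": ["10.1007/978-3-319-46523-4_29"]}
    ]
  }
]
"""


def summarize(record_json: str) -> dict:
    """Return title, authors, pages, and DOI from a SciGraph-style record."""
    # The payload is a JSON array holding a single record object.
    record = json.loads(record_json)[0]
    # Authors are listed in publication order.
    authors = [f"{a['givenName']} {a['familyName']}" for a in record["author"]]
    # Each productId entry maps an identifier name to a one-element list.
    ids = {p["name"]: p["value"][0] for p in record["productId"]}
    return {
        "title": record["name"],
        "authors": authors,
        "pages": record["pagination"],
        "doi": ids.get("doi"),
    }
```

Calling `summarize(RECORD_JSON)` returns a plain dictionary. The same function should work on the full record, since it only reads keys that appear there.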
 


HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-46523-4_29'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-46523-4_29'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-46523-4_29'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-46523-4_29'
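The four curl commands differ only in the `Accept` header, so the same content negotiation can be sketched in Python with the standard `urllib.request` module. This is an illustrative sketch, not an official client: the `MEDIA_TYPES` keys, `build_request`, and `fetch` are hypothetical names, and the snippet assumes the endpoint is still reachable and still honours these media types.

```python
from urllib.request import Request, urlopen

# Record URL taken from the curl commands above.
RECORD_URL = "https://scigraph.springernature.com/pub.10.1007/978-3-319-46523-4_29"

# Short format keys (our own labels) mapped to the Accept values used above.
MEDIA_TYPES = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(fmt: str, url: str = RECORD_URL) -> Request:
    """Build a GET request whose Accept header selects the RDF serialization."""
    if fmt not in MEDIA_TYPES:
        raise ValueError(f"unknown format {fmt!r}; expected one of {sorted(MEDIA_TYPES)}")
    return Request(url, headers={"Accept": MEDIA_TYPES[fmt]})


def fetch(fmt: str, url: str = RECORD_URL, timeout: float = 30.0) -> str:
    """Download the record in the requested serialization (needs network access)."""
    with urlopen(build_request(fmt, url), timeout=timeout) as response:
        return response.read().decode("utf-8")

# Example use (performs a network call, so it is left commented out):
# print(fetch("turtle")[:500])
```

Keeping request construction separate from the network call makes the header logic easy to check offline.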


 

This table displays all metadata directly associated with this object as RDF triples.

174 TRIPLES      23 PREDICATES      83 URIs      76 LITERALS      7 BLANK NODES

Row Subject Predicate Object (a row that omits the subject or predicate repeats the value from the row above)
1 sg:pub.10.1007/978-3-319-46523-4_29 schema:about anzsrc-for:08
2 anzsrc-for:0806
3 schema:author N8e5237ded8104831b5bfbd2d4d40ef0a
4 schema:datePublished 2016-09-23
5 schema:datePublishedReg 2016-09-23
6 schema:description Evaluating joins over RDF data stored in a shared-nothing server cluster is key to processing truly large RDF datasets. To the best of our knowledge, the existing approaches use a variant of the data exchange operator that is inserted into the query plan statically (i.e., at query compile time) to shuffle data between servers. We argue that such approaches often miss opportunities for local computation, and we present a novel solution to distributed query answering that consists of two main components. First, we present a query answering algorithm based on dynamic data exchange, which exploits data locality to maximise the amount of computation on a single server. Second, we present a partitioning algorithm for RDF data based on graph partitioning whose aim is to increase data locality. We have implemented our approach in the RDFox system, and our performance evaluation suggests that our techniques outperform the state of the art by up to an order of magnitude in terms of query evaluation times, network communication, and memory use.
7 schema:editor N54b536dd3b0943d0bdff6a74429e4a4a
8 schema:genre chapter
9 schema:inLanguage en
10 schema:isAccessibleForFree true
11 schema:isPartOf N28622402288b4b18a60ff86bf17e64bd
12 schema:keywords RDF data
13 RDF datasets
14 RDF queries
15 RDFox system
16 aim
17 algorithm
18 amount
19 amount of computation
20 answering algorithm
21 approach
22 art
23 clusters
24 communication
25 components
26 computation
27 data
28 data exchange
29 data exchange operator
30 data locality
31 datasets
32 dynamic data exchange
33 evaluation
34 evaluation time
35 exchange
36 exchange operator
37 graph
38 knowledge
39 large RDF datasets
40 local computation
41 localities
42 magnitude
43 main components
44 memory use
45 network communication
46 novel solution
47 operators
48 opportunities
49 order
50 orders of magnitude
51 partitioning algorithm
52 performance evaluation
53 plan
54 queries
55 query answering algorithm
56 query evaluation time
57 query plans
58 server
59 server cluster
60 single server
61 solution
62 state
63 such approaches
64 system
65 technique
66 terms
67 time
68 use
69 variants
70 schema:name Distributed RDF Query Answering with Dynamic Data Exchange
71 schema:pagination 480-497
72 schema:productId N79802bf3ea924d10a884d677a15dea13
73 Na0f08fc4bfd940aa93285e2ee4ca926d
74 schema:publisher Na6089e6f41f44db79aa9e41311216b57
75 schema:sameAs https://app.dimensions.ai/details/publication/pub.1084848381
76 https://doi.org/10.1007/978-3-319-46523-4_29
77 schema:sdDatePublished 2022-01-01T19:22
78 schema:sdLicense https://scigraph.springernature.com/explorer/license/
79 schema:sdPublisher N49bb942e720743e1867b0a13cf5ce27c
80 schema:url https://doi.org/10.1007/978-3-319-46523-4_29
81 sgo:license sg:explorer/license/
82 sgo:sdDataset chapters
83 rdf:type schema:Chapter
84 N06be68a27a4e4f54afbc32eda3cdf41a schema:familyName Flöck
85 schema:givenName Fabian
86 rdf:type schema:Person
87 N177971dc8e7e4632a73f3eabba6c8809 schema:familyName Krötzsch
88 schema:givenName Markus
89 rdf:type schema:Person
90 N17a63d69311b4416852a05c222575978 schema:familyName Groth
91 schema:givenName Paul
92 rdf:type schema:Person
93 N21592041ef9c406484a379b2cb95fcd7 rdf:first sg:person.010530051131.00
94 rdf:rest N891580fa9d5747c29d6cae3ed14f5307
95 N225b3e29fd5c4136be805264d671c4f2 rdf:first N7e26ae60367247c1b1ccccab19622e3e
96 rdf:rest N914bf5dfb63b4db69bab539844b856a8
97 N28622402288b4b18a60ff86bf17e64bd schema:isbn 978-3-319-46522-7
98 978-3-319-46523-4
99 schema:name The Semantic Web – ISWC 2016
100 rdf:type schema:Book
101 N34ff0acc8b594447925922b3cc36535f rdf:first N06be68a27a4e4f54afbc32eda3cdf41a
102 rdf:rest N9f597d3f802d4409913d2b2c6c89d57b
103 N42c2c349a3c64679a229c9dd4baa0a38 rdf:first sg:person.07401076267.36
104 rdf:rest N21592041ef9c406484a379b2cb95fcd7
105 N468b290cd28a4e17b6f369bb9de7f76f rdf:first N177971dc8e7e4632a73f3eabba6c8809
106 rdf:rest N6a6e16ad84034a029c621bf25e35f326
107 N49bb942e720743e1867b0a13cf5ce27c schema:name Springer Nature - SN SciGraph project
108 rdf:type schema:Organization
109 N54b536dd3b0943d0bdff6a74429e4a4a rdf:first N17a63d69311b4416852a05c222575978
110 rdf:rest N225b3e29fd5c4136be805264d671c4f2
111 N5a25c66017be40d0933782a1a7fa064c rdf:first Na46a0a6edd064248aec86aafce6febaa
112 rdf:rest N468b290cd28a4e17b6f369bb9de7f76f
113 N6a6e16ad84034a029c621bf25e35f326 rdf:first Nb1fea2d4d721492e8731673f45e092bd
114 rdf:rest N34ff0acc8b594447925922b3cc36535f
115 N78d0b76fd27043eaacee9fd84959dff1 schema:familyName Gray
116 schema:givenName Alasdair
117 rdf:type schema:Person
118 N79802bf3ea924d10a884d677a15dea13 schema:name doi
119 schema:value 10.1007/978-3-319-46523-4_29
120 rdf:type schema:PropertyValue
121 N7e26ae60367247c1b1ccccab19622e3e schema:familyName Simperl
122 schema:givenName Elena
123 rdf:type schema:Person
124 N891580fa9d5747c29d6cae3ed14f5307 rdf:first sg:person.013100561643.19
125 rdf:rest rdf:nil
126 N8e5237ded8104831b5bfbd2d4d40ef0a rdf:first sg:person.014264077713.58
127 rdf:rest N42c2c349a3c64679a229c9dd4baa0a38
128 N914bf5dfb63b4db69bab539844b856a8 rdf:first N78d0b76fd27043eaacee9fd84959dff1
129 rdf:rest N5a25c66017be40d0933782a1a7fa064c
130 N9f597d3f802d4409913d2b2c6c89d57b rdf:first Nd843f0c10e9548d3a1b11e2a2fcb9add
131 rdf:rest rdf:nil
132 Na0f08fc4bfd940aa93285e2ee4ca926d schema:name dimensions_id
133 schema:value pub.1084848381
134 rdf:type schema:PropertyValue
135 Na46a0a6edd064248aec86aafce6febaa schema:familyName Sabou
136 schema:givenName Marta
137 rdf:type schema:Person
138 Na6089e6f41f44db79aa9e41311216b57 schema:name Springer Nature
139 rdf:type schema:Organisation
140 Nb1fea2d4d721492e8731673f45e092bd schema:familyName Lecue
141 schema:givenName Freddy
142 rdf:type schema:Person
143 Nd843f0c10e9548d3a1b11e2a2fcb9add schema:familyName Gil
144 schema:givenName Yolanda
145 rdf:type schema:Person
146 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
147 schema:name Information and Computing Sciences
148 rdf:type schema:DefinedTerm
149 anzsrc-for:0806 schema:inDefinedTermSet anzsrc-for:
150 schema:name Information Systems
151 rdf:type schema:DefinedTerm
152 sg:person.010530051131.00 schema:affiliation grid-institutes:grid.4991.5
153 schema:familyName Nenov
154 schema:givenName Yavor
155 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010530051131.00
156 rdf:type schema:Person
157 sg:person.013100561643.19 schema:affiliation grid-institutes:grid.4991.5
158 schema:familyName Horrocks
159 schema:givenName Ian
160 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013100561643.19
161 rdf:type schema:Person
162 sg:person.014264077713.58 schema:affiliation grid-institutes:grid.4991.5
163 schema:familyName Potter
164 schema:givenName Anthony
165 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014264077713.58
166 rdf:type schema:Person
167 sg:person.07401076267.36 schema:affiliation grid-institutes:grid.4991.5
168 schema:familyName Motik
169 schema:givenName Boris
170 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07401076267.36
171 rdf:type schema:Person
172 grid-institutes:grid.4991.5 schema:alternateName University of Oxford, Oxford, UK
173 schema:name University of Oxford, Oxford, UK
174 rdf:type schema:Organization
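In the table above, `schema:author` points to a blank node that heads an RDF collection: each list node carries an `rdf:first` (the item) and an `rdf:rest` (the next node), and the list ends at `rdf:nil`. The following Python sketch walks that chain to recover the author order. The node identifiers and person IDs are copied from the table; the dictionary layout and the `walk_rdf_list` helper are assumptions made for illustration.

```python
RDF_NIL = "rdf:nil"

# List node -> (rdf:first, rdf:rest), copied from the author-list rows above.
AUTHOR_LIST = {
    "N8e5237ded8104831b5bfbd2d4d40ef0a":
        ("sg:person.014264077713.58", "N42c2c349a3c64679a229c9dd4baa0a38"),
    "N42c2c349a3c64679a229c9dd4baa0a38":
        ("sg:person.07401076267.36", "N21592041ef9c406484a379b2cb95fcd7"),
    "N21592041ef9c406484a379b2cb95fcd7":
        ("sg:person.010530051131.00", "N891580fa9d5747c29d6cae3ed14f5307"),
    "N891580fa9d5747c29d6cae3ed14f5307":
        ("sg:person.013100561643.19", RDF_NIL),
}

# schema:familyName for each person, also taken from the table.
FAMILY_NAMES = {
    "sg:person.014264077713.58": "Potter",
    "sg:person.07401076267.36": "Motik",
    "sg:person.010530051131.00": "Nenov",
    "sg:person.013100561643.19": "Horrocks",
}


def walk_rdf_list(head: str, nodes: dict) -> list:
    """Follow rdf:rest links from head until rdf:nil, collecting rdf:first values."""
    items = []
    seen = set()
    node = head
    while node != RDF_NIL:
        if node in seen:
            # Guard against malformed data that would loop forever.
            raise ValueError(f"cycle detected at list node {node}")
        seen.add(node)
        first, rest = nodes[node]
        items.append(first)
        node = rest
    return items


# The head node is the object of schema:author in row 3 of the table.
ordered_people = walk_rdf_list("N8e5237ded8104831b5bfbd2d4d40ef0a", AUTHOR_LIST)
ordered_names = [FAMILY_NAMES[p] for p in ordered_people]
```

The resulting order matches the author list in the chapter info. The `schema:editor` list in the table is encoded the same way and could be walked with the same helper.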
 



