Distributed RDF Query Answering with Dynamic Data Exchange


Ontology type: schema:Chapter      Open Access: True


Chapter Info

DATE

2016-09-23

AUTHORS

Anthony Potter , Boris Motik , Yavor Nenov , Ian Horrocks

ABSTRACT

Evaluating joins over RDF data stored in a shared-nothing server cluster is key to processing truly large RDF datasets. To the best of our knowledge, the existing approaches use a variant of the data exchange operator that is inserted into the query plan statically (i.e., at query compile time) to shuffle data between servers. We argue that such approaches often miss opportunities for local computation, and we present a novel solution to distributed query answering that consists of two main components. First, we present a query answering algorithm based on dynamic data exchange, which exploits data locality to maximise the amount of computation on a single server. Second, we present a partitioning algorithm for RDF data based on graph partitioning whose aim is to increase data locality. We have implemented our approach in the RDFox system, and our performance evaluation suggests that our techniques outperform the state of the art by up to an order of magnitude in terms of query evaluation times, network communication, and memory use.

PAGES

480-497

Book

TITLE

The Semantic Web – ISWC 2016

ISBN

978-3-319-46522-7
978-3-319-46523-4

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/978-3-319-46523-4_29

DOI

http://dx.doi.org/10.1007/978-3-319-46523-4_29

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1084848381



JSON-LD is the canonical representation for SciGraph data.

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0806", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information Systems", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "University of Oxford, Oxford, UK", 
          "id": "http://www.grid.ac/institutes/grid.4991.5", 
          "name": [
            "University of Oxford, Oxford, UK"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Potter", 
        "givenName": "Anthony", 
        "id": "sg:person.014264077713.58", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014264077713.58"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Oxford, Oxford, UK", 
          "id": "http://www.grid.ac/institutes/grid.4991.5", 
          "name": [
            "University of Oxford, Oxford, UK"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Motik", 
        "givenName": "Boris", 
        "id": "sg:person.07401076267.36", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07401076267.36"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Oxford, Oxford, UK", 
          "id": "http://www.grid.ac/institutes/grid.4991.5", 
          "name": [
            "University of Oxford, Oxford, UK"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Nenov", 
        "givenName": "Yavor", 
        "id": "sg:person.010530051131.00", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010530051131.00"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Oxford, Oxford, UK", 
          "id": "http://www.grid.ac/institutes/grid.4991.5", 
          "name": [
            "University of Oxford, Oxford, UK"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Horrocks", 
        "givenName": "Ian", 
        "id": "sg:person.013100561643.19", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013100561643.19"
        ], 
        "type": "Person"
      }
    ], 
    "datePublished": "2016-09-23", 
    "datePublishedReg": "2016-09-23", 
    "description": "Evaluating joins over RDF data stored in a shared-nothing server cluster is key to processing truly large RDF datasets. To the best of our knowledge, the existing approaches use a variant of the data exchange operator that is inserted into the query plan statically (i.e., at query compile time) to shuffle data between servers. We argue that such approaches often miss opportunities for local computation, and we present a novel solution to distributed query answering that consists of two main components. First, we present a query answering algorithm based on dynamic data exchange, which exploits data locality to maximise the amount of computation on a single server. Second, we present a partitioning algorithm for RDF data based on graph partitioning whose aim is to increase data locality. We have implemented our approach in the RDFox system, and our performance evaluation suggests that our techniques outperform the state of the art by up\u00a0to an order of magnitude in terms of query evaluation times, network communication, and memory use.", 
    "editor": [
      {
        "familyName": "Groth", 
        "givenName": "Paul", 
        "type": "Person"
      }, 
      {
        "familyName": "Simperl", 
        "givenName": "Elena", 
        "type": "Person"
      }, 
      {
        "familyName": "Gray", 
        "givenName": "Alasdair", 
        "type": "Person"
      }, 
      {
        "familyName": "Sabou", 
        "givenName": "Marta", 
        "type": "Person"
      }, 
      {
        "familyName": "Kr\u00f6tzsch", 
        "givenName": "Markus", 
        "type": "Person"
      }, 
      {
        "familyName": "Lecue", 
        "givenName": "Freddy", 
        "type": "Person"
      }, 
      {
        "familyName": "Fl\u00f6ck", 
        "givenName": "Fabian", 
        "type": "Person"
      }, 
      {
        "familyName": "Gil", 
        "givenName": "Yolanda", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/978-3-319-46523-4_29", 
    "inLanguage": "en", 
    "isAccessibleForFree": true, 
    "isPartOf": {
      "isbn": [
        "978-3-319-46522-7", 
        "978-3-319-46523-4"
      ], 
      "name": "The Semantic Web \u2013 ISWC 2016", 
      "type": "Book"
    }, 
    "keywords": [
      "dynamic data exchange", 
      "RDF data", 
      "data locality", 
      "data exchange", 
      "query evaluation time", 
      "large RDF datasets", 
      "query answering algorithm", 
      "amount of computation", 
      "RDF queries", 
      "query plans", 
      "RDF datasets", 
      "server cluster", 
      "network communication", 
      "local computation", 
      "memory use", 
      "single server", 
      "partitioning algorithm", 
      "novel solution", 
      "queries", 
      "server", 
      "performance evaluation", 
      "algorithm", 
      "such approaches", 
      "evaluation time", 
      "computation", 
      "dataset", 
      "graph", 
      "exchange operator", 
      "main components", 
      "communication", 
      "data", 
      "operators", 
      "art", 
      "system", 
      "localities", 
      "technique", 
      "orders of magnitude", 
      "exchange", 
      "solution", 
      "clusters", 
      "knowledge", 
      "order", 
      "plan", 
      "opportunities", 
      "evaluation", 
      "terms", 
      "time", 
      "use", 
      "amount", 
      "components", 
      "state", 
      "variants", 
      "aim", 
      "magnitude", 
      "approach"
    ], 
    "name": "Distributed RDF Query Answering with Dynamic Data Exchange", 
    "pagination": "480-497", 
    "productId": [
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1084848381"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/978-3-319-46523-4_29"
        ]
      }
    ], 
    "publisher": {
      "name": "Springer Nature", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/978-3-319-46523-4_29", 
      "https://app.dimensions.ai/details/publication/pub.1084848381"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2022-05-20T07:41", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-springernature-scigraph/baseset/20220519/entities/gbq_results/chapter/chapter_101.jsonl", 
    "type": "Chapter", 
    "url": "https://doi.org/10.1007/978-3-319-46523-4_29"
  }
]
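
As a minimal sketch of how a record of this shape can be read, the Python snippet below pulls the title, author names, and DOI out of the JSON-LD array. The embedded `RECORD_JSON` is a trimmed copy of the record above, and `summarize` is a hypothetical helper name, not part of any SciGraph tooling.

```python
import json

# Trimmed copy of the record above; only the fields used below are kept.
RECORD_JSON = """
[
  {
    "author": [
      {"familyName": "Potter", "givenName": "Anthony", "id": "sg:person.014264077713.58"},
      {"familyName": "Motik", "givenName": "Boris", "id": "sg:person.07401076267.36"},
      {"familyName": "Nenov", "givenName": "Yavor", "id": "sg:person.010530051131.00"},
      {"familyName": "Horrocks", "givenName": "Ian", "id": "sg:person.013100561643.19"}
    ],
    "name": "Distributed RDF Query Answering with Dynamic Data Exchange",
    "pagination": "480-497",
    "productId": [
      {"name": "dimensions_id", "type": "PropertyValue", "value": ["pub.1084848381"]},
      {"name": "doi", "type": "PropertyValue", "value": ["10.1007/978-3-319-46523-4_29"]}
    ]
  }
]
"""


def summarize(record_json):
    """Return title, author names, and DOI from a SciGraph-style JSON-LD array."""
    record = json.loads(record_json)[0]  # the document is a one-element array
    authors = [
        f"{a['givenName']} {a['familyName']}" for a in record.get("author", [])
    ]
    # productId is a list of name/value pairs; pick the entry named "doi".
    doi = next(
        (p["value"][0] for p in record.get("productId", []) if p.get("name") == "doi"),
        None,
    )
    return {"title": record.get("name"), "authors": authors, "doi": doi}


summary = summarize(RECORD_JSON)
print(summary)
```

The sketch treats the record as plain JSON and does not expand the `@context`; a JSON-LD processor would be needed to resolve terms to full IRIs.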
 

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-46523-4_29'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-46523-4_29'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-46523-4_29'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-46523-4_29'
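
The same content negotiation can be sketched in Python with the standard library. The media types and base URL are taken from the curl commands above; the helper names `build_request` and `fetch`, the short format keys, and the assumption that responses are UTF-8 text are illustrative only, and the endpoint may not respond as it did when this page was captured.

```python
from urllib.request import Request, urlopen

BASE_URL = "https://scigraph.springernature.com/"

# Accept headers copied from the curl commands above; the short keys are
# hypothetical labels chosen for this sketch.
ACCEPT = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(record_id, fmt="json-ld"):
    """Build (but do not send) a GET request asking for one RDF serialization."""
    if fmt not in ACCEPT:
        raise ValueError(f"unknown format {fmt!r}; expected one of {sorted(ACCEPT)}")
    return Request(BASE_URL + record_id, headers={"Accept": ACCEPT[fmt]})


def fetch(record_id, fmt="json-ld", timeout=30):
    """Send the request and return the body as text (assumes a UTF-8 response)."""
    with urlopen(build_request(record_id, fmt), timeout=timeout) as response:
        return response.read().decode("utf-8")


# Building the request involves no network traffic; call fetch() to send it.
request = build_request("pub.10.1007/978-3-319-46523-4_29", "turtle")
print(request.full_url, request.get_header("Accept"))
```

Separating `build_request` from `fetch` keeps the header logic checkable without network access.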


 

This table displays all metadata directly associated with this object as RDF triples.

171 TRIPLES      23 PREDICATES      80 URIs      73 LITERALS      7 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/978-3-319-46523-4_29 schema:about anzsrc-for:08
2 anzsrc-for:0806
3 schema:author N8279af4aa3744b71ab60248fb9064082
4 schema:datePublished 2016-09-23
5 schema:datePublishedReg 2016-09-23
6 schema:description Evaluating joins over RDF data stored in a shared-nothing server cluster is key to processing truly large RDF datasets. To the best of our knowledge, the existing approaches use a variant of the data exchange operator that is inserted into the query plan statically (i.e., at query compile time) to shuffle data between servers. We argue that such approaches often miss opportunities for local computation, and we present a novel solution to distributed query answering that consists of two main components. First, we present a query answering algorithm based on dynamic data exchange, which exploits data locality to maximise the amount of computation on a single server. Second, we present a partitioning algorithm for RDF data based on graph partitioning whose aim is to increase data locality. We have implemented our approach in the RDFox system, and our performance evaluation suggests that our techniques outperform the state of the art by up to an order of magnitude in terms of query evaluation times, network communication, and memory use.
7 schema:editor N792bc7e023c047d9a99c053375138cbe
8 schema:genre chapter
9 schema:inLanguage en
10 schema:isAccessibleForFree true
11 schema:isPartOf N4230ef10e1c74509b644b02dd889e1ff
12 schema:keywords RDF data
13 RDF datasets
14 RDF queries
15 aim
16 algorithm
17 amount
18 amount of computation
19 approach
20 art
21 clusters
22 communication
23 components
24 computation
25 data
26 data exchange
27 data locality
28 dataset
29 dynamic data exchange
30 evaluation
31 evaluation time
32 exchange
33 exchange operator
34 graph
35 knowledge
36 large RDF datasets
37 local computation
38 localities
39 magnitude
40 main components
41 memory use
42 network communication
43 novel solution
44 operators
45 opportunities
46 order
47 orders of magnitude
48 partitioning algorithm
49 performance evaluation
50 plan
51 queries
52 query answering algorithm
53 query evaluation time
54 query plans
55 server
56 server cluster
57 single server
58 solution
59 state
60 such approaches
61 system
62 technique
63 terms
64 time
65 use
66 variants
67 schema:name Distributed RDF Query Answering with Dynamic Data Exchange
68 schema:pagination 480-497
69 schema:productId N6992c80b8dec442a8a02ffc88c8f3d8b
70 Ndf76d099afdf4176935df2f539573a35
71 schema:publisher N81868ba9bf3148639660220f068f0847
72 schema:sameAs https://app.dimensions.ai/details/publication/pub.1084848381
73 https://doi.org/10.1007/978-3-319-46523-4_29
74 schema:sdDatePublished 2022-05-20T07:41
75 schema:sdLicense https://scigraph.springernature.com/explorer/license/
76 schema:sdPublisher N800cb4a429444975ad0cb33c4bf4d0ed
77 schema:url https://doi.org/10.1007/978-3-319-46523-4_29
78 sgo:license sg:explorer/license/
79 sgo:sdDataset chapters
80 rdf:type schema:Chapter
81 N091322c8f17944cbb08cd118fd01fba7 schema:familyName Gil
82 schema:givenName Yolanda
83 rdf:type schema:Person
84 N0d9eb07edc1c4c2bb942c710b4faf47d schema:familyName Simperl
85 schema:givenName Elena
86 rdf:type schema:Person
87 N1c0880a26c3044a2b909c38abfa0e9e6 schema:familyName Flöck
88 schema:givenName Fabian
89 rdf:type schema:Person
90 N22bdf77d102f404ebd8edb4d9c995039 schema:familyName Groth
91 schema:givenName Paul
92 rdf:type schema:Person
93 N248e131f84a24d0eb68b77d437ea4785 rdf:first sg:person.013100561643.19
94 rdf:rest rdf:nil
95 N288eb97a9c164e779c35b676d6ca618d rdf:first N5eaa67710b1e4ee4914d567c77ec85d6
96 rdf:rest N4f41626998594e2697c4ba458d3502cd
97 N299b09d0444540c2924e6a5ad21b3ccd schema:familyName Lecue
98 schema:givenName Freddy
99 rdf:type schema:Person
100 N4230ef10e1c74509b644b02dd889e1ff schema:isbn 978-3-319-46522-7
101 978-3-319-46523-4
102 schema:name The Semantic Web – ISWC 2016
103 rdf:type schema:Book
104 N48f7e819b83f413d847690f4474ef156 schema:familyName Krötzsch
105 schema:givenName Markus
106 rdf:type schema:Person
107 N4d40c91029554bffaac9e4087de6ebcf rdf:first sg:person.010530051131.00
108 rdf:rest N248e131f84a24d0eb68b77d437ea4785
109 N4f41626998594e2697c4ba458d3502cd rdf:first N76e433b270fe4d408dda316167477183
110 rdf:rest Nf03e96b2b9ce417d9f7f78debdc73423
111 N5b62bfca04f342aebc7bd056ce450bfa rdf:first N091322c8f17944cbb08cd118fd01fba7
112 rdf:rest rdf:nil
113 N5eaa67710b1e4ee4914d567c77ec85d6 schema:familyName Gray
114 schema:givenName Alasdair
115 rdf:type schema:Person
116 N6992c80b8dec442a8a02ffc88c8f3d8b schema:name dimensions_id
117 schema:value pub.1084848381
118 rdf:type schema:PropertyValue
119 N76e433b270fe4d408dda316167477183 schema:familyName Sabou
120 schema:givenName Marta
121 rdf:type schema:Person
122 N792bc7e023c047d9a99c053375138cbe rdf:first N22bdf77d102f404ebd8edb4d9c995039
123 rdf:rest Neb76f2396af94bea8466b5351ee32d08
124 N800cb4a429444975ad0cb33c4bf4d0ed schema:name Springer Nature - SN SciGraph project
125 rdf:type schema:Organization
126 N81868ba9bf3148639660220f068f0847 schema:name Springer Nature
127 rdf:type schema:Organisation
128 N8279af4aa3744b71ab60248fb9064082 rdf:first sg:person.014264077713.58
129 rdf:rest Nbc86327c85354ffa9f546b7be76fe8cd
130 N85760713b0d2423aa95bc5eb41792dbd rdf:first N299b09d0444540c2924e6a5ad21b3ccd
131 rdf:rest Nb099cfc9a81644ebbe295b30b9ec68a5
132 Nb099cfc9a81644ebbe295b30b9ec68a5 rdf:first N1c0880a26c3044a2b909c38abfa0e9e6
133 rdf:rest N5b62bfca04f342aebc7bd056ce450bfa
134 Nbc86327c85354ffa9f546b7be76fe8cd rdf:first sg:person.07401076267.36
135 rdf:rest N4d40c91029554bffaac9e4087de6ebcf
136 Ndf76d099afdf4176935df2f539573a35 schema:name doi
137 schema:value 10.1007/978-3-319-46523-4_29
138 rdf:type schema:PropertyValue
139 Neb76f2396af94bea8466b5351ee32d08 rdf:first N0d9eb07edc1c4c2bb942c710b4faf47d
140 rdf:rest N288eb97a9c164e779c35b676d6ca618d
141 Nf03e96b2b9ce417d9f7f78debdc73423 rdf:first N48f7e819b83f413d847690f4474ef156
142 rdf:rest N85760713b0d2423aa95bc5eb41792dbd
143 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
144 schema:name Information and Computing Sciences
145 rdf:type schema:DefinedTerm
146 anzsrc-for:0806 schema:inDefinedTermSet anzsrc-for:
147 schema:name Information Systems
148 rdf:type schema:DefinedTerm
149 sg:person.010530051131.00 schema:affiliation grid-institutes:grid.4991.5
150 schema:familyName Nenov
151 schema:givenName Yavor
152 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010530051131.00
153 rdf:type schema:Person
154 sg:person.013100561643.19 schema:affiliation grid-institutes:grid.4991.5
155 schema:familyName Horrocks
156 schema:givenName Ian
157 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013100561643.19
158 rdf:type schema:Person
159 sg:person.014264077713.58 schema:affiliation grid-institutes:grid.4991.5
160 schema:familyName Potter
161 schema:givenName Anthony
162 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014264077713.58
163 rdf:type schema:Person
164 sg:person.07401076267.36 schema:affiliation grid-institutes:grid.4991.5
165 schema:familyName Motik
166 schema:givenName Boris
167 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07401076267.36
168 rdf:type schema:Person
169 grid-institutes:grid.4991.5 schema:alternateName University of Oxford, Oxford, UK
170 schema:name University of Oxford, Oxford, UK
171 rdf:type schema:Organization
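
In the table above, `schema:author` points at a blank node that heads an RDF list, so the author order is encoded by the `rdf:first` / `rdf:rest` chain rather than by row order. The following sketch walks that chain in plain Python. The node identifiers and family names are copied from the table, while `walk_rdf_list` and the dictionary layout are assumptions made for illustration.

```python
RDF_NIL = "rdf:nil"

# node -> (rdf:first, rdf:rest), copied from the author list rows in the table.
LIST_NODES = {
    "N8279af4aa3744b71ab60248fb9064082": (
        "sg:person.014264077713.58", "Nbc86327c85354ffa9f546b7be76fe8cd"),
    "Nbc86327c85354ffa9f546b7be76fe8cd": (
        "sg:person.07401076267.36", "N4d40c91029554bffaac9e4087de6ebcf"),
    "N4d40c91029554bffaac9e4087de6ebcf": (
        "sg:person.010530051131.00", "N248e131f84a24d0eb68b77d437ea4785"),
    "N248e131f84a24d0eb68b77d437ea4785": (
        "sg:person.013100561643.19", RDF_NIL),
}

# schema:familyName values for the four person nodes, also from the table.
FAMILY_NAMES = {
    "sg:person.014264077713.58": "Potter",
    "sg:person.07401076267.36": "Motik",
    "sg:person.010530051131.00": "Nenov",
    "sg:person.013100561643.19": "Horrocks",
}


def walk_rdf_list(head, nodes):
    """Follow rdf:rest links from head until rdf:nil, collecting rdf:first values."""
    items = []
    seen = set()
    node = head
    while node != RDF_NIL:
        if node in seen:  # guard against a malformed, cyclic list
            raise ValueError(f"cycle detected at {node}")
        seen.add(node)
        first, rest = nodes[node]
        items.append(first)
        node = rest
    return items


author_ids = walk_rdf_list("N8279af4aa3744b71ab60248fb9064082", LIST_NODES)
authors = [FAMILY_NAMES[person] for person in author_ids]
print(authors)
```

The editor list headed by the `schema:editor` blank node can be traversed the same way.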
 



