Abstract
A growing number of Linked Open Data sources (from diverse provenances and about different domains) that can be freely browsed and searched to find and extract useful information have been made available. However, access to them is difficult for different reasons. This study addresses access issues concerning heterogeneity. It is common for datasets to describe the same or overlapping domains while using different vocabularies. Our study presents a transducer that transforms a SPARQL query suitably expressed in terms of the vocabularies used in a source dataset into another SPARQL query suitably expressed for a target dataset involving different vocabularies. The transformation is based on existing alignments between terms in different datasets. Whenever the transducer is unable to produce a semantically equivalent query because of the scarcity of term alignments, the transducer produces a semantic approximation of the query to avoid returning the empty answer to the user. Transformation across datasets is achieved through the management of a wide range of transformation rules. The feasibility of our proposal has been validated with a prototype implementation that processes queries that appear in well-known benchmarks and SPARQL endpoint logs. Results of the experiments show that the system is quite effective in achieving adequate transformations.
- Silvana Castano, Alfio Ferrara, Stefano Montanelli, and Gaia Varese. 2011. Ontology and instance matching. In Knowledge-Driven Multimedia Information Extraction and Ontology Evolution. Springer, 167--195. Google Scholar
Digital Library
- Gianluca Correndo, Manuel Salvadores, Ian Millard, Hugh Glaser, and Nigel Shadbolt. 2010. SPARQL query rewriting for implementing data integration over linked data. In Proceedings of the 2010 EDBT/ICDT Workshops (EDBT’10). ACM, New York, Article 4, 11 pages. Google Scholar
Digital Library
- Richard Cyganiak, David Wood, and Markus Lanthaler. 2014. RDF 1.1 Concepts and Abstract Sysntax. World Wide Web Consortium. W3C Recommendation February 25, 2014.Google Scholar
- Jérôme David, Jérôme Euzenat, François Scharffe, and Cássia Trojahn dos Santos. 2011. The alignment API 4.0. Semantic Web 2, 1 (2011), 3--10. Google Scholar
Digital Library
- Gonzalo Diaz, Marcelo Arenas, and Michael Benedikt. 2016. Sparqlbye: Querying RDF data by example. Proceedings of the VLDB Endowment 9, 13 (2016), 1533--1536. Google Scholar
Digital Library
- Dennis Diefenbach, Vanessa Lopez, Kamal Singh, and Pierre Maret. 2018. Core techniques of question answering systems over knowledge bases: A survey. Knowledge and Information Systems 55, 3 (2018), 529--569. Google Scholar
Digital Library
- Peter Dolog, Heiner Stuckenschmidt, and Holger Wache. 2006. Robust query processing for personalized information access on the semantic web. In Proceedings of the 7th International Conference on Flexible Query Answering Systems. Springer-Verlag, 343--355. Google Scholar
Digital Library
- Jérôme Euzenat and Pavel Shvaiko. 2013. Ontology Matching (2nd ed.). Springer. Google Scholar
Digital Library
- Riccardo Frosini, Andrea Calì, Alexandra Poulovassilis, and Peter T. Wood. 2017. Flexible query processing for SPARQL. Semantic Web 8, 4 (2017), 533--563.Google Scholar
Digital Library
- Takahisa Fujino and Naoki Fukuta. 2014. Utilizing weighted ontology mappings on federated SPARQL querying. In Semantic Technology, Wooju Kim, Ying Ding, and Hong-Gee Kim (Eds.). Springer International Publishing, 331--347. Google Scholar
Digital Library
- Pascal Gillet, Cassia Trojahn, Ollivier Haemmerlé, and Camille Pradel. 2013. Complex correspondences for query patterns rewriting. In Proceedings of the 8th International Conference on Ontology Matching-Volume 1111. CEUR-WS. org, 49--60. Google Scholar
Digital Library
- Hugh Glaser, Afraz Jaffri, and Ian Millard. 2009. Managing co-reference on the semantic web. In WWW2009 Workshop: Linked Data on the Web (LDOW’09). Retrieved from http://eprints.soton.ac.uk/267587/.Google Scholar
- Olaf Görlitz, Matthias Thimm, and Steffen Staab. 2012. Splodge: Systematic generation of SPARQL benchmark queries for linked open data. In International Semantic Web Conference. Springer, 116--132. Google Scholar
Digital Library
- Olaf Hartig. 2011. Zero-knowledge query planning for an iterator implementation of link traversal based query execution. In Proceedings of the 8th Extended Semantic Web Conference on the Semantic Web: Research and Applications-Part I. Springer-Verlag, 154--169. Google Scholar
Digital Library
- Bernhard Haslhofer, Flávio Martins, and João Magalhães. 2013. Using SKOS vocabularies for improving web search. In Proceedings of the 22nd International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee, 1253--1258. Google Scholar
Digital Library
- Konrad Höffner, Sebastian Walter, Edgard Marx, Ricardo Usbeck, Jens Lehmann, and Axel-Cyrille Ngonga Ngomo. 2017. Survey on challenges of question answering in the semantic web. Semantic Web 8, 6 (2017), 895--920.Google Scholar
Cross Ref
- Günter Ladwig and Thanh Tran. 2010. Linked data query processing strategies. In Proceedings of the 9th International Semantic Web Conference on the Semantic Web-Part I. Springer-Verlag, 453--469. Google Scholar
Digital Library
- Jens Lehmann and Lorenz Bühmann. 2011. AutoSPARQL: Let users query your knowledge base. In Proceedings of the 8th Extended Semantic Web Conference on the Semantic Web: Research and Applications-Part I. 63--79. Google Scholar
Digital Library
- Vanessa Lopez, Miriam Fernández, Enrico Motta, and Nico Stieler. 2012. PowerAqua: Supporting users in querying and exploring the semantic web. Semantic Web 3, 3 (2012), 249--265. Google Scholar
Digital Library
- Konstantinos Makris, Nikos Bikakis, Nektarios Gioldasis, and Stavros Christodoulakis. 2012. SPARQL-RW: Transparent query access over mapped RDF data sources. In Proceedings of the 15th International Conference on Extending Database Technology. ACM, 610--613. Google Scholar
Digital Library
- Mohamed Morsey, Jens Lehmann, Sören Auer, and Axel-Cyrille Ngonga Ngomo. 2011. DBpedia SPARQL benchmark--Performance assessment with real queries on real data. In International Semantic Web Conference. Springer, 454--469. Google Scholar
Digital Library
- Bastian Quilitz and Ulf Leser. 2008. Querying distributed RDF data sources with SPARQL. In Proceedings of the 5th European Semantic Web Conference on the Semantic Web: Research and Applications. Springer-Verlag, 524--538. Google Scholar
Digital Library
- Kuldeep B. R. Reddy and P. Sreenivasa Kumar. 2013. Efficient trust-based approximate SPARQL querying of the web of linked data. In Uncertainty Reasoning for the Semantic Web II. Springer, 315--330.Google Scholar
- Philip Resnik. 1995. Using information content to evaluate semantic similarity in a taxonomy. In Proceedings of the 14th International Joint Conference on Artificial Intelligence - Volume 1 (IJCAI’95). Morgan Kaufmann Publishers, San Francisco, 448--453. Retrieved from http://dl.acm.org/citation.cfm?id=1625855.1625914. Google Scholar
Digital Library
- Marta Sabou, Mathieu d’Aquin, and Enrico Motta. 2008. SCARLET: Semantic relation discovery by harvesting online ontologies. In Proceedings of the 5th European Semantic Web Conference on the Semantic Web: Research and Applications. Springer-Verlag, 854--858. Google Scholar
Digital Library
- Kai Schlegel, Florian Stegmaier, Sebastian Bayerl, Michael Granitzer, and Harald Kosch. 2014. Balloon fusion: SPARQL rewriting based on unified co-reference information. In 2014 IEEE 30th International Conference on Data Engineering Workshops (ICDEW’14). IEEE, 254--259.Google Scholar
Cross Ref
- Michael Schmidt, Olaf Görlitz, Peter Haase, Günter Ladwig, Andreas Schwarte, and Thanh Tran. 2011. FedBench: A benchmark suite for federated semantic data query processing. In International Semantic Web Conference, 2011 (ISWC’11). Springer, Berlin, Heidelberg, 585--600. Google Scholar
Digital Library
- Michael Schmidt, Thomas Hornung, Georg Lausen, and Christoph Pinkel. 2009. SP 2Bench: A SPARQL performance benchmark. In IEEE 25th International Conference on Data Engineering, 2009 (ICDE’09). IEEE, 222--233. Google Scholar
Digital Library
- Andreas Schwarte, Peter Haase, Katja Hose, Ralf Schenkel, and Michael Schmidt. 2011. FedX: Optimization techniques for federated query processing on linked data. In International Semantic Web Conference, 2011 (ISWC'11). Springer, Berlin, Heidelberg, 601--616. Google Scholar
Digital Library
- Oshani Seneviratne and Rachel Sealfon. 2010. QueryMed: An intuitive federated SPARQL query builder for biomedical RDF data. Retrieved from http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.389.4282.Google Scholar
- Saeedeh Shekarpour, Edgard Marx, Axel-Cyrille Ngonga Ngomo, and SÃren Auer. 2015. SINA: Semantic interpretation of user queries for question answering on interlinked data. Web Semantics: Science, Services and Agents on the World Wide Web 30 (2015), 39--51.Google Scholar
Digital Library
- W3C SPARQL Working Group. 2013. SPARQL 1.1 Overview. Retrieved from https://www.w3.org/TR/sparql11-overview/. W3C Recommendation March 21, 2013.Google Scholar
- W3C SPARQL Working Group. 2013. SPARQL 1.1 Query Language. Retrieved from https://www.w3.org/TR/2013/REC-sparql11-query-20130321/. W3C Recommendation March 21, 2013.Google Scholar
- Yuan Tian, Jürgen Umbrich, and Yong Yu. 2011. Enhancing source selection for live queries over linked data via query log mining. In Proceedings of the 2011 Joint International Conference on the Semantic Web. Springer-Verlag, 176--191. Google Scholar
Digital Library
- Raquel Trillo, Jorge Gracia, Mauricio Espinoza, and Eduardo Mena. 2007. Discovering the semantics of user keywords. Journal of Universal Computer Science 13, 12 (2007), 1908--1935.Google Scholar
- Zhibiao Wu and Martha Palmer. 1994. Verbs semantics and lexical selection. In Proceedings of the 32nd Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics, 133--138. Google Scholar
Digital Library
- Ying Zhang, Pham Minh Duc, Oscar Corcho, and Jean-Paul Calbimonte. 2012. SRBench: A streaming RDF/SPARQL benchmark. In International Semantic Web Conference. Springer, 641--657. Google Scholar
Digital Library
- Ziqi Zhang, Anna Lisa Gentile, Eva Blomqvist, Isabelle Augenstein, and Fabio Ciravegna. 2017. An unsupervised data-driven method to discover equivalent relations in large linked datasets. Semantic Web 8, 2 (2017), 197--223.Google Scholar
Cross Ref
Index Terms
A Rule-Based Transducer for Querying Incompletely Aligned Datasets
Recommendations
RDF, Jena, SparQL and the 'Semantic Web'
SIGUCCS '09: Proceedings of the 37th annual ACM SIGUCCS fall conference: communication and collaborationThe Resource Description Format (RDF) is used to represent information modeled as a "graph": a set of individual objects, along with a set of connections among those objects. In that role, RDF is one of the pillars of the so-called Semantic Web. This ...
Expressive Languages for Querying the Semantic Web
Best of PODS 2017, Best of ICDT 2017 and Regular PapersThe problem of querying RDF data is a central issue for the development of the Semantic Web. The query language SPARQL has become the standard language for querying RDF since its W3C standardization in 2008. However, the 2008 version of this language ...
A System for Querying RDF Data Using LINQ
NBIS '15: Proceedings of the 2015 18th International Conference on Network-Based Information SystemsLinked Open Data (LOD) is a principle to publish machine readable data on the Web, and has been gaining growing attention owing to the recent upsurge of open data movement. For this reason, there are strong demands to develop applications that utilize ...






Comments