Galaxy: a platform for explorative analysis of Open Data sources

Seyed Mehdi Reza Beheshti, Boualem Benatallah, Hamid Reza Motahari-Nezhad

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

15 Citations (Scopus)
35 Downloads (Pure)

Abstract

A large volume of Open Data is being generated on a continuous basis. Examples of this are the case of social, natural, and information systems such as World Wide Web and social networks. Most entities and objects in the Open Data are interconnected, forming a complex, semi-structured, and information-rich networks. In this sense, Linked Open Data has the potential to be similar to a federated database. Since Linked Open Data is based on W3C standards, it is possible to implement a federation infrastructure, however, the current SPARQL standard makes it challenging to analyze the Open Data in an explorative manner. Consequently, it will be hard to discover the hidden knowledge in the relationships among entities in Open Data sources. In this paper, we present Galaxy, a platform for explorative analysis of Open Data Sources. Galaxy facilitates the analysis of Open Data graphs based on simple abstractions, i.e. folders and paths, which enable an analyst to group related entities in the graph or find paths among entities. Galaxy uses Hadoop data processing platforms to store and retrieve large numbers of RDF triples and to support cost-effective and Web-scale processing of Semantic Web data through a Folder-Path enabled extension of SPARQL.

Original languageEnglish
Title of host publicationAdvances in Database Technology - EDBT 2016
Subtitle of host publication19th International Conference on Extending Database Technology, Proceedings
EditorsEvaggelia Pitoura, Sofian Maabout, Georgia Koutrika, Amelie Marian, Letizia Tanca, Ioana Manolescu, Kostas Stefanidis
PublisherOpenProceedings.org, University of Konstanz, University Library
Pages640-643
Number of pages4
Volume2016-March
ISBN (Electronic)9783893180707
DOIs
Publication statusPublished - 1 Jan 2016
Externally publishedYes
Event19th International Conference on Extending Database Technology, EDBT 2016 - Bordeaux, France
Duration: 15 Mar 201618 Mar 2016

Conference

Conference19th International Conference on Extending Database Technology, EDBT 2016
Country/TerritoryFrance
CityBordeaux
Period15/03/1618/03/16

Bibliographical note

Copyright the Author(s) 2016. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

Keywords

  • Linked data
  • Open data analytics
  • Querying graphs

Fingerprint

Dive into the research topics of 'Galaxy: a platform for explorative analysis of Open Data sources'. Together they form a unique fingerprint.

Cite this