Implementation of linked data in the life sciences at BioHackathon 2011

Kiyoko F. Aoki-Kinoshita, Akira R. Kinjo, Mizuki Morita, Yoshinobu Igarashi, Yi an Chen, Yasumasa Shigemoto, Takatomo Fujisawa, Yukie Akune, Takeo Katoda, Anna Kokubu, Takaaki Mori, Mitsuteru Nakao, Shuichi Kawashima, Shinobu Okamoto, Toshiaki Katayama, Soichi Ogishima*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

12 Citations (Scopus)
14 Downloads (Pure)


Background: Linked Data has gained some attention recently in the life sciences as an effective way to provide and share data. As a part of the Semantic Web, data are linked so that a person or machine can explore the web of data. Resource Description Framework (RDF) is the standard means of implementing Linked Data. In the process of generating RDF data, not only are data simply linked to one another, the links themselves are characterized by ontologies, thereby allowing the types of links to be distinguished. Although there is a high labor cost to define an ontology for data providers, the merit lies in the higher level of interoperability with data analysis and visualization software. This increase in interoperability facilitates the multi-faceted retrieval of data, and the appropriate data can be quickly extracted and visualized. Such retrieval is usually performed using the SPARQL (SPARQL Protocol and RDF Query Language) query language, which is used to query RDF data stores. For the database provider, such interoperability will surely lead to an increase in the number of users.

Results: This manuscript describes the experiences and discussions shared among participants of the week-long BioHackathon 2011 who went through the development of RDF representations of their own data and developed specific RDF and SPARQL use cases. Advice regarding considerations to take when developing RDF representations of their data are provided for bioinformaticians considering making data available and interoperable.

Conclusions: Participants of the BioHackathon 2011 were able to produce RDF representations of their data and gain a better understanding of the requirements for producing such data in a period of just five days. We summarize the work accomplished with the hope that it will be useful for researchers involved in developing laboratory databases or data analysis, and those who are considering such technologies as RDF and Linked Data.

Original languageEnglish
Article number3
Pages (from-to)1-13
Number of pages13
JournalJournal of Biomedical Semantics
Issue number1
Publication statusPublished - 7 Jan 2015
Externally publishedYes

Bibliographical note

Copyright the Author(s) 2014. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.


  • Alzheimer's disease
  • Data integration
  • DDBJ
  • Faceted search interface
  • Glycobiology
  • PDBj
  • Semantic Web


Dive into the research topics of 'Implementation of linked data in the life sciences at BioHackathon 2011'. Together they form a unique fingerprint.

Cite this