A hybrid recommendation approach for open research datasets

Anusuriya Devaraju, Shlomo Berkovsky

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

2 Citations (Scopus)


Open data initiatives and policies have triggered a dramatic increase in the volume of available research data. This, in turn, has brought to the fore the challenge of helping users to discover relevant datasets. Research data repositories support data search primarily through keyword search and faceted navigation. However, these mechanisms may suit users, who are familiar with the structure and terminology of the repository. This raises the problem of personalized dataset recommendations for users unfamiliar with the repository or not able to clearly articulate their information needs. To this end, we present and evaluate in this paper a recommendation approach applied to a new task - recommending research datasets. Our approach hybridizes content-based similarity with item-to-item co-occurrence, tuned to a feature weighting model obtained through a survey involving real users. We applied the approach in the context of a live research data repository and evaluated it in a user study. The obtained user judgments reveal the ability of the proposed approach to accurately quantify the relevance of datasets and they constitute an important step towards developing a practical dataset recommender.
Original languageEnglish
Title of host publicationUMAP'18, Proceedings of the 26th Conference on User Modeling, Adaptation and Personalization
PublisherAssociation for Computing Machinery (ACM)
Number of pages5
ISBN (Electronic)9781450355896
Publication statusPublished - 3 Jul 2018
Externally publishedYes
Event26th International Conference on User Modeling, Adaptation, and Personalization, UMAP 2018 - Singapore, Singapore
Duration: 8 Jul 201811 Jul 2018


Conference26th International Conference on User Modeling, Adaptation, and Personalization, UMAP 2018


  • Recommender system
  • content-based filtering
  • item-to-item similarity
  • open research data
  • user judgment
  • digital library


Dive into the research topics of 'A hybrid recommendation approach for open research datasets'. Together they form a unique fingerprint.

Cite this