A multi-phase correlation search framework for mining non-taxonomic relations from unstructured text

Mei Kuan Wong, Syed Sibte Raza Abidi, Ian D. Jonsen

Research output: Contribution to journal › Article › peer-review

15 Citations (Scopus)

Abstract

Over the last decade, ontology engineering has been pursued by "learning" the ontology from domain-specific electronic documents. Most research has focused on the extraction of concepts and taxonomic relations, while the extraction of non-taxonomic relations is often neglected and not well researched. In this paper, we present a multi-phase correlation search framework to extract non-taxonomic relations from unstructured text. Our framework addresses the two main problems in any non-taxonomic relation extraction: (a) the discovery of non-taxonomic relations and (b) the labelling of non-taxonomic relations. First, our framework is capable of extracting correlated concepts beyond the ordinary search window of a single sentence. Interesting correlations are then filtered using association rule mining with the lift interestingness measure. Next, our framework distinguishes non-taxonomic concept pairs from taxonomic concept pairs based on an existing domain ontology. Finally, our framework uses domain-related verbs as labels for the non-taxonomic relations. Our proposed framework has been tested in the marine biology domain. The results have been validated by domain experts, who found them reliable, and they demonstrate a significant improvement over the traditional association rule approach in the search for non-taxonomic relations from unstructured text.
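
The lift-based filtering step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's exact formulation, so the code below assumes the standard definition of lift, lift(A, B) = P(A and B) / (P(A) * P(B)), estimated over search windows. The function names, the threshold `min_lift`, and the toy windows are hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations


def lift_scores(windows):
    """Return the lift of every unordered concept pair that shares a window.

    Probabilities are estimated as the fraction of windows that contain
    the concept (or both concepts of a pair).
    """
    n = len(windows)
    single = Counter()
    pair = Counter()
    for window in windows:
        concepts = sorted(set(window))  # de-duplicate, fix pair order
        single.update(concepts)
        pair.update(combinations(concepts, 2))
    return {
        (a, b): (count / n) / ((single[a] / n) * (single[b] / n))
        for (a, b), count in pair.items()
    }


def interesting_pairs(windows, min_lift=1.0):
    """Keep pairs whose lift exceeds min_lift (an assumed threshold)."""
    return {p: s for p, s in lift_scores(windows).items() if s > min_lift}


# Toy data invented for illustration: each list holds the concepts found
# in one search window, which may span more than one sentence.
windows = [
    ["seal", "fish"],
    ["seal", "fish"],
    ["seal", "krill"],
    ["whale", "krill"],
    ["whale", "krill"],
    ["whale", "fish"],
]

for concept_pair, score in sorted(interesting_pairs(windows).items()):
    print(concept_pair, round(score, 3))
```

A lift above 1 indicates that two concepts co-occur more often than independence would predict, which is why such pairs are kept as candidate relations in this sketch. Separating taxonomic from non-taxonomic pairs and labelling them with verbs would be later steps and are not shown here.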

Original language: English
Pages (from-to): 641-667
Number of pages: 27
Journal: Knowledge and Information Systems
Volume: 38
Issue number: 3
Publication status: Published - Mar 2014
Externally published: Yes
