Automatic construction of a concept hierarchy to assist web document classification

Woo-Chul Cho, Debbie Richards

Research output: Chapter in Book/Report/Conference proceeding; Conference proceeding contribution; peer-review

Abstract

In this paper, we present a new technique for Web document classification, the Admixture MCRDR-FCA (AMF) algorithm. The technique offers a practical approach to new Web document classification by combining and extending a number of current techniques. The AMF algorithm has three noteworthy features. First, it provides a structured conceptual correlation between keywords. Second, it is optimised. Third, it creates multiple refined new rules in order to achieve higher accuracy in its document classification conclusions. This is achieved by clarifying the relationships between concepts before assigning a final classification to a category. To evaluate the AMF algorithm, we have developed a demonstration system that permits easy comparison with a number of other classification techniques.
Original language: English
Title of host publication: Full Proceedings of the 2nd International Conference on Information Management and Business (IMB 2006)
Editors: Bhuvan Unhelkar, Yi-Chen Lan
Place of Publication: Sydney
Publisher: University of Western Sydney
Pages: 562-573
Number of pages: 12
ISBN (Print): 174108122X
Publication status: Published - 2006
Event: International Conference on Information Management and Business (2nd : 2006) - Sydney
Duration: 13 Feb 2006 to 16 Feb 2006

Conference

Conference: International Conference on Information Management and Business (2nd : 2006)
City: Sydney
Period: 13/02/06 to 16/02/06

Keywords

  • ontologies
  • knowledge acquisition
  • web document classification
  • multiple classification ripple down rules
  • formal concept analysis
