Experiments with clustering-based features for sentence classification in medical publications: Macquarie Test's participation in the ALTA 2012 shared task

Research output: Contribution to journalConference paperpeer-review

31 Downloads (Pure)

Abstract

In our contribution to the ALTA 2012 shared task we experimented with the use of cluster-based features for sentence classification. In a first stage we cluster the documents according to the distribution of sentence labels. We then use this information as a feature in standard classifiers. We observed that the cluster-based feature improved the results for Naive-Bayes classifiers but not for better-informed classifiers such as MaxEnt or Logistic Regression.
Original languageEnglish
Pages (from-to)139-142
Number of pages4
JournalProceedings of the Australasian Language Technology Association Workshop 2012 : ALTA 2012
Volume10
Publication statusPublished - 2012
EventAustralasian Language Technology Workshop (10th : 2012) - Dunedin, New Zealand
Duration: 4 Dec 20126 Dec 2012

Bibliographical note

Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

Fingerprint Dive into the research topics of 'Experiments with clustering-based features for sentence classification in medical publications: Macquarie Test's participation in the ALTA 2012 shared task'. Together they form a unique fingerprint.

Cite this