TY - JOUR
T1 - A particle filter algorithm for Bayesian wordsegmentation
AU - Borschinger, Benjamin
AU - Johnson, Mark
N1 - Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.
PY - 2011
Y1 - 2011
N2 - Bayesian models are usually learned using batch algorithms that have to iterate multiple times over the full dataset. This is both computationally expensive and, from a cognitive point of view, highly implausible. We present a novel online algorithm for the word segmentation models of Goldwater et al. (2009) which
is, to our knowledge, the first published version of a Particle Filter for this kind of model. Also, in contrast to other proposed algorithms, it comes with a theoretical guarantee of optimality if the number of particles goes to infinity.
While this is, of course, a theoretical point, a first experimental evaluation of our algorithm shows that, as predicted, its performance improves with the use of more particles, and that it performs competitively with other online
learners proposed in Pearl et al. (2011).
AB - Bayesian models are usually learned using batch algorithms that have to iterate multiple times over the full dataset. This is both computationally expensive and, from a cognitive point of view, highly implausible. We present a novel online algorithm for the word segmentation models of Goldwater et al. (2009) which
is, to our knowledge, the first published version of a Particle Filter for this kind of model. Also, in contrast to other proposed algorithms, it comes with a theoretical guarantee of optimality if the number of particles goes to infinity.
While this is, of course, a theoretical point, a first experimental evaluation of our algorithm shows that, as predicted, its performance improves with the use of more particles, and that it performs competitively with other online
learners proposed in Pearl et al. (2011).
M3 - Conference paper
SN - 1834-7037
SP - 10
EP - 18
JO - Proceedings of the Australasian Language Technology Association Workshop 2011
JF - Proceedings of the Australasian Language Technology Association Workshop 2011
T2 - Australasian Language Technology Workshop (9th : 2011)
Y2 - 1 December 2011 through 2 December 2011
ER -