SVD and clustering for unsupervised POS tagging

Michael Lamar, Yariv Maron, Mark Johnson, Elie Bienenstock

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionResearchpeer-review

Abstract

We revisit the algorithm of Schutze (1995) for unsupervised part-of-speech tagging. The algorithm uses reduced-rank singular value decomposition followed by clustering to extract latent features from context distributions. As implemented here, it achieves state-of-the-art tagging accuracy at considerably less cost than more recent methods. It can also produce a range of finer-grained taggings, with potential applications to various tasks.

LanguageEnglish
Title of host publicationACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
Place of PublicationStroudsburg, PA
PublisherAssociation for Computational Linguistics (ACL)
Pages215-219
Number of pages5
ISBN (Print)9781617388088
Publication statusPublished - 2010
Event48th Annual Meeting of the Association for Computational Linguistics, ACL 2010 - Uppsala, Sweden
Duration: 11 Jul 201016 Jul 2010

Other

Other48th Annual Meeting of the Association for Computational Linguistics, ACL 2010
CountrySweden
CityUppsala
Period11/07/1016/07/10

Fingerprint

costs
Values
Tagging
Part-of-speech Tagging
Costs
Singular Value Decomposition

Cite this

Lamar, M., Maron, Y., Johnson, M., & Bienenstock, E. (2010). SVD and clustering for unsupervised POS tagging. In ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp. 215-219). Stroudsburg, PA: Association for Computational Linguistics (ACL).
Lamar, Michael ; Maron, Yariv ; Johnson, Mark ; Bienenstock, Elie. / SVD and clustering for unsupervised POS tagging. ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. Stroudsburg, PA : Association for Computational Linguistics (ACL), 2010. pp. 215-219
@inproceedings{49a57b8c8ccf44be931dff168a9a9b4e,
title = "SVD and clustering for unsupervised POS tagging",
abstract = "We revisit the algorithm of Schutze (1995) for unsupervised part-of-speech tagging. The algorithm uses reduced-rank singular value decomposition followed by clustering to extract latent features from context distributions. As implemented here, it achieves state-of-the-art tagging accuracy at considerably less cost than more recent methods. It can also produce a range of finer-grained taggings, with potential applications to various tasks.",
author = "Michael Lamar and Yariv Maron and Mark Johnson and Elie Bienenstock",
year = "2010",
language = "English",
isbn = "9781617388088",
pages = "215--219",
booktitle = "ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference",
publisher = "Association for Computational Linguistics (ACL)",

}

Lamar, M, Maron, Y, Johnson, M & Bienenstock, E 2010, SVD and clustering for unsupervised POS tagging. in ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. Association for Computational Linguistics (ACL), Stroudsburg, PA, pp. 215-219, 48th Annual Meeting of the Association for Computational Linguistics, ACL 2010, Uppsala, Sweden, 11/07/10.

SVD and clustering for unsupervised POS tagging. / Lamar, Michael; Maron, Yariv; Johnson, Mark; Bienenstock, Elie.

ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. Stroudsburg, PA : Association for Computational Linguistics (ACL), 2010. p. 215-219.

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionResearchpeer-review

TY - GEN

T1 - SVD and clustering for unsupervised POS tagging

AU - Lamar, Michael

AU - Maron, Yariv

AU - Johnson, Mark

AU - Bienenstock, Elie

PY - 2010

Y1 - 2010

N2 - We revisit the algorithm of Schutze (1995) for unsupervised part-of-speech tagging. The algorithm uses reduced-rank singular value decomposition followed by clustering to extract latent features from context distributions. As implemented here, it achieves state-of-the-art tagging accuracy at considerably less cost than more recent methods. It can also produce a range of finer-grained taggings, with potential applications to various tasks.

AB - We revisit the algorithm of Schutze (1995) for unsupervised part-of-speech tagging. The algorithm uses reduced-rank singular value decomposition followed by clustering to extract latent features from context distributions. As implemented here, it achieves state-of-the-art tagging accuracy at considerably less cost than more recent methods. It can also produce a range of finer-grained taggings, with potential applications to various tasks.

UR - http://www.scopus.com/inward/record.url?scp=84859978666&partnerID=8YFLogxK

M3 - Conference proceeding contribution

SN - 9781617388088

SP - 215

EP - 219

BT - ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference

PB - Association for Computational Linguistics (ACL)

CY - Stroudsburg, PA

ER -

Lamar M, Maron Y, Johnson M, Bienenstock E. SVD and clustering for unsupervised POS tagging. In ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. Stroudsburg, PA: Association for Computational Linguistics (ACL). 2010. p. 215-219