Abstract
Semantic parsing requires training data that is expensive and slow to collect. We apply active learning to both traditional and “overnight” data collection approaches. We show that it is possible to obtain good training hyperparameters from seed data which is only a small fraction of the full dataset. We show that uncertainty sampling based on least confidence score is competitive in traditional data collection but not applicable for overnight collection. We evaluate several active learning strategies for overnight data collection and show that different example selection strategies per domain perform best.
Original language | English |
---|---|
Title of host publication | ACL 2018 The 56th Annual Meeting of the Association for Computational Linguistics |
Subtitle of host publication | Proceedings of the Conference, Vol. 2 (Short Papers) |
Editors | Iryna Gurevych, Yusuke Miyao |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 43-48 |
Number of pages | 6 |
Volume | 2 |
ISBN (Electronic) | 9781948087346 |
DOIs | |
Publication status | Published - 1 Jan 2018 |
Externally published | Yes |
Event | 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018 - Melbourne, Australia Duration: 15 Jul 2018 → 20 Jul 2018 |
Conference
Conference | 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018 |
---|---|
Country/Territory | Australia |
City | Melbourne |
Period | 15/07/18 → 20/07/18 |