Abstract
We present a study of cross-lingual direct transfer parsing for the Irish language. First, we discuss the mapping of the annotation scheme of the Irish Dependency Treebank to a universal dependency scheme, explaining our dependency label mapping choices and the structural changes required in the Irish Dependency Treebank. We then experiment with the universally annotated treebanks of ten languages from four language family groups to assess which languages are the most useful for cross-lingual parsing of Irish. Each treebank is used to train a delexicalised parsing model, which is then applied to sentences from the Irish Dependency Treebank. The best results are achieved when using Indonesian, a language from the Austronesian language family.
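To illustrate what "delexicalised" means in the abstract, the sketch below strips word forms and lemmas from a dependency-annotated sentence so that a parser trained on it can rely only on part-of-speech tags and tree structure. It assumes the treebanks are stored in the tab-separated, 10-column CoNLL-X layout (ID, FORM, LEMMA, CPOSTAG, POSTAG, FEATS, HEAD, DEPREL, PHEAD, PDEPREL). The function name `delexicalise` and the toy Irish sentence are hypothetical and are not taken from the paper or from the Irish Dependency Treebank.

```python
def delexicalise(conll_sentence: str) -> str:
    """Blank the FORM and LEMMA columns of one CoNLL-X formatted sentence.

    Assumption: one token per line, tab-separated, with FORM in column 2
    and LEMMA in column 3 (CoNLL-X layout).
    """
    out_lines = []
    for line in conll_sentence.strip().splitlines():
        cols = line.split("\t")
        cols[1] = "_"  # FORM: remove the surface word
        cols[2] = "_"  # LEMMA: remove the base form
        out_lines.append("\t".join(cols))
    return "\n".join(out_lines)


# Toy example (hypothetical annotation): "Chonaic mé é", roughly "I saw it".
sample = "\n".join([
    "1\tChonaic\tfeic\tVERB\tVERB\t_\t0\tROOT\t_\t_",
    "2\tmé\tmé\tPRON\tPRON\t_\t1\tnsubj\t_\t_",
    "3\té\té\tPRON\tPRON\t_\t1\tdobj\t_\t_",
])

print(delexicalise(sample))
```

Because only language-independent columns remain, a model trained on delexicalised source-language trees can in principle be applied directly to Irish sentences that carry the same universal tag set, which is the direct transfer setting the abstract describes.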
Original language | English |
---|---|
Title of host publication | Proceedings of the Celtic Language Technology Workshop (CLTW) |
Editors | John Judge, Teresa Lynn, Monica Ward, Brian Ó Raghallaigh |
Place of Publication | Dublin, Ireland |
Publisher | Association for Computational Linguistics |
Pages | 41-49 |
Number of pages | 9 |
ISBN (Print) | 9781873769324 |
Publication status | Published - 2014 |
Event | Celtic Language Technology Workshop (1st : 2014) - Dublin, Ireland. Duration: 23 Aug 2014 → 23 Aug 2014 |
Workshop
Workshop | Celtic Language Technology Workshop (1st : 2014) |
---|---|
City | Dublin, Ireland |
Period | 23/08/14 → 23/08/14 |