Abstract
Data driven POS tagging has achieved good performance for English, but can still lag behind linguistic rule based taggers for morphologically complex languages, such as Icelandic. We extend a statistical tagger to handle fine grained tagsets and improve over the best Icelandic POS tagger. Additionally, we develop a case tagger for non-local case and gender decisions. An error analysis of our system suggests future directions.
Original language | English (US) |
---|---|
Pages (from-to) | 33-36 |
Number of pages | 4 |
Journal | Proceedings of the Annual Meeting of the Association for Computational Linguistics |
DOIs | |
State | Published - 2008 |
Externally published | Yes |
Event | 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL 2008 - Columbus, United States Duration: Jun 16 2008 → Jun 17 2008 |
ASJC Scopus subject areas
- Computer Science Applications
- Linguistics and Language
- Language and Linguistics