TY - GEN
T1 - Icelandic data driven part of speech tagging
AU - Dredze, Mark
AU - Wallenberg, Joel
PY - 2008
Y1 - 2008
N2 - Data driven POS tagging has achieved good performance for English, but can still lag behind linguistic rule based taggers for morphologically complex languages, such as Icelandic. We extend a statistical tagger to handle fine grained tagsets and improve over the best Icelandic POS tagger. Additionally, we develop a case tagger for non-local case and gender decisions. An error analysis of our system suggests future directions.
AB - Data driven POS tagging has achieved good performance for English, but can still lag behind linguistic rule based taggers for morphologically complex languages, such as Icelandic. We extend a statistical tagger to handle fine grained tagsets and improve over the best Icelandic POS tagger. Additionally, we develop a case tagger for non-local case and gender decisions. An error analysis of our system suggests future directions.
UR - http://www.scopus.com/inward/record.url?scp=84859895940&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84859895940&partnerID=8YFLogxK
U2 - 10.3115/1557690.1557700
DO - 10.3115/1557690.1557700
M3 - Conference contribution
AN - SCOPUS:84859895940
SN - 9781932432046
T3 - ACL-08: HLT - 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference
SP - 33
EP - 36
BT - ACL-08
PB - Association for Computational Linguistics (ACL)
T2 - 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-08: HLT
Y2 - 15 June 2008 through 20 June 2008
ER -