Abstract
In this paper, we describe a resource-light system for the automatic morphological analysis and tagging of Russian. We eschew the use of extensive resources (particularly, large annotated corpora and lexicons), exploiting instead (i) pre-existing annotated corpora of Czech; (ii) an unannotated corpus of Russian. We show that our approach has benefits, and present what we believe to be one of the first full evaluations of a Russian tagger in the openly available literature.
Original language | English |
---|---|
Pages | 222-229 |
Number of pages | 8 |
State | Published - 2004 |
Event | 2004 Conference on Empirical Methods in Natural Language Processing, EMNLP 2004 - Barcelona, Spain Duration: 25 Jul 2004 → 26 Jul 2004 |
Conference
Conference | 2004 Conference on Empirical Methods in Natural Language Processing, EMNLP 2004 |
---|---|
Country/Territory | Spain |
City | Barcelona |
Period | 25/07/04 → 26/07/04 |