In this paper, we describe a resource-light system for the automatic morphological analysis and tagging of Russian. We eschew the use of extensive resources (particularly, large annotated corpora and lexicons), exploiting instead (i) pre-existing annotated corpora of Czech; (ii) an unannotated corpus of Russian. We show that our approach has benefits, and present what we believe to be one of the first full evaluations of a Russian tagger in the openly available literature.
|Number of pages||8|
|State||Published - 2004|
|Event||2004 Conference on Empirical Methods in Natural Language Processing, EMNLP 2004 - Barcelona, Spain|
Duration: 25 Jul 2004 → 26 Jul 2004
|Conference||2004 Conference on Empirical Methods in Natural Language Processing, EMNLP 2004|
|Period||25/07/04 → 26/07/04|