A resource-light approach to Russian morphology: Tagging Russian using Czech resources

Jiri Hana, Anna Feldman, Chris Brew

Research output: Contribution to conferencePaperpeer-review

28 Scopus citations

Abstract

In this paper, we describe a resource-light system for the automatic morphological analysis and tagging of Russian. We eschew the use of extensive resources (particularly, large annotated corpora and lexicons), exploiting instead (i) pre-existing annotated corpora of Czech; (ii) an unannotated corpus of Russian. We show that our approach has benefits, and present what we believe to be one of the first full evaluations of a Russian tagger in the openly available literature.

Original languageEnglish
Pages222-229
Number of pages8
StatePublished - 2004
Event2004 Conference on Empirical Methods in Natural Language Processing, EMNLP 2004 - Barcelona, Spain
Duration: 25 Jul 200426 Jul 2004

Conference

Conference2004 Conference on Empirical Methods in Natural Language Processing, EMNLP 2004
Country/TerritorySpain
CityBarcelona
Period25/07/0426/07/04

Fingerprint

Dive into the research topics of 'A resource-light approach to Russian morphology: Tagging Russian using Czech resources'. Together they form a unique fingerprint.

Cite this