|Dynamic Lexica for Query Translation
|Year of Publication
|Karlgren J, Sahlgren M, Järvinen T, Cöster R
|Peters C, Clough P, Gonzalo J, Jones G, Kluck M, Magnini B
|Multilingual Information Access for Text, Speech and Images
|Lecture Notes in Computer Science
|Berlin / Heidelberg
This experiment tests a simple, scalable, and effective approach to building a domain-specific translation lexicon using distributional statistics over parallellized bilingual corpora. A bilingual lexicon is extracted from aligned Swedish-French data, used to translate CLEF topics from Swedish to French, which resulting French queries are then in turn used to retrieve documents from the French language CLEF collection. The results give 34 of fifty queries on or above median for the precision at 1000 documents recall oriented score; with many of the errors possible to handle by the use of string-matching and cognate search. We conclude that the approach presented here is a simple and efficient component in an automatic query translation system.