Improved transliteration and translation disambiguation modules based on CLEF 2007 analysis
Hindi to English CLIR performance is 75% of monolingual and Marathi to English is 64% of monolingual
Need further investigation on results especially the monolingual baselines – Hindi, Marathi and English
Only transliteration achieves around 35% of monolingual performance in Hindi and 25% in Marathi
The second author is supported by the Infosys Fellowship Award
Project linguists at CFILT, IIT Bombay
S. Tarek and K. Grzegorz, Substring-Based Transliteration, In Proceedings of ACL, 2007
F. Huang, Cluster-specific named entity transliteration, In HLT ’05, pages 435–442, Morristown, NJ, USA, 2005.
I. Ounis, G. Amati, P. V., B. He, C. Macdonald, and Johnson, Terrier Information Retrieval Platform, In Proceedings of ECIR 2005, volume 3408 of Lecture Notes in Computer Science, pages 517–519. Springer, 2005.
Christof Monz and Bonnie J. Dorr, Iterative Translation Disambiguation for Cross-Language Information Retrieval, In SIGIR ’05, Pages 520-527, New York, USA, ACM Press
Nicola Bertoldi and Marcello Federico, Statistical Models for Monolingual and Bilingual Information Retrieval, Information Retrieval, 7 (1-2): 53-72, 2004
Martin Braschler and Carol Peters, Cross Language Evaluation Forum: Objectives, Results, Achievements,Information Retrieval, 7 (1-2): 7-31, 2004
Ricardo BaezaYates and Berthier RibeiroNeto, Modern Information Retrieval, Pearson Education, 2005.
Dan Gusfield, Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology,Cambridge University Press, 1997.