Resolving ambiguity for cross-language retrieval
- 1 August 1998
- conference paper
- Published by Association for Computing Machinery (ACM)
Abstract
One of the main hurdles to improved CLIR effectiveness is resolving ambiguity associated with translation. Availability of resources is also a problem. First we present a technique based on co-occurrence statistics from unlinked corpora which can be used to reduce the ambiguity associated with phrasal and term translation. We then combine this method with other techniques for reducing ambiguity and achieve more than 90% monolingual effectiveness. Finally, we compare the co-occurrence method with parallel corpus and machine translation techniques and show that good retrieval effectiveness can be achieved without complex resources.