Abstract
We describe the application of a cross-language information retrieval technique based on similarity thesauri in the domain of Swiss law. We present the theory of similarity thesauri, which are information structures derived from corpora, and show how they can be used for cross-language retrieval. We also discuss the collections of Swiss legal documents and show how we have used them to construct an environment in which we can directly evaluate the performance of our cross-language retrieval system. Evaluation shows that, in the best case, cross-language retrieval works as well as monolingual retrieval. We conclude that providing cross-language access to digital libraries is already a viable possibility.
(Martin Braschler has since left ETH Zurich)
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sheridan, P., Braschler, M., Schäuble, P. (1997). Cross-language information retrieval in a Multilingual Legal Domain. In: Peters, C., Thanos, C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 1997. Lecture Notes in Computer Science, vol 1324. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0026732
DOI: https://doi.org/10.1007/BFb0026732
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63554-3
Online ISBN: 978-3-540-69597-4
eBook Packages: Springer Book Archive