Análisis de la expansión de consulta para colecciones médicas utilizando información mutua, ganancia de información y la ontología MeSH

  1. Perea Ortega, José Manuel
  2. Montejo Ráez, Arturo
  3. Díaz Galiano, Manuel Carlos
  4. García Cumbreras, Miguel Ángel
Revista:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Any de publicació: 2011

Número: 47

Pàgines: 13-19

Tipus: Article

Altres publicacions en: Procesamiento del lenguaje natural

Resum

In this paper we show several experiments related to query expansion in medical information retrieval systems, using ImageCLEFmed as a evaluation framework. We have evaluated different query expansion techniques such as PRF and others based on the use of the medical ontology MeSH. Specifically, in this latter case, different types of filtering have been tested, based on a previous selection of MeSH categories and the application of concepts such as information gain and mutual information. The results show that the PRF expansion slightly improves the base case without expanding the query, while the desirability of the expansion with MeSH terms is difficult to determine. However, we can deduce that the expansion with MeSH terms introduced too much noise during the retrieval process, at least by the techniques applied in this work.

Referències bibliogràfiques

  • Aronson, A.R. y T.C. Rindflesch. 1997. Query expansion using the UMLS metathesaurus. En D.R. Masys, editor, proceedings of the 1997 AMIA Annual Fall Symposium, páginas 485–489.
  • Chevallet, Jean-Pierre, Joo-Hwee Lim, y Saïd Radhouani. 2006. A structured visual learning approach mixed with ontology dimensions for medical queries. Accessing Multilingual Information Repositories. Lecture Notes in Computer Science, páginas 642–651.
  • Cover, Thomas M. y Joy A. Thomas. 2006. Elements of information theory (2. ed.). Wiley.
  • Díaz-Galiano, M. C., M. T. Martín-Valdivia, y L. A. Ureña-López. 2009. Query expansion with a medical ontology to improve a multimodal information retrieval system. Computers in Biology and Medicine, 39(4):396–403.
  • Díaz-Galiano, Manuel Carlos. 2011. Recuperación de información multimodal basada en integración de conocimiento. Ph.D. tesis, Universidad de Jaén.
  • Díaz-Galiano, M.C., M.A. García-Cumbreras, M.T. Martín-Valdivia, y A. Montejo-R´aez, 2010. Knowledge Integration using Textual Information for Improving ImageCLEF Collections, volumen 32 de The Information Retrieval Series, páginas 295–313. Springer Berlin Heidelberg.
  • Gobeill, J., D. Theodoro, E. Patsche, y P. Ruch. 2009. Taking benefit of query and document expansion using mesh descriptors in medical imageclef 2009. Working Notes of CLEF.
  • Hersh, W., S. Price, y L. Donohoe. 2000. Assessing thesaurus-based query expansion using the umls metathesaurus. Proc AMIA Symp, páginas 344–348.
  • Hersh, William R., Henning Müller, y Jayashree Kalpathy-Cramer. 2009. The imageclefmed medical image retrieval task test collection. Journal of Digital Imaging, 22(6):648–655.
  • Karamanis, Nikiforos. 2007. Text mining for biology and biomedicine. Computational Linguistics, 33(1):135–140.
  • Lana-Serrano, S., J. Villena-Román, y J.C. González-Cristóbal. 2008. Miracle at image-clefmed 2008: Evaluating strategies for automatic topic expansion. En Working Notes of the 2008 CLEF Workshop, Aarhus, Denmark.
  • Müller, Henning, Thomas Deselaers, Thomas Martin Deserno, Paul Clough, Eugene Kim, y William R. Hersh. 2006. Overview of the ImageCLEFmed 2006 Medical Retrieval and Medical Annotation Tasks. En CLEF, volumen 4730 de Lecture Notes in Computer Science, páginas 595–608. Springer.
  • Nilsson, Kristina, Hans Hjelm, y Henrik Oxhammar. 2005. SUiS - cross-language ontology-driven information retrieval in a controlled domain. En StefanWerner, editor, Proceedings of the 15th NODALIDA conference.
  • Porter, M.F. 1980. An algorithm for suffix stripping. Program, 14(3):130–137.
  • Sebastiani, F. 2002. Machine Learning in Automated Text Categorization. ACM Computing Surveys, 34(1):1.
  • Shannon, Claude E. 1948. A mathematical theory of communication. The Bell system technical journal, 27:379–423.