TY - JOUR
T1 - Semantic annotation for concept-based cross-language medical information retrieval
AU - Volk, Martin
AU - Ripplinger, Bärbel A.
AU - Vintar, Špela
AU - Buitelaar, Paul
AU - Raileanu, Diana
AU - Sacaleanu, Bogdan
PY - 2002/12/4
Y1 - 2002/12/4
N2 - We present a framework for concept-based cross-language information retrieval in the medical domain, which is under development in the MUCHMORE project. Our approach is based on using the Unified Medical Language System (UMLS) as the primary source of semantic data. Documents and queries are annotated with multiple layers of linguistic information. Linguistic processing includes part-of-speech tagging, morphological analysis, phrase recognition and the identification of medical terms and semantic relations between them. The paper describes experiments in monolingual and cross-language document retrieval, performed on a corpus of medical abstracts. Results show that linguistic processing, especially lemmatization and compound analysis for German, is a crucial step in achieving a good baseline performance. On the other hand, they show that semantic information, specifically the combined use of concepts and relations, increases the performance in monolingual and cross-language retrieval.
KW - Information systems
KW - Linguistics
KW - Natural language processing
KW - Semantics
KW - Unified Medical Language System
UR - https://www.scopus.com/pages/publications/0037021426
U2 - 10.1016/S1386-5056(02)00058-8
DO - 10.1016/S1386-5056(02)00058-8
M3 - Article
C2 - 12460635
AN - SCOPUS:0037021426
SN - 1386-5056
VL - 67
SP - 97
EP - 112
JO - International Journal of Medical Informatics
JF - International Journal of Medical Informatics
IS - 1-3
ER -