To efficiently extract and manage an extremely large quantity of meaningful data, into a delicate sector like the healthcare one, means to deal with the most sophisticated linguistic strategies and computational solutions. In this research we aim to approach the semantic dimension of the medical word's formation elements into a monolingual and bilingual environment.The purpose is to automatically build Italian- English medical lexical resources, by grounding their analysis and generation on the manipulation of the morphemes which they are formed with. This approach has an significant impact on the automatic analysis of neologisms, typical of the medical domain. In detail, we created two electronic dictionaries of morphemes and a morphological Finite State Transducer which together, finding every possible combination of Pre- fixes, Confixes and Suffixes, are able to annotate/translate the terms contained into a medical corpus, according with the meaning of the morphemes that compose these words.
Morphological Analysis and Generation of Monolingual and Bilingual Medical Lexicons
Annibale Elia;MAISTO, ALESSANDRO;PELOSI, SERENA
2015
Abstract
To efficiently extract and manage an extremely large quantity of meaningful data, into a delicate sector like the healthcare one, means to deal with the most sophisticated linguistic strategies and computational solutions. In this research we aim to approach the semantic dimension of the medical word's formation elements into a monolingual and bilingual environment.The purpose is to automatically build Italian- English medical lexical resources, by grounding their analysis and generation on the manipulation of the morphemes which they are formed with. This approach has an significant impact on the automatic analysis of neologisms, typical of the medical domain. In detail, we created two electronic dictionaries of morphemes and a morphological Finite State Transducer which together, finding every possible combination of Pre- fixes, Confixes and Suffixes, are able to annotate/translate the terms contained into a medical corpus, according with the meaning of the morphemes that compose these words.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.