Isolating Terminology Layers in Complex Linguistic Environments: a Study About Waste Management (Short Paper)
Cirillo N.
2023-01-01
Abstract
Automatic term extraction aims to extract terminological units from specialized corpora in order to help terminographers develop lexicographic resources. In this paper, we introduce Domain Concept Relatedness, a novel term extraction technique meant to isolate the terminology of a given subject field. To evaluate our technique, we apply it to the extraction of waste management terms from a new Italian corpus of waste management legislation. We compare it against Sketch Engine and the contrastive approach, showing that our technique effectively extracts multi-word terms belonging to a given subject field but still fails to extract single-word terms.