Today Lexicon-Grammar (LG) remains one of the most consistent Natural Language Processing (NLP) approaches, especially for Semantic-Based Data Mining (SBDM) and Semantic Web. Its main goal is to describe all mechanisms of word combinations closely related to the concrete use of lexical units and to sentence creation. Also, it gives an exhaustive description of lexical and syntactic structures of several languages. LG was set up by the French linguist Maurice Gross during the ‘60s, and subsequently developed for and applied to Italian by Annibale Elia, Emilio D’Agostino and Maurizio Martinelli. Its theoretical approach is prevalently based on Zelig Sabbettai Harris’ Operator-Argument Grammar, which assumes that each human language is a self-organizing system, and that the syntactic and semantic properties of a given word may be calculated on the basis of the relationships this word has with all other co-occurring words inside given sentence contexts. Simple sentences2 are the minimal linguistic meaning structures upon which LG founds its studies on natural language syntactic features. In the last twenty years, LG has also reached important results in the domain of automatic textual analysis and parsing with NLP-oriented software such as INTEX3, UNITEX4, and more recently NOOJ5. 1 Alberto Postiglione is author of paragraph 4.1. Mario Monteleone is author of paragraphs 3.1 and 4. Federica Marano is author of paragraphs 3.2 and 4.3. Johanna Monti is author of sections 1 and 2. Antonella Napoli is author of paragraph 4.2. 2 In LG, a simple sentence is formed by a unique predicative element (a verb, but also a name or an adjective) plus all the necessary arguments it selects to achieve acceptability and grammaticality. The study of simple sentences is completed analyzing the rules of co-occurrence and selection restriction, which are distributional and transformational rules based on predicate syntactic-semantic properties. 3 For more on INTEX, see http://intex.univ-fcomte.fr/. 4 For more on UNITEX, see http://www-igm.univ-mlv.fr/~unitex/. 5 For more on NooJ, see http://www.nooj4nlp.net/pages/nooj.html. ALBERTO POSTIGLIONE - MARIO MONTELEONE - FEDERICA MARANO - JOHANNA MONTI - ANTONELLA NAPOLI1 Università degli Studi di Salerno ELECTRONIC DICTIONARIES FOR INFORMATION RETRIEVAL, AUTOMATIC TEXTUAL ANALYSIS AND SEMANTIC-BASED DATA MINING SOFTWARE 1. Theoretical and analytical framework: Lexicon-Grammar

Electronic Dictionaries for Information Retrieval, Automatic Textual Analysis and Semantic-Based Data Mining Software

POSTIGLIONE, Alberto;MONTELEONE, Mario;MARANO, FEDERICA;MONTI, JOHANNA;NAPOLI, Antonella
2012

Abstract

Today Lexicon-Grammar (LG) remains one of the most consistent Natural Language Processing (NLP) approaches, especially for Semantic-Based Data Mining (SBDM) and Semantic Web. Its main goal is to describe all mechanisms of word combinations closely related to the concrete use of lexical units and to sentence creation. Also, it gives an exhaustive description of lexical and syntactic structures of several languages. LG was set up by the French linguist Maurice Gross during the ‘60s, and subsequently developed for and applied to Italian by Annibale Elia, Emilio D’Agostino and Maurizio Martinelli. Its theoretical approach is prevalently based on Zelig Sabbettai Harris’ Operator-Argument Grammar, which assumes that each human language is a self-organizing system, and that the syntactic and semantic properties of a given word may be calculated on the basis of the relationships this word has with all other co-occurring words inside given sentence contexts. Simple sentences2 are the minimal linguistic meaning structures upon which LG founds its studies on natural language syntactic features. In the last twenty years, LG has also reached important results in the domain of automatic textual analysis and parsing with NLP-oriented software such as INTEX3, UNITEX4, and more recently NOOJ5. 1 Alberto Postiglione is author of paragraph 4.1. Mario Monteleone is author of paragraphs 3.1 and 4. Federica Marano is author of paragraphs 3.2 and 4.3. Johanna Monti is author of sections 1 and 2. Antonella Napoli is author of paragraph 4.2. 2 In LG, a simple sentence is formed by a unique predicative element (a verb, but also a name or an adjective) plus all the necessary arguments it selects to achieve acceptability and grammaticality. The study of simple sentences is completed analyzing the rules of co-occurrence and selection restriction, which are distributional and transformational rules based on predicate syntactic-semantic properties. 3 For more on INTEX, see http://intex.univ-fcomte.fr/. 4 For more on UNITEX, see http://www-igm.univ-mlv.fr/~unitex/. 5 For more on NooJ, see http://www.nooj4nlp.net/pages/nooj.html. ALBERTO POSTIGLIONE - MARIO MONTELEONE - FEDERICA MARANO - JOHANNA MONTI - ANTONELLA NAPOLI1 Università degli Studi di Salerno ELECTRONIC DICTIONARIES FOR INFORMATION RETRIEVAL, AUTOMATIC TEXTUAL ANALYSIS AND SEMANTIC-BASED DATA MINING SOFTWARE 1. Theoretical and analytical framework: Lexicon-Grammar
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11386/3548677
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact