In this work we present a new framework for the analysis of Italian texts that could help linguists to perform rapid text analysis. The framework, that performs both statistical and rule-based analysis, is called LG-Starship. The idea is to built a modular software that includes the basic algorithms to perform different kinds of analysis. The framework will include a Preprocessing Module a POS Tagging and Lemmatization module, a Statistic Module, a Semantic Module based on Distributional Analysis algorithms, and a Syntactic Module, which analyses syntax structures of a selected sentence and tags the verbs and its arguments with semantic labels. The objective of the Framework is to build an “all-in-one” platform for NLP which allows any kind of users to perform basic and advanced text analysis.
Textual Statistics and Named Entity Recognition Applied to Game of Thrones Novels
Annibale Elia
Supervision
;Alessandro Maisto
Methodology
;Giandomenico Martorelli
Validation
;Serena Pelosi
Resources
;Pierluigi Vitale
Data Curation
2020
Abstract
In this work we present a new framework for the analysis of Italian texts that could help linguists to perform rapid text analysis. The framework, that performs both statistical and rule-based analysis, is called LG-Starship. The idea is to built a modular software that includes the basic algorithms to perform different kinds of analysis. The framework will include a Preprocessing Module a POS Tagging and Lemmatization module, a Statistic Module, a Semantic Module based on Distributional Analysis algorithms, and a Syntactic Module, which analyses syntax structures of a selected sentence and tags the verbs and its arguments with semantic labels. The objective of the Framework is to build an “all-in-one” platform for NLP which allows any kind of users to perform basic and advanced text analysis.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.