For organizations using big data, one of the most important element to reach tangible results is exploiting human resources: it is not possible manage da-ta without using them intelligently. Considering the human intervention in relation to big data, means calling into question the so-called “data scien-tist”. Moving from the above, the main aim of this study is using the lin-guistic software environment NooJ to processing a large corpus of job ad-vertisements for data scientist in Italy collected on the business-networking site LinkedIn. Creating specific linguistic resources with NooJ, we are able to identify the most required skills by companies and organiza-tions. Searching the ideal candidate to hire, companies pay attention equally to technical skills and soft skills, in particular, as the capacity to work in team and communicate concerns. Finally, our research confirmed that studying the context in which the single words are inserted represents a key step in the process of information extraction by texts.

The Data Scientist on LinkedIn: job advertisement corpus processing with NooJ.

della Volpe Maddalena;Esposito Francesca
2020-01-01

Abstract

For organizations using big data, one of the most important element to reach tangible results is exploiting human resources: it is not possible manage da-ta without using them intelligently. Considering the human intervention in relation to big data, means calling into question the so-called “data scien-tist”. Moving from the above, the main aim of this study is using the lin-guistic software environment NooJ to processing a large corpus of job ad-vertisements for data scientist in Italy collected on the business-networking site LinkedIn. Creating specific linguistic resources with NooJ, we are able to identify the most required skills by companies and organiza-tions. Searching the ideal candidate to hire, companies pay attention equally to technical skills and soft skills, in particular, as the capacity to work in team and communicate concerns. Finally, our research confirmed that studying the context in which the single words are inserted represents a key step in the process of information extraction by texts.
2020
978-3-030-38833-1
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11386/4733009
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 0
social impact