Text mining has become central to bibliometrics, providing quantitative insight into the semantic structure of scientific communication. This review surveys current methodological approaches to text-based science mapping, including geometric embeddings, probabilistic models, network techniques, and neural embedding methods. The discussion examines how these approaches operate across different representations of text and evaluates their interpretability, stability, and statistical assumptions. Key issues include data quality, model validation, reproducibility, and the growing influence of large language models. Persistent challenges - language bias, topic instability, limited full-text access, and model opacity - raise open questions about dynamic, multimodal, and ethically grounded science mapping.
Text Mining in Bibliometrics and Science Mapping: A Methodological Review
Michelangelo Misuraca
In corso di stampa
Abstract
Text mining has become central to bibliometrics, providing quantitative insight into the semantic structure of scientific communication. This review surveys current methodological approaches to text-based science mapping, including geometric embeddings, probabilistic models, network techniques, and neural embedding methods. The discussion examines how these approaches operate across different representations of text and evaluates their interpretability, stability, and statistical assumptions. Key issues include data quality, model validation, reproducibility, and the growing influence of large language models. Persistent challenges - language bias, topic instability, limited full-text access, and model opacity - raise open questions about dynamic, multimodal, and ethically grounded science mapping.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


