In recent years, the surge in textual data has led to an information overload, particularly evident in the scientific literature domain due to the rapid pace of document publication. Researchers require tools to manage the sheer volume of publications, enabling the rapid extraction of knowledge. Automatic Text Summarization emerges as a key solution, reducing reading time and streamlining efforts by accentuating key points and generating concise summaries with the most pertinent information. Our innovative approach, named Integrated Text Summarization, aims to enhance extractive algorithms by incorporating specific features of scientific documents—such as keywords and title terms—into the sentence selection process. This method focuses on extracting significant sentences from each section of scientific papers, thereby identifying content that encapsulates the core knowledge of a publication. We will assess the effectiveness of our proposal across various research domains using a selection of scientific papers.
Extracting Knowledge from Scientific Literature with an Integrated Text Summarization Approach
Michelangelo Misuraca;
2024-01-01
Abstract
In recent years, the surge in textual data has led to an information overload, particularly evident in the scientific literature domain due to the rapid pace of document publication. Researchers require tools to manage the sheer volume of publications, enabling the rapid extraction of knowledge. Automatic Text Summarization emerges as a key solution, reducing reading time and streamlining efforts by accentuating key points and generating concise summaries with the most pertinent information. Our innovative approach, named Integrated Text Summarization, aims to enhance extractive algorithms by incorporating specific features of scientific documents—such as keywords and title terms—into the sentence selection process. This method focuses on extracting significant sentences from each section of scientific papers, thereby identifying content that encapsulates the core knowledge of a publication. We will assess the effectiveness of our proposal across various research domains using a selection of scientific papers.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.