Heart sound classification is a critical task in automated cardiac diagnostics, yet it is often challenged by the limited availability of labeled data and the dominance of low-frequency components in heart sound signals. This study introduces a novel data augmentation technique, the first-difference method, to address these challenges in convolutional neural network (CNN)- based classification. By enhancing high-frequency components in the time domain, this method enables the model to better capture abnormalities, such as murmurs, present in higher frequency ranges. The effectiveness of this approach was evaluated using three spectral transformations—linear spectrogram, mel-spectrogram, and mel-frequency cepstrum coefficient (MFCC) —across multiple augmentation levels. The results demonstrate substantial improvements in classification metrics, including precision, recall, F1 score, and specificity, with MFCC-based predictors achieving the highest performance gains. The findings highlight the potential of the first-difference augmentation as a simple and effective strategy for improving heart sound classification, paving the way for more robust and generalizable diagnostic tools in real-world clinical applications.

Signal First-Difference as Augmentation Method for CNN-Based Heart Sound Classification

Gallo V.;Laino V.;Carratu' M.;
2025

Abstract

Heart sound classification is a critical task in automated cardiac diagnostics, yet it is often challenged by the limited availability of labeled data and the dominance of low-frequency components in heart sound signals. This study introduces a novel data augmentation technique, the first-difference method, to address these challenges in convolutional neural network (CNN)- based classification. By enhancing high-frequency components in the time domain, this method enables the model to better capture abnormalities, such as murmurs, present in higher frequency ranges. The effectiveness of this approach was evaluated using three spectral transformations—linear spectrogram, mel-spectrogram, and mel-frequency cepstrum coefficient (MFCC) —across multiple augmentation levels. The results demonstrate substantial improvements in classification metrics, including precision, recall, F1 score, and specificity, with MFCC-based predictors achieving the highest performance gains. The findings highlight the potential of the first-difference augmentation as a simple and effective strategy for improving heart sound classification, paving the way for more robust and generalizable diagnostic tools in real-world clinical applications.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11386/4916435
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact