The Ground Truth Effect: Investigating SZZ Variants in Just-in-Time Vulnerability Prediction

Cannavale, A.; Iannone, E.; Di Lillo, G.; Palomba, F.; De Lucia, A.

doi:10.1007/978-3-032-04207-1_21

Just-in-Time (JIT) vulnerability prediction is critical for proactively securing software, yet its effectiveness heavily relies on the quality of the ground truth used for training models. This ground truth is commonly established using variants of the SZZ algorithm to identify vulnerability-contributing commits (VCCs). However, the impact of choosing a specific SZZ variant on model performance remains largely unexplored. In this study, we systematically investigate the effect of eight SZZ variants on JIT vulnerability prediction across seven open-source Java projects. Our findings reveal that the choice of the SZZ variant is a non-trivial factor. Models trained with datasets labeled by variants like B-SZZ, V-SZZ, and VCC-SZZ achieve strong and stable predictive performance, with median MCC scores often exceeding 0.50. In contrast, variants such as L-SZZ and R-SZZ produce models that perform no better than random chance, with median MCC scores close to 0.0. This performance gap demonstrates that an inappropriate SZZ variant can invalidate prediction models, underscoring the necessity of a principled approach to defining ground truth.
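For readers unfamiliar with the metric quoted above, the following is a minimal sketch of how the Matthews correlation coefficient (MCC) is computed from a binary confusion matrix. The function name `mcc` and all counts are hypothetical and chosen for illustration only; they are not taken from the study. The sketch shows why a score near 0.0 corresponds to chance-level prediction, while a score above 0.50 indicates a clearly informative model.

```python
import math


def mcc(tp, fp, tn, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Common convention: return 0 when a row or column of the matrix is empty.
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts for 200 commits, 20 of which are vulnerability-contributing.
# An informative model: most flagged commits really are VCCs.
informative = mcc(tp=15, fp=5, tn=175, fn=5)

# A chance-like model: it flags 20% of all commits and catches 20% of the VCCs,
# so its predictions are statistically independent of the labels.
chance_like = mcc(tp=4, fp=36, tn=144, fn=16)

print(f"informative model MCC: {informative:.3f}")
print(f"chance-like model MCC: {chance_like:.3f}")
```

Because MCC uses all four cells of the confusion matrix, it stays meaningful when positive commits are rare, which is the usual situation in vulnerability prediction.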


Year: 2026

ISBN: 9783032042064; 9783032042071

Publication type: 4.1 Contributions in conference proceedings


Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11386/4943676


UniSa - IRIS Institutional Research Information System
