The objective of this paper is to investigate the use of Support Vector Regression (SVR) for Web development effort estimation when using a crosscompany data set. Four kernels of SVR were used, linear, polynomial, Gaussian and sigmoid and two preprocessing strategies of the variables were applied, namely normalization and logarithmic. The hold-out validation process was carried out for all the eight configurations using a training set and a validation set from the Tukutuku data set. Our results suggest that the predictions obtained with linear kernel applying a logarithmic transformation of variables (LinLog) are significantly better than those obtained with the other configurations. In addition, SVR has been compared with the traditional estimation techniques, such as Manual StepWise Regression, Case-Based Reasoning, and Bayesian Networks. Our results suggest that SVR with LinLog configuration can provide significantly superior prediction accuracy than other techniques. © Springer-Verlag Berlin Heidelberg 2009.
Using Support Vector Regression for web development effort estimation
FERRUCCI, Filomena;GRAVINO, Carmine;
2009-01-01
Abstract
The objective of this paper is to investigate the use of Support Vector Regression (SVR) for Web development effort estimation when using a crosscompany data set. Four kernels of SVR were used, linear, polynomial, Gaussian and sigmoid and two preprocessing strategies of the variables were applied, namely normalization and logarithmic. The hold-out validation process was carried out for all the eight configurations using a training set and a validation set from the Tukutuku data set. Our results suggest that the predictions obtained with linear kernel applying a logarithmic transformation of variables (LinLog) are significantly better than those obtained with the other configurations. In addition, SVR has been compared with the traditional estimation techniques, such as Manual StepWise Regression, Case-Based Reasoning, and Bayesian Networks. Our results suggest that SVR with LinLog configuration can provide significantly superior prediction accuracy than other techniques. © Springer-Verlag Berlin Heidelberg 2009.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.