In high and ultra-high dimensional domains, the selection of relevant variables is an important task to reduce the computational cost of the data analysis and to make inference. Here, we present a variable selection algorithm based on the variable ranking which is obtained marginally evaluating the estimated coe!cient associated to each covariate in generalized regression models. The algorithm is applied to gene expression microarray data giving evidence of its performance in ultra-high dimensional context.

Variable ranking and data reduction in GLM domain

Francesco Giordano;Marcella Niglio;Marialuisa Restaino
2025

Abstract

In high and ultra-high dimensional domains, the selection of relevant variables is an important task to reduce the computational cost of the data analysis and to make inference. Here, we present a variable selection algorithm based on the variable ranking which is obtained marginally evaluating the estimated coe!cient associated to each covariate in generalized regression models. The algorithm is applied to gene expression microarray data giving evidence of its performance in ultra-high dimensional context.
2025
9788854958494
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11386/4912501
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact