Real-world datasets, such as genomic data, are noisy and high-dimensional, and are therefore difficult to analyse without a preliminary step aimed to reduce data dimensionality and to select relevant features. Projection techniques are a useful tool to pre-process high dimensional datd since they allow to achieve a simpler representation of the original data that still preserves intrinsic information. In this work, we assess the effectiveness of these methods when applied to two common tasks in Bioinformatics: patient classification and gene clustering. We compared the performance of different learning models in the original space and in several projected spaces obtained with different techniques, both in a supervised and in an unsupervised setting. Our results show that projection techniques can lead to a significant improvement in the learning ability of models.

Effectiveness of projection techniques in genomic data analysis

GALDI, PAOLA;SERRA, ANGELA;TAGLIAFERRI, Roberto
2016-01-01

Abstract

Real-world datasets, such as genomic data, are noisy and high-dimensional, and are therefore difficult to analyse without a preliminary step aimed to reduce data dimensionality and to select relevant features. Projection techniques are a useful tool to pre-process high dimensional datd since they allow to achieve a simpler representation of the original data that still preserves intrinsic information. In this work, we assess the effectiveness of these methods when applied to two common tasks in Bioinformatics: patient classification and gene clustering. We compared the performance of different learning models in the original space and in several projected spaces obtained with different techniques, both in a supervised and in an unsupervised setting. Our results show that projection techniques can lead to a significant improvement in the learning ability of models.
978-1-5090-1131-5
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11386/4678107
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact