Motivation In the era of big data, the richness and variety of available datasets have opened new horizons for investigators in the biomedical field. The ultimate challenge consists in building an integrated base of knowledge derived from heterogeneous sources. Multiview learning is the branch of machine learning concerned with the analysis of multimodal data, that is, patterns represented by different sets of features extracted from multiple data sources. In recent years, multiview learning methodologies have become increasingly popular, and a high number of biomedical applications based on multiview data have been recorded in literature. For example, in bioinformatics, analyses can be based on multiple experiments investigating different facets of the same phenomena, such as gene expression, microRNA expression, protein-protein interactions, genome-wide association, and so on, in order to capture information regarding different aspects of biological systems. In the same way, neuroscience data analysis can benefit from different imaging modalities that allow to study different features of the nervous system (e.g., structural vs. functional organization). Compared to the limited perspective offered by single-view analyses, the integration of multiple views can provide a deeper understanding of the underlying principles governing complex systems. Results In this work, we review the existing multiview methodologies to discuss their operation modes and principles, with the goal of increasing their further development in the biomedical field. We organize the described methods in three categories, according to the type of data, the statistical problem, and the type of integration. This discussion, which highlights the advantages and disadvantages of different schools of thought, is intended to be a reference for those who want to start working with the integration of biomedical data. We selected a number of representative examples in bioinformatics and neuroinformatics to show the potential of multiview learning applications to cutting-edge research problems. First, we explain how multiview clustering can be used to perform patient subtyping in order to identify groups of patients who share similar molecular characteristics and possibly similar reactions to treatment. Then, the drug-repositioning problem is introduced and a discussion of the multiview classification methods used in literature is provided. We then describe an example of how both clustering and classification can be combined in a multiview setting for the automated diagnosis of neurodegenerative disorders, and we explain how multiple noninvasive imaging modalities can be exploited together to obtain more accurate brain parcellations. Finally, we discuss how deep learning techniques that are getting more and more recognition in various fields can be applied to multimodal data to learn complex representations, and we present few examples of this application.

Multiview learning in biomedical applications

Serra A.;Galdi P.;Tagliaferri R.
2018-01-01

Abstract

Motivation In the era of big data, the richness and variety of available datasets have opened new horizons for investigators in the biomedical field. The ultimate challenge consists in building an integrated base of knowledge derived from heterogeneous sources. Multiview learning is the branch of machine learning concerned with the analysis of multimodal data, that is, patterns represented by different sets of features extracted from multiple data sources. In recent years, multiview learning methodologies have become increasingly popular, and a high number of biomedical applications based on multiview data have been recorded in literature. For example, in bioinformatics, analyses can be based on multiple experiments investigating different facets of the same phenomena, such as gene expression, microRNA expression, protein-protein interactions, genome-wide association, and so on, in order to capture information regarding different aspects of biological systems. In the same way, neuroscience data analysis can benefit from different imaging modalities that allow to study different features of the nervous system (e.g., structural vs. functional organization). Compared to the limited perspective offered by single-view analyses, the integration of multiple views can provide a deeper understanding of the underlying principles governing complex systems. Results In this work, we review the existing multiview methodologies to discuss their operation modes and principles, with the goal of increasing their further development in the biomedical field. We organize the described methods in three categories, according to the type of data, the statistical problem, and the type of integration. This discussion, which highlights the advantages and disadvantages of different schools of thought, is intended to be a reference for those who want to start working with the integration of biomedical data. We selected a number of representative examples in bioinformatics and neuroinformatics to show the potential of multiview learning applications to cutting-edge research problems. First, we explain how multiview clustering can be used to perform patient subtyping in order to identify groups of patients who share similar molecular characteristics and possibly similar reactions to treatment. Then, the drug-repositioning problem is introduced and a discussion of the multiview classification methods used in literature is provided. We then describe an example of how both clustering and classification can be combined in a multiview setting for the automated diagnosis of neurodegenerative disorders, and we explain how multiple noninvasive imaging modalities can be exploited together to obtain more accurate brain parcellations. Finally, we discuss how deep learning techniques that are getting more and more recognition in various fields can be applied to multimodal data to learn complex representations, and we present few examples of this application.
2018
9780128154809
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11386/4764326
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 10
social impact