In the biological field, having a visual and interactive representation of data is useful, particularly when there is a need to investigate a large amount of multilevel data. It is advantageous to communicate this knowledge intuitively because it helps the users to perceive the dynamic structure in which the correct connections are present and can be extrapolated. In this work, we propose a human-interaction system to view similarity data based on the functions of the Gene Ontology (Cellular Component, Molecular Function, and Biological Process) of the proteins/genes for Alzheimer disease and Parkinson disease. The similarity data was built with the Lin and Wang measures for all three areas of Gene Ontology. We clustered data with the K-means algorithm in order to demonstrate how information derived from data can only be partial when using traditional display methods. Then, we have suggested a dynamic and interactive view based on SigmaJS with the aim of allowing customization in the interactive mode of the analysis workflow by users. To this aim, we have developed a first prototype to obtained a more immediate visualization to capture the most relevant information within the three vocabularies of Gene Ontology. This facilitates the creation of an omic view and the ability to perform a multilevel analysis with more details which is much more valuable for the understanding of knowledge by the end users.

PADD: Dynamic Distance-Graph based on Similarity Measures for GO Terms Visualization of Alzheimer and Parkinson diseases

Alessia Auriemma Citarella
;
Fabiola De Marco;Luigi Di Biasi;Michele Risi;Genoveffa Tortora
2021-01-01

Abstract

In the biological field, having a visual and interactive representation of data is useful, particularly when there is a need to investigate a large amount of multilevel data. It is advantageous to communicate this knowledge intuitively because it helps the users to perceive the dynamic structure in which the correct connections are present and can be extrapolated. In this work, we propose a human-interaction system to view similarity data based on the functions of the Gene Ontology (Cellular Component, Molecular Function, and Biological Process) of the proteins/genes for Alzheimer disease and Parkinson disease. The similarity data was built with the Lin and Wang measures for all three areas of Gene Ontology. We clustered data with the K-means algorithm in order to demonstrate how information derived from data can only be partial when using traditional display methods. Then, we have suggested a dynamic and interactive view based on SigmaJS with the aim of allowing customization in the interactive mode of the analysis workflow by users. To this aim, we have developed a first prototype to obtained a more immediate visualization to capture the most relevant information within the three vocabularies of Gene Ontology. This facilitates the creation of an omic view and the ability to perform a multilevel analysis with more details which is much more valuable for the understanding of knowledge by the end users.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11386/4799272
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact