Microalgae play a crucial role in various sectors, such as biofuel production or environmental monitoring. The ability to accurately classify and analyze microalgae species from optical images is vital for advancing research and applications. Generally, the data regarding the population of microalgae constitute a valuable input for machine-learning algorithms whose aim is to classify real data derived from optical images of microalgae cells. However, obtaining a diverse dataset of microalgae populations to train machine-learning models can be challenging and resource-intensive. This paper presents a machine-learning algorithm based on a dataset of algal records generated by a simulation model. The simulation model uses a combination of mathematical models, probabilistic distributions, and biological knowledge to create realistic data on microalgae populations. The dataset generated from the developed model serves as a resource for training and validation of the subsequent machine-learning model proposed for the microalgae classification task. The machine learning model, trained on this synthetic data, can subsequently be applied to efficiently classify and analyze optical images of real microalgae populations, leading to improved precision and reliability in the identification of species useful to produce biofuels.

Advanced Simulation Model for Studying Biofuel-producing Microalgae Populations

Miccio Michele
Writing – Review & Editing
;
2024

Abstract

Microalgae play a crucial role in various sectors, such as biofuel production or environmental monitoring. The ability to accurately classify and analyze microalgae species from optical images is vital for advancing research and applications. Generally, the data regarding the population of microalgae constitute a valuable input for machine-learning algorithms whose aim is to classify real data derived from optical images of microalgae cells. However, obtaining a diverse dataset of microalgae populations to train machine-learning models can be challenging and resource-intensive. This paper presents a machine-learning algorithm based on a dataset of algal records generated by a simulation model. The simulation model uses a combination of mathematical models, probabilistic distributions, and biological knowledge to create realistic data on microalgae populations. The dataset generated from the developed model serves as a resource for training and validation of the subsequent machine-learning model proposed for the microalgae classification task. The machine learning model, trained on this synthetic data, can subsequently be applied to efficiently classify and analyze optical images of real microalgae populations, leading to improved precision and reliability in the identification of species useful to produce biofuels.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11386/4909280
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact