Ultra-Low-Resolution (ULR) Time-of-Flight (ToF) sensors are utilized in combination with higher-resolution image sensors in several industrial applications, such as object recognition and accurate auto-focus. However, the heterogeneous combination of these sensors and the complex post-processing systems, makes it challenging to meet the requirements of compactness, lightness, and aesthetics in edge devices used in industrial and consumer applications, such as smart visors and glasses. This work investigates the feasibility of using ULR ToF as the only sensing element in object recognition systems, combined with machine learning techniques. The outcome of this investigation resulted in a very compact system based on the STMicroelectronics VL53L8CX 8 × 8 pixel ToF sensor, coupled with a custom Convolutional Neural Network (CNN) capable of classifying four objects even under partial occlusion conditions. The designed solution overcomes the limitations mentioned above, achieving an accuracy of 92.23% when activations and weights are quantized to 8 bits. The CNN has been profiled and deployed on STM32 microcontrollers, occupying 2.65 kB of memory, performing one inference in 1 ms, and consuming 32 μ J of energy per inference. These results are a significant improvement over the state-of-the-art in the literature in terms of inference time, memory occupation, and energy consumption per inference.

Object Classification using Ultra Low Resolution Time-of-Flight Sensor and Tiny Convolutional Neural Network

Fasolino A.;Vitolo P.;Liguori R.;Di Benedetto L.;Rubino A.;Licciardo G. D.;
2024

Abstract

Ultra-Low-Resolution (ULR) Time-of-Flight (ToF) sensors are utilized in combination with higher-resolution image sensors in several industrial applications, such as object recognition and accurate auto-focus. However, the heterogeneous combination of these sensors and the complex post-processing systems, makes it challenging to meet the requirements of compactness, lightness, and aesthetics in edge devices used in industrial and consumer applications, such as smart visors and glasses. This work investigates the feasibility of using ULR ToF as the only sensing element in object recognition systems, combined with machine learning techniques. The outcome of this investigation resulted in a very compact system based on the STMicroelectronics VL53L8CX 8 × 8 pixel ToF sensor, coupled with a custom Convolutional Neural Network (CNN) capable of classifying four objects even under partial occlusion conditions. The designed solution overcomes the limitations mentioned above, achieving an accuracy of 92.23% when activations and weights are quantized to 8 bits. The CNN has been profiled and deployed on STM32 microcontrollers, occupying 2.65 kB of memory, performing one inference in 1 ms, and consuming 32 μ J of energy per inference. These results are a significant improvement over the state-of-the-art in the literature in terms of inference time, memory occupation, and energy consumption per inference.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11386/4882572
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact