In this paper we present a vision based method for counting the number of persons which cross a virtual line. The method analyzes the video stream acquired by a camera mounted in a zenithal position with respect to the counting line, allowing to determine the number of persons that cross the virtual line and providing the crossing direction for each person. The proposed approach has been specifically designed to achieve high accuracy and computational efficiency, so as to allow its adoption in real scenarios. An extensive evaluation of the method has been carried out taking into account the main factors that may impact on the counting performance and, in particular, the acquisition technology (traditional RGB camera and depth sensor), the installation scenario (indoor and outdoor), the density of the people flow (isolated people and groups of persons), the acquisition frame rate, and the image resolution. We have also analyzed the combination of the outputs obtained from the RGB and depth sensors as a way to improve the counting performance. The experimental results confirm the effectiveness of the proposed method, especially when combining RGB and depth information, and the tests over three different CPU architectures demonstrate the possibility of deploying the method both on high-end servers for processing in parallel a large number of video streams and on low power CPUs as those embedded on commercial smart cameras.

Counting people by RGB or depth overhead cameras

DEL PIZZO, LUCA;FOGGIA, PASQUALE;GRECO, ANTONIO;PERCANNELLA, Gennaro;VENTO, Mario
2016

Abstract

In this paper we present a vision based method for counting the number of persons which cross a virtual line. The method analyzes the video stream acquired by a camera mounted in a zenithal position with respect to the counting line, allowing to determine the number of persons that cross the virtual line and providing the crossing direction for each person. The proposed approach has been specifically designed to achieve high accuracy and computational efficiency, so as to allow its adoption in real scenarios. An extensive evaluation of the method has been carried out taking into account the main factors that may impact on the counting performance and, in particular, the acquisition technology (traditional RGB camera and depth sensor), the installation scenario (indoor and outdoor), the density of the people flow (isolated people and groups of persons), the acquisition frame rate, and the image resolution. We have also analyzed the combination of the outputs obtained from the RGB and depth sensors as a way to improve the counting performance. The experimental results confirm the effectiveness of the proposed method, especially when combining RGB and depth information, and the tests over three different CPU architectures demonstrate the possibility of deploying the method both on high-end servers for processing in parallel a large number of video streams and on low power CPUs as those embedded on commercial smart cameras.
File in questo prodotto:
File Dimensione Formato  
manuscript.pdf

embargo fino al 01/01/2019

Descrizione: Articolo principale
Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza: Creative commons
Dimensione 1.7 MB
Formato Adobe PDF
1.7 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11386/4682395
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 57
  • ???jsp.display-item.citation.isi??? 43
social impact