In this paper we propose a novel method for the detection of audio events for surveillance applications. The method is based on the bag of words approach, adapted to deal with the specific issues of audio surveillance: the need to recognize both short and long sounds, the presence of a significant noise level and of superimposed background sounds of intensity comparable to the audio events to be detected. In order to test the proposed method in complex, realistic scenarios, we have built a large, publicly available dataset of audio events. The dataset has allowed us to evaluate the robustness of our method with respect to varying levels of the Signal-to-Noise Ratio; the experimentation has confirmed its applicability in real world conditions, and has shown a significant performance improvement with respect to other methods from the literature.
|Titolo:||Reliable Detection of Audio Events in Highly Noisy Environments|
|Data di pubblicazione:||2015|
|Appare nelle tipologie:||1.1.1 Articolo su rivista con DOI|