An ensemble of rejecting classiﬁers for anomaly detection of audio events

Conte, Donatello; Foggia, Pasquale; Percannella, Gennaro; Saggese, Alessia; Vento, Mario

doi:10.1109/AVSS.2012.9

Audio analytic systems are receiving an increasing interest in the scientiﬁc community, not only as stand alone systems for the automatic detection of abnormal events by the interpretation of the audio track, but also in conjunction with video analytics tools for enforcing the evidence of anomaly detection. In this paper we present an automatic recognizer of a set of abnormal audio events that works by extracting suitable features from the signals obtained by microphones installed into a surveilled area, and by classifying them using two classiﬁers that operate at different time resolutions. An original aspect of the proposed system is the estimation of the reliability of each response of the individual classiﬁers. In this way, each classiﬁer is able to reject the samples having an overall reliability below a threshold. This approach allows our system to combine only reliable decisions, so increasing the overall performance of the method. The system has been tested on a large dataset of samples acquired from real world scenarios; the audio classes of interests are represented by gunshot, scream and glass breaking in addition to the background sounds. The preliminary results obtained encourage further research in this direction.