Abstract
In this work, we use the subband intensity histogram values extracted from the spectrogram image of sound signals to form the feature vector for sound classification in an audio surveillance application. We propose two features based on this approach. Firstly, we extract the histogram features from the short time Fourier transform spectrogram image of sound signals, which we refer as the spectral histogram feature (SHF). Secondly, we apply the mel-filter to the spectrogram image before histogram feature extraction which we refer as the mel-spectral histogram feature (MSHF). When compared to baseline features from similar work, the SHF was shown to give significantly improved results in low SNR conditions with a higher overall classification performance. In addition, the MSHF produced even better results than the SHF with the added advantage of a lower feature dimension.
Original language | English |
---|---|
Title of host publication | 2015 IEEE International Conference on Digital Signal Processing, DSP 2015 |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 432-435 |
Number of pages | 4 |
Volume | 2015-September |
ISBN (Electronic) | 9781479980581 |
DOIs | |
Publication status | Published - 9 Sept 2015 |
Externally published | Yes |
Event | IEEE International Conference on Digital Signal Processing, DSP 2015 - Singapore, Singapore Duration: 21 Jul 2015 → 24 Jul 2015 |
Conference
Conference | IEEE International Conference on Digital Signal Processing, DSP 2015 |
---|---|
Country/Territory | Singapore |
City | Singapore |
Period | 21/07/15 → 24/07/15 |
Keywords
- audio surveillance
- mel-filter
- sound recognition
- spectral histogram feature
- support vector machine