Abstract
A sound signal produces a unique texture which can be visualized using a spectrogram image and analyzed for automatic sound recognition. In this paper, we explore the use of a well-known image texture analysis technique called the gray-level co-occurrence matrix (GLCM) for sound recognition in an audio surveillance application. The GLCM captures the distribution of co-occurring values at a given offset. Unlike most other similar research which derive features from the GLCM, we use the matrix values itself to form the feature vector with analysis carried out in subbands. When compared to a baseline feature from related work, the proposed spectrogram image texture feature (SITF) gives marginally lower results under clean and high signal-to-noise ratio (SNR) conditions but significantly better results are achieved at low SNR, where the baseline feature was seen to be less effective.
Original language | English |
---|---|
Title of host publication | 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Proceedings |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 1956-1960 |
Number of pages | 5 |
Volume | 2015-August |
ISBN (Electronic) | 9781467369978 |
DOIs | |
Publication status | Published - 1 Jan 2015 |
Externally published | Yes |
Event | 40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Brisbane, Australia Duration: 19 Apr 2014 → 24 Apr 2014 |
Conference
Conference | 40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 |
---|---|
Country/Territory | Australia |
City | Brisbane |
Period | 19/04/14 → 24/04/14 |
Keywords
- Audio surveillance
- gray-level cooccurrence matrix
- sound recognition
- spectrogram image texture feature
- support vector machine