TY - JOUR
T1 - A computational auditory masking model based on signal-dependent compression. I. Model description and performance analysis
AU - Buchholz, Jörg M.
AU - Mourjopoulos, John
PY - 2004/9
Y1 - 2004/9
N2 - In the present study, a novel computational model is proposed, which can efficiently emulate many aspects of the monaural signal processing of the auditory system. The model employs the concept of Signal-Dependent Compression (SDC), according to which the auditory system is assumed to perform a compression of the input signal's dynamics, controlled by an operating-point signal, which is directly derived from the input signal itself. This model is characterized by a simple, analytically defined, and flexible general structure, which employs a small number of free parameters and allows for a computationally very efficient realization. The model seems to be useful in describing various aspects of psychoacoustical masking (simultaneous and forward masking) and shows significant physiological relevance (e.g., auditory adaptation, peripheral compression at critical frequency). Considering possible audio and speech processing applications, it is shown that the SDC approach can be regarded as an extension of widely used dynamic range control systems and in addition, produces a modulation transfer function similar to the modulation spectrum of speech. In an accompanying paper [1] the model's performance in describing psychoacoustical masking data is investigated by comparing its output to results obtained from established psychoacoustical masking studies.
AB - In the present study, a novel computational model is proposed, which can efficiently emulate many aspects of the monaural signal processing of the auditory system. The model employs the concept of Signal-Dependent Compression (SDC), according to which the auditory system is assumed to perform a compression of the input signal's dynamics, controlled by an operating-point signal, which is directly derived from the input signal itself. This model is characterized by a simple, analytically defined, and flexible general structure, which employs a small number of free parameters and allows for a computationally very efficient realization. The model seems to be useful in describing various aspects of psychoacoustical masking (simultaneous and forward masking) and shows significant physiological relevance (e.g., auditory adaptation, peripheral compression at critical frequency). Considering possible audio and speech processing applications, it is shown that the SDC approach can be regarded as an extension of widely used dynamic range control systems and in addition, produces a modulation transfer function similar to the modulation spectrum of speech. In an accompanying paper [1] the model's performance in describing psychoacoustical masking data is investigated by comparing its output to results obtained from established psychoacoustical masking studies.
UR - http://www.scopus.com/inward/record.url?scp=6344221722&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:6344221722
VL - 90
SP - 873
EP - 886
JO - Acta Acustica united with Acustica
JF - Acta Acustica united with Acustica
SN - 1610-1928
IS - 5
ER -