Multitask Noisy Speech Enhancement System www.denoise.net |
||||
Overview Example Restorer - Speech band equalizer - Dynamics processing - Noise gate - Signal level limiter - Clipping restoration - Noise reduction - Noise whitening - Blind deconvolution - Spectrum analyser - Time stretching - Spectral expander - Fourier corrector - Neural network corrector - Decorrelation - Joint approximation - Homomorphic approximation - Reverberation Recorder Browser - Synchronisation - Normalisation Contact info |
Noise reductionThe background noise is the most common factor degrading the quality and intelligibility of speech in recordings. The noise reduction module intends to lower the noise level without affecting the speech signal quality. This module is based on the spectral subtraction performed independently in the frequency bands corresponding to the auditory critical bands. The spectral subtraction method is a simple and effective method of noise reduction. In this method, an average signal spectrum and average noise spectrum are estimated in parts of the recording and subtracted from each other, so that average signal-to-noise ratio (SNR) is improved. It is assumed that the signal is distorted by a wide-band, stationary, additive noise, the noise estimate is the same during the analysis and the restoration and the phase is the same in the original and restored signal. The noisy signal y(m) is a sum of the desired signal x(m) and the noise n(m): y(m) = x(m) + n(m) In the frequency domain, this may be denoted as: Y(jω) = X(jω) + N(jω) => X(jω) = Y(jω) - N(jω) where Y(jω), X(jω), N(jω) are Fourier transforms of y(m), x(m), n(m), respectively. The statistic parameters of the noise are not known, thus the noise and the speech signal are replaced by their estimates:
The noise spectrum estimate
where
where The spectral subtraction error may be defined as:
This error degrades the signal quality, introducing the distortion known as residual noise or musical noise. The error is a function of expected
Therefore, the longer noise section is used in analysis, the more accurate the noise estimate is. The signal-to-noise ratio may be defined in frequency domain as SNR a priori (for clean signal) or SNR a posteriori (for noisy signal). SNR in k-th frame is given by:
During the restoration process, the clean signal is not known, hence the SNR a priori value has to be estimated. Using the Gaussian model, optimal SNR in k-th frame may be defined as:
where P(x) is:
Calculation of the averaged noise estimate Spectral subtraction method Before the restoration process is started, a part of the recording containing only noise (not shorter than 1 second) has to be selected in the main window of the Restorer application. The window of the noise reduction module enables user to set reduction coefficient γ in each of the 11 frequency bands and the smoothing coefficient λ. It is recommended to start the processing by setting all reduction coefficients to the middle value. During the restoration, the reduction coefficients should be reduced in bands where the speech signal becomes distorted and increased in the bands where noise level is too high.
| |||
© 2004 Multimedia Systems Department, Gdansk University of Technology and Air Force Academy in Deblin |