Redução de ruído em sinais de voz usando curvas especializadas de modificação dos coeficientes da transformada em co-seno. / Speech denoising by softsoft thresholding.
AUTOR(ES)
Irineu Antunes Júnior
DATA DE PUBLICAÇÃO
2006
RESUMO
Many noise-reduction methods are based on the possibility of representing the clean signal as a reduced number of coefficients of a block transform, so that cancelling coefficients below a certain thresholding level will produce an enhanced reconstructed signal. It is necessary to assume that the clean signal has a sparse representation, while the noise energy is spread over all coefficients. The main drawback of those methods is the speech distortion introduced by eliminating small magnitude coefficients, and the presence of artifacts (musical noise) produced by isolated noisy coefficients randomly crossing the thresholding level. Based on the observation that the speech coefficient histogram has many important coefficients close to origin, we propose a custom thresholding function to perform noise reduction in speech signals corrupted by AWGN. This function, called SoftSoft, has two thresholding levels: a lower level adjusted to reduce speech distortion, and a higher level adjusted to remove noise. The joint optimal values can be determined by minimizing the resulting mean square error (MSE). We also verify that this new thresholding function leads to a lower MSE than the well-known Soft and Hard-thresholding functions, which employ only a higher thresholding level. Although the improvement in terms of MSE is not expressive, a perceptual distortion measure (the log-spectral distance, LSD) is employed to prove the higher performance of the proposed thresholding scheme.
ASSUNTO(S)
speech denoising processamento digital de voz estimação não-paramétrica digital speech processing non-parametric speech signal estimation redução de ruído em sinal de voz
Documentos Relacionados
- Redução de ruido em sinais de voz nos sistemas radio moveis veiculares
- RECONOCIMIENTO DE VOZ EN PRESCENCIA DE RUIDO
- Redução adaptativa de eco e de ruído para terminais viva-voz.
- Melhoria da qualidade de sinais de fala degradados por ruído através da utilização de sinais sintetizados.
- Análise de granulometria baseada na energia dos coeficientes da Transformada Wavelet