TW201614641A - Method and apparatus for speech enhancement based on source separation

Info

Publication number: TW201614641A
Application number: TW104128192A
Authority: TW
Inventors: Dalia Elbadawy; Alexey Ozerov; Quang Khanh Ngoc Duong
Original assignee: Thomson Licensing
Priority date: 2014-09-30
Filing date: 2015-08-27
Publication date: 2016-04-16
Also published as: WO2016050725A1

Abstract

The present embodiments provide speech enhancement based on source separation techniques. Specifically, we use a universal spectral model for speech, and we train the spectral model for noise and the activations for speech and noise based on the universal spectral model for speech and the input noisy speech. We formulate the optimization problem using a cost function that includes a divergence function and a sparsity penalty function, wherein the penalty function is based on the notion of relative group sparsity. The sparsity penalty function includes two parts: a sparsity-promoting part for the groups (so that the activations for some groups become zero) and an anti-sparsity-promoting part for the whole activation matrix corresponding to the speech model (so that the activations for speech as a whole do not become zero). Based on the universal spectral model for speech, the spectral model for noise, and the activations for speech and noise, we can estimate the speech and noise included in the input noisy speech.
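
The procedure summarized in the abstract can be illustrated with a minimal sketch. The abstract does not specify the factorization, the divergence, or the exact penalty, so everything below is an assumption made for illustration: a non-negative matrix factorization V ≈ W H with a fixed speech dictionary `W_speech`, a Kullback-Leibler divergence with heuristic multiplicative updates, a penalty of the form sum over groups of log(eps + ||H_g||_1) minus gamma * G * log(||H_speech||_1), and a Wiener-like mask for the final estimate. The function name `enhance` and all parameter names (`group_sizes`, `n_noise`, `lam`, `gamma`) are hypothetical.

```python
import numpy as np

def enhance(V, W_speech, group_sizes, n_noise=4, lam=0.1, gamma=1.0,
            n_iter=50, eps=1e-9, seed=0):
    """Illustrative sketch only; not the patented implementation.

    V           : (F, T) non-negative spectrogram of the noisy speech.
    W_speech    : (F, K_s) fixed universal speech dictionary, columns
                  ordered group by group.
    group_sizes : number of columns in each group (sums to K_s).
    """
    rng = np.random.default_rng(seed)
    F, T = V.shape
    K_s = W_speech.shape[1]
    assert sum(group_sizes) == K_s
    G = len(group_sizes)
    bounds = np.cumsum([0] + list(group_sizes))

    # Noise dictionary and all activations are learned from the input.
    W_noise = rng.random((F, n_noise)) + eps
    H = rng.random((K_s + n_noise, T)) + eps
    ones = np.ones_like(V)

    for _ in range(n_iter):
        W = np.hstack([W_speech, W_noise])
        V_hat = W @ H + eps

        # Assumed penalty gradient, split into positive and negative parts.
        P = np.zeros_like(H)  # sparsity-promoting part, one value per group
        Q = np.zeros_like(H)  # anti-sparsity part, whole speech activations
        for g in range(G):
            a, b = bounds[g], bounds[g + 1]
            P[a:b] = 1.0 / (eps + H[a:b].sum())
        Q[:K_s] = G * gamma / (H[:K_s].sum() + eps)

        # Heuristic multiplicative update of the activations (KL divergence).
        H *= (W.T @ (V / V_hat) + lam * Q) / (W.T @ ones + lam * P + eps)

        # Update only the noise dictionary; the speech dictionary stays fixed.
        V_hat = W @ H + eps
        H_n = H[K_s:]
        W_noise *= ((V / V_hat) @ H_n.T) / (ones @ H_n.T + eps)

    # Wiener-like masking to split the input into speech and noise estimates.
    W = np.hstack([W_speech, W_noise])
    V_hat = W @ H + eps
    S = (W_speech @ H[:K_s]) / V_hat * V
    N = V - S
    return S, N, W_noise, H
```

In this sketch the noise rows of `H` receive no penalty, which reflects the abstract's statement that the anti-sparsity part applies to the activation matrix corresponding to the speech model. The weight `lam` trades reconstruction accuracy against group sparsity and would need tuning in practice.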