EP2012725A2 - Suppression de bruit pour dispositif electronique equipe d'un microphone de champ lointain sur console - Google Patents

Suppression de bruit pour dispositif electronique equipe d'un microphone de champ lointain sur console

Info

Publication number
EP2012725A2
EP2012725A2 EP07759884A EP07759884A EP2012725A2 EP 2012725 A2 EP2012725 A2 EP 2012725A2 EP 07759884 A EP07759884 A EP 07759884A EP 07759884 A EP07759884 A EP 07759884A EP 2012725 A2 EP2012725 A2 EP 2012725A2
Authority
EP
European Patent Office
Prior art keywords
signal
narrow band
console
noise
instructions
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP07759884A
Other languages
German (de)
English (en)
Other versions
EP2012725A4 (fr
Inventor
Xiadong Mao
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Interactive Entertainment Inc
Original Assignee
Sony Computer Entertainment Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US11/381,728 external-priority patent/US7545926B2/en
Priority claimed from US11/381,725 external-priority patent/US7783061B2/en
Priority claimed from US11/429,047 external-priority patent/US8233642B2/en
Priority claimed from PCT/US2006/017483 external-priority patent/WO2006121896A2/fr
Priority claimed from US11/381,727 external-priority patent/US7697700B2/en
Priority claimed from US11/381,729 external-priority patent/US7809145B2/en
Priority claimed from US11/381,721 external-priority patent/US8947347B2/en
Priority claimed from US11/418,988 external-priority patent/US8160269B2/en
Priority claimed from US11/381,724 external-priority patent/US8073157B2/en
Priority claimed from US11/418,989 external-priority patent/US8139793B2/en
Application filed by Sony Computer Entertainment Inc filed Critical Sony Computer Entertainment Inc
Publication of EP2012725A2 publication Critical patent/EP2012725A2/fr
Publication of EP2012725A4 publication Critical patent/EP2012725A4/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M9/00Arrangements for interconnection not involving centralised switching
    • H04M9/08Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
    • H04M9/082Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic using echo cancellers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02163Only one microphone
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band

Definitions

  • Embodiments of the present invention are directed to audio signal processing and more particularly to removal of console noise in a device having a microphone located on a device console.
  • consoles that include various user controls and inputs.
  • many consumer electronic devices utilize a console that includes various user controls and inputs.
  • a microphone is typically a conventional omni-directional microphone having no preferred listening direction.
  • noise sources such as cooling fans, hard-disk drives, CD-ROM drives and digital video disk (DVD) drives.
  • a microphone located on the console would pick up noise from these sources. Since these noise sources are often located quite close to the microphone(s) they can greatly interfere with desired sound inputs, e.g., user voice commands. To address this problem techniques for filtering out noise from these sources have been implemented in these devices.
  • Embodiments of the invention are directed to reduction of noise in a device having a console with one or more microphones and a source of narrow band distributed noise located on the console.
  • a microphone signal containing a broad band distributed desired sound and narrow band distributed noise is divided amongst a plurality of frequency bins. For each frequency bin, it is determined whether a portion of the signal within the frequency bin belongs to a narrow band distribution characteristic of the source of narrow band noise located on the console. Any frequency bins containing portions of the signal belonging to the narrow band distribution are filtered to reduce the narrow band noise.
  • FIG. 1 is a schematic diagram of an electronic device according to an embodiment of the present invention.
  • FIG. 2 is a flow diagram of a method for reduction of noise in a device of the type shown in FIG. 1.
  • FIGs. 3A-3B are graphs of microphone signal as a function of frequency illustrating reduction of narrow band noise according to embodiments of the present invention.
  • FIGs. 4A-4B are graphs of microphone signals for different microphones as a function of frequency illustrating reduction of narrow band noise according to alternative embodiments of the present invention.
  • an electronic device 100 includes a console 102 having one or more microphones 104A, 104B.
  • the term console generally refers to a stand-alone unit containing electronic components that perform computation and/or signal processing functions.
  • the console may receive inputs from one or more input external devices, e.g., a joystick 106, and provide outputs to one or more output external devices such as a monitor 108.
  • the console 102 may include a central processor unit 110 and memory 112.
  • the console may include an optional fan 114 to provide cooling of the console components.
  • the console 102 may be a console for a video game system, such as a Sony PlayStation®, a cable television set top box, a digital video recorder, such as a TiVo® digital video recorder available from TiVo Inc. of Alviso, California.
  • a video game system such as a Sony PlayStation®
  • a cable television set top box such as a cable television set top box
  • a digital video recorder such as a TiVo® digital video recorder available from TiVo Inc. of Alviso, California.
  • the processor unit 110 and memory 112 may be coupled to each other via a system bus 116.
  • the microphones 104A, 104B may be coupled to the processor and/or memory through input/output (VO) elements 118.
  • VO generally refers to any program, operation or device that transfers data to or from the console 100 and to or from a peripheral device. Every data transfer may be regarded as an output from one device and an input into another.
  • the device 100 may include one or more additional peripheral units which may be internal to the console 102 or external to it.
  • Peripheral devices include input-only devices, such as keyboards and mouses, output-only devices, such as printers as well as devices such as a writable CD-ROM that can act as both an input and an output device.
  • peripheral device includes external devices, such as a mouse, keyboard, printer, monitor, microphone, game controller, camera, external Zip drive or scanner as well as internal devices, e.g., a disk drive 120 such as a CD-ROM drive, CD-R drive, hard disk drive or DVD drive, an internal modem other peripheral such as a flash memory reader/writer, hard drive.
  • the console includes at least one source of narrow-band distributed noise such as the disk drive 120.
  • Narrow band noise from the disk drive 120 may be filtered from digital signal data generated from microphone inputs X A OO, X B (O SO that desired sounds, e.g., voice, from a remote source 101 are not drowned out by the sound of the disk drive 120.
  • the narrow band noise may be characterized by a gamma distribution.
  • the desired sound from the source 101 is preferably characterized by a broad band probability density function distribution such as a Gaussian-distributed probability density function.
  • the memory 112 may contain coded instructions 113 that can be executed by the processor 110 and/or data 115 that facilitate removal of the narrow band disk drive noise.
  • the data 115 may include a distribution function generated from training data of many hours of recording of sounds from disk drive.
  • the distribution function may be stored in the form of a lookup table.
  • the coded instructions 113 may implement a method 200 for reducing narrow band noise in a device of the type shown in FIG. 1.
  • a signal from one or more of the console microphone input signals 104A, 104B is divided into frequency bins, as indicated at 202.
  • Dividing the signal into a plurality of frequency bins may include capturing a time-windowed portion of the signal (e.g., microphone signal X A (X)), converting the time- windowed portion to a frequency domain signal x(f) (e.g., using a fast Fourier transform) and dividing the frequency domain signal amongst the frequency bins.
  • a time-windowed portion of the signal e.g., microphone signal X A (X)
  • converting the time- windowed portion to a frequency domain signal x(f)
  • x(f) e.g., using a fast Fourier transform
  • approximately 32 ms of microphone data may be stored in a buffer for classification into frequency bins.
  • each frequency bin it is determined whether a portion of the signal within the frequency bin belongs to a narrow band distribution characteristic of the narrow band disk drive noise as indicated at 204. Any frequency bins containing portions of the signal belonging to the narrow band distribution are filtered from the input signal and indicated at 206.
  • the frequency domain signal x(f) may be regarded as a combination of a broadband signal 302 and a narrow band signal 304.
  • each bin contains a value corresponding to a portion of the broadband signal 302 and a portion of the narrow band signal 304.
  • the portion of the signal x(f) in a given frequency bin 306 due to the narrow band signal 304 may be estimated from the training data. This portion may be subtracted from the value in the frequency bin 306 to filter out the narrow band noise from that bin.
  • the narrow band signal 304 may be estimated as follows. First narrow band signal samples may be collected in a large volume to train its distribution model. Distribution models are widely known to those of skill in the pattern recognition arts, such as speech modeling. The distribution model for the narrow band signal 304 is similar to those used in speech modeling with a few exceptions. Specifically, unlike speech, which is considered broadband with a Gaussian distribution, the narrow band noise on in the narrow band signal 304 has a "Gamma" distribution density function. The distribution model is known as a "Gamma- Mixture-Model". Speech applications, such as speaker/language identification, by comparison usually use a "Gaussian-Mixture-Model". The two models are quite similar. The underlying distribution function is the only significant difference.
  • the model training procedure follows an "Estimate-Maximize” (EM) algorithm, which is widely available in speech modeling.
  • EM Estimatimate-Maximize
  • the EM algorithm is an iterative likelihood maximization method, which estimates a set of model parameters from a training data set.
  • a feature vector is generated directly from a logarithm of power-spectrum.
  • a speech model usually applies further compression, such as DCT or cepstrum-coeficient. This is because the signal of interest is narrow band, and band averaging that possibly has attenuation in broadband background is not desired.
  • the model is utilized to estimate a narrow-band noise power spectrum density (PSD).
  • PSD narrow-band noise power spectrum density
  • An algorithm for such a model may proceed as follows:
  • the signal x(t) is transformed from the time domain to the frequency domain.
  • X(k) fft(x(t)), where k is a frequency index.
  • a feature vector V(k) is obtained from the logarithm of power spectrum.
  • V(k) log(S yy (k))
  • feature Vector is a common term in pattern recognition. Essentially any pattern matching includes 1) a pre-trained model that defines the distribution in priori feature space, and 2) runtime observed feature vectors. The task is to match the feature vector against the model. Given a prior trained gamma ⁇ Model>, the narrow-band noise presence probability ⁇ P n (k)> may be obtained for this observed feature V(k).
  • the narrow-band noise PSD is adaptively updated:
  • the filtering may take advantage of the presence of two or more microphones 104A, 104B on the console 102. If there are two microphones 104A, 104B on the console 102 one of them (104B) may be closer to the disk drive than the other (104A). As a result there is a difference in the time of arrival of the noise from the disk drive 120 for the microphone input signals X A Q) and X ⁇ (t). The difference in time of arrival results in different frequency distributions for the input signals when they are frequency converted to X A (Q, X ⁇ (f) as illustrated in FIGs. 4A-4B.
  • the frequency distribution of broadband sound from remote a sources will not be significantly different for X A (Q, X B (Q- However the frequency distribution for the narrow band signal 304A from microphone 104A will be frequency shifted relative to the frequency distribution 304B from microphone 104B.
  • the narrow band noise contribution to the frequency bins 306 can be determined by generating a feature vector V(k) from the frequency domain signals X A (Q, X ⁇ (f) from the two microphones 104A, 104B.
  • a first feature vector V(k,A) is generated from the power spectrum S yy (k,A) for microphone 104A:
  • V(k,A) log(S yy (k,A))
  • a second feature vector V(k,B) is generated from the power spectrum S yy (k,B) for microphone 104B:
  • V(k,B) log(S yy (k,B))
  • V(k) is then obtained from a simple concatenation of V(k,A) and V(k,B)
  • V(k) [V(k,l), V(k,2)]
  • Embodiments of the present invention may be used as presented herein or in combination with other user input mechanisms and notwithstanding mechanisms that track or profile the angular direction or volume of sound and/or mechanisms that track the position of the object actively or passively, mechanisms using machine vision, combinations thereof and where the object tracked may include ancillary controls or buttons that manipulate feedback to the system and where such feedback may include but is not limited light emission from light sources, sound distortion means, or other suitable transmitters and modulators as well as controls, buttons, pressure pad, etc. that may influence the transmission or modulation of the same, encode state, and/or transmit commands from or to a device, including devices that are tracked by the system and whether such devices are part of, interacting with or influencing a system used in connection with embodiments of the present invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Telephone Function (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)

Abstract

L'invention concerne un procédé de réduction de bruit dans un dispositif équipé d'une console possédant un ou plusieurs microphones et une source de bruit répartie à bande étroite située sur la console. Un signal de microphone contenant un son souhaité réparti à large bande et un bruit réparti à bande étroite sont divisés parmi plusieurs bandes de fréquence. Pour chaque bande de fréquence, le procédé consiste à déterminer si une portion du signal située à l'intérieur de la bande de fréquence appartient à une caractéristique de répartition de bande étroite de la source du bruit à bande étroite située sur la console. Toutes les bandes de fréquence contenant des parties du signal appartenant à la répartition à bande étroite sont filtrées pour réduire le bruit à bande étroite.
EP07759884A 2006-05-04 2007-03-30 Suppression de bruit pour dispositif electronique equipe d'un microphone de champ lointain sur console Withdrawn EP2012725A4 (fr)

Applications Claiming Priority (11)

Application Number Priority Date Filing Date Title
PCT/US2006/017483 WO2006121896A2 (fr) 2005-05-05 2006-05-04 Ecoute selective de source sonore conjuguee a un traitement informatique interactif
US11/381,727 US7697700B2 (en) 2006-05-04 2006-05-04 Noise removal for electronic device with far field microphone on console
US11/381,729 US7809145B2 (en) 2006-05-04 2006-05-04 Ultra small microphone array
US11/381,728 US7545926B2 (en) 2006-05-04 2006-05-04 Echo and noise cancellation
US11/418,988 US8160269B2 (en) 2003-08-27 2006-05-04 Methods and apparatuses for adjusting a listening area for capturing sounds
US11/381,725 US7783061B2 (en) 2003-08-27 2006-05-04 Methods and apparatus for the targeted sound detection
US11/418,989 US8139793B2 (en) 2003-08-27 2006-05-04 Methods and apparatus for capturing audio signals based on a visual image
US11/381,724 US8073157B2 (en) 2003-08-27 2006-05-04 Methods and apparatus for targeted sound detection and characterization
US11/381,721 US8947347B2 (en) 2003-08-27 2006-05-04 Controlling actions in a video game unit
US11/429,047 US8233642B2 (en) 2003-08-27 2006-05-04 Methods and apparatuses for capturing an audio signal based on a location of the signal
PCT/US2007/065701 WO2007130766A2 (fr) 2006-05-04 2007-03-30 Suppression de bruit pour dispositif électronique équipé d'un microphone de champ lointain sur console

Publications (2)

Publication Number Publication Date
EP2012725A2 true EP2012725A2 (fr) 2009-01-14
EP2012725A4 EP2012725A4 (fr) 2011-10-12

Family

ID=56290936

Family Applications (2)

Application Number Title Priority Date Filing Date
EP07759872A Withdrawn EP2014132A4 (fr) 2006-05-04 2007-03-30 Annulation d'echo et de bruit
EP07759884A Withdrawn EP2012725A4 (fr) 2006-05-04 2007-03-30 Suppression de bruit pour dispositif electronique equipe d'un microphone de champ lointain sur console

Family Applications Before (1)

Application Number Title Priority Date Filing Date
EP07759872A Withdrawn EP2014132A4 (fr) 2006-05-04 2007-03-30 Annulation d'echo et de bruit

Country Status (3)

Country Link
EP (2) EP2014132A4 (fr)
JP (3) JP4476355B2 (fr)
WO (2) WO2007130766A2 (fr)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8738367B2 (en) 2009-03-18 2014-05-27 Nec Corporation Speech signal processing device
JP4964267B2 (ja) * 2009-04-03 2012-06-27 有限会社ケプストラム 適応フィルタ及びこれを有するエコーキャンセラ
JP2010249939A (ja) * 2009-04-13 2010-11-04 Sony Corp ノイズ低減装置、ノイズ判定方法
EP2858068A4 (fr) * 2012-05-31 2016-02-24 Toyota Motor Co Ltd Dispositif de détection de source audio, dispositif de génération de modèle de bruit, dispositif de réduction de bruit, dispositif d'estimation de direction de source audio, dispositif de détection de véhicule s'approchant et procédé de réduction de bruit
CN109166589B (zh) * 2018-08-13 2024-08-20 深圳市腾讯网络信息技术有限公司 应用声音抑制方法、装置、介质以及设备
WO2021126670A1 (fr) * 2019-12-18 2021-06-24 Dolby Laboratories Licensing Corporation Commande de taille de pas d'adaptation de filtre pour annulation d'écho
CN113689871A (zh) * 2020-05-19 2021-11-23 阿里巴巴集团控股有限公司 回声消除方法和装置
CN112017679B (zh) * 2020-08-05 2024-01-26 海尔优家智能科技(北京)有限公司 用于自适应滤波器系数更新的方法及装置、设备
CN115472175A (zh) * 2022-08-31 2022-12-13 海尔优家智能科技(北京)有限公司 音频资源的回声消除方法和装置、存储介质及电子装置

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4802227A (en) * 1987-04-03 1989-01-31 American Telephone And Telegraph Company Noise reduction processing arrangement for microphone arrays
US5550924A (en) * 1993-07-07 1996-08-27 Picturetel Corporation Reduction of background noise for speech enhancement
US6445801B1 (en) * 1997-11-21 2002-09-03 Sextant Avionique Method of frequency filtering applied to noise suppression in signals implementing a wiener filter
EP1445759A1 (fr) * 2003-02-10 2004-08-11 Siemens Aktiengesellschaft Méthode adaptée à l'usager pour modéliser le bruit de fond en reconnaissance de parole

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3135937B2 (ja) * 1991-05-16 2001-02-19 株式会社リコー 雑音除去装置
JP3110201B2 (ja) * 1993-04-16 2000-11-20 沖電気工業株式会社 ノイズ除去装置
US5806025A (en) * 1996-08-07 1998-09-08 U S West, Inc. Method and system for adaptive filtering of speech signals using signal-to-noise ratio to choose subband filter bank
SE9700772D0 (sv) * 1997-03-03 1997-03-03 Ericsson Telefon Ab L M A high resolution post processing method for a speech decoder
DE19806015C2 (de) * 1998-02-13 1999-12-23 Siemens Ag Verfahren zur Verbesserung der akustischen Rückhördämpfung in Freisprecheinrichtungen
US6263078B1 (en) * 1999-01-07 2001-07-17 Signalworks, Inc. Acoustic echo canceller with fast volume control compensation
JP2002537586A (ja) * 1999-02-18 2002-11-05 アンドレア エレクトロニクス コーポレイション 雑音を消去するためのシステム、方法及び装置
US6426979B1 (en) * 1999-04-29 2002-07-30 Legerity, Inc. Adaptation control algorithm for echo cancellation using signal-value based analysis
US6526139B1 (en) * 1999-11-03 2003-02-25 Tellabs Operations, Inc. Consolidated noise injection in a voice processing system
JP3358731B2 (ja) * 2000-04-24 2002-12-24 株式会社富建設 介護装置
US7139401B2 (en) * 2002-01-03 2006-11-21 Hitachi Global Storage Technologies B.V. Hard disk drive with self-contained active acoustic noise reduction
JP2003284181A (ja) * 2002-03-20 2003-10-03 Matsushita Electric Ind Co Ltd 集音装置
US6947549B2 (en) * 2003-02-19 2005-09-20 The Hong Kong Polytechnic University Echo canceller
US7885420B2 (en) * 2003-02-21 2011-02-08 Qnx Software Systems Co. Wind noise suppression system
JP4227529B2 (ja) * 2004-01-06 2009-02-18 パナソニック株式会社 周期性雑音抑圧装置
US7254535B2 (en) * 2004-06-30 2007-08-07 Motorola, Inc. Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system
DE602005020662D1 (de) * 2004-10-13 2010-05-27 Koninkl Philips Electronics Nv Echolöschung

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4802227A (en) * 1987-04-03 1989-01-31 American Telephone And Telegraph Company Noise reduction processing arrangement for microphone arrays
US5550924A (en) * 1993-07-07 1996-08-27 Picturetel Corporation Reduction of background noise for speech enhancement
US6445801B1 (en) * 1997-11-21 2002-09-03 Sextant Avionique Method of frequency filtering applied to noise suppression in signals implementing a wiener filter
EP1445759A1 (fr) * 2003-02-10 2004-08-11 Siemens Aktiengesellschaft Méthode adaptée à l'usager pour modéliser le bruit de fond en reconnaissance de parole

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
JONG WON SHIN ET AL: "Voice Activity Detection based on Generalized Gamma Distribution", 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - 18-23 MARCH 2005 - PHILADELPHIA, PA, USA, IEEE, PISCATAWAY, NJ, vol. 1, 18 March 2005 (2005-03-18), pages 781-784, XP010792154, DOI: DOI:10.1109/ICASSP.2005.1415230 ISBN: 978-0-7803-8874-1 *
MARTIN R ED - INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS: "Speech enhancement using MMSE short time spectral estimation with gamma distributed speech priors", 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS. (ICASSP). ORLANDO, FL, MAY 13 - 17, 2002; [IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP)], NEW YORK, NY : IEEE, US, vol. 1, 13 May 2002 (2002-05-13), pages I-253, XP010804742, ISBN: 978-0-7803-7402-7 *
See also references of WO2007130766A2 *

Also Published As

Publication number Publication date
WO2007130766A3 (fr) 2008-09-04
WO2007130766A2 (fr) 2007-11-15
WO2007130765A2 (fr) 2007-11-15
EP2014132A2 (fr) 2009-01-14
JP2010171985A (ja) 2010-08-05
EP2014132A4 (fr) 2013-01-02
JP4476355B2 (ja) 2010-06-09
JP4833343B2 (ja) 2011-12-07
JP4866958B2 (ja) 2012-02-01
JP2009535997A (ja) 2009-10-01
EP2012725A4 (fr) 2011-10-12
JP2009535996A (ja) 2009-10-01
WO2007130765A3 (fr) 2008-12-18

Similar Documents

Publication Publication Date Title
US7697700B2 (en) Noise removal for electronic device with far field microphone on console
WO2007130766A2 (fr) Suppression de bruit pour dispositif électronique équipé d'un microphone de champ lointain sur console
US9286907B2 (en) Smart rejecter for keyboard click noise
JP4376902B2 (ja) 音声入力システム
CN102938254B (zh) 一种语音信号增强系统和方法
Martin Speech enhancement based on minimum mean-square error estimation and supergaussian priors
US7295972B2 (en) Method and apparatus for blind source separation using two sensors
JP5587396B2 (ja) 信号分離のためのシステム、方法、および装置
JP5452655B2 (ja) 音声状態モデルを使用したマルチセンサ音声高品質化
JP4897666B2 (ja) 音声妨害を検出および除去する方法および装置
US7065487B2 (en) Speech recognition method, program and apparatus using multiple acoustic models
US8462969B2 (en) Systems and methods for own voice recognition with adaptations for noise robustness
Mallawaarachchi et al. Spectrogram denoising and automated extraction of the fundamental frequency variation of dolphin whistles
Gerkmann et al. Spectral masking and filtering
CN104021798A (zh) 用于通过具有可变频谱增益和可动态调制的硬度的算法对音频信号隔音的方法
KR20190130533A (ko) 음성 검출기를 구비한 보청기 및 그 방법
JP6888627B2 (ja) 情報処理装置、情報処理方法及びプログラム
Al-Karawi et al. Model selection toward robustness speaker verification in reverberant conditions
KR20110061781A (ko) 실시간 잡음 추정에 기반하여 잡음을 제거하는 음성 처리 장치 및 방법
CN110858485B (zh) 语音增强方法、装置、设备及存储介质
Gomez et al. Robustness to speaker position in distant-talking automatic speech recognition
KR101568282B1 (ko) 클러스터 기반 손실 특징 복원 알고리즘을 위한 마스크 추정 방법 및 장치
Li Robust speaker recognition by means of acoustic transmission channel matching: An acoustic parameter estimation approach
Witkowski et al. Speaker Recognition from Distance Using X-Vectors with Reverberation-Robust Features
CN118486318A (zh) 一种户外直播环境杂音消除方法、介质及系统

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20081107

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC MT NL PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA HR MK RS

DAX Request for extension of the european patent (deleted)
A4 Supplementary search report drawn up and despatched

Effective date: 20110909

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 21/02 20060101AFI20110905BHEP

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: SONY COMPUTER ENTERTAINMENT INCORPORATED

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: SONY COMPUTER ENTERTAINMENT INC.

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

18W Application withdrawn

Effective date: 20130905