EP2012725A2 - Suppression de bruit pour dispositif electronique equipe d'un microphone de champ lointain sur console - Google Patents
Suppression de bruit pour dispositif electronique equipe d'un microphone de champ lointain sur consoleInfo
- Publication number
- EP2012725A2 EP2012725A2 EP07759884A EP07759884A EP2012725A2 EP 2012725 A2 EP2012725 A2 EP 2012725A2 EP 07759884 A EP07759884 A EP 07759884A EP 07759884 A EP07759884 A EP 07759884A EP 2012725 A2 EP2012725 A2 EP 2012725A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- signal
- narrow band
- console
- noise
- instructions
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 230000009467 reduction Effects 0.000 title claims abstract description 10
- 238000009826 distribution Methods 0.000 claims abstract description 32
- 238000000034 method Methods 0.000 claims description 26
- 239000013598 vector Substances 0.000 claims description 19
- 230000006870 function Effects 0.000 claims description 10
- 238000001914 filtration Methods 0.000 claims description 5
- 230000008901 benefit Effects 0.000 description 11
- 238000012545 processing Methods 0.000 description 6
- 238000001228 spectrum Methods 0.000 description 6
- 230000002093 peripheral effect Effects 0.000 description 5
- 238000012549 training Methods 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 3
- 238000005315 distribution function Methods 0.000 description 3
- 238000012935 Averaging Methods 0.000 description 2
- 238000001816 cooling Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000002452 interceptive effect Effects 0.000 description 2
- 238000003909 pattern recognition Methods 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000011897 real-time detection Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M9/00—Arrangements for interconnection not involving centralised switching
- H04M9/08—Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
- H04M9/082—Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic using echo cancellers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02163—Only one microphone
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
Definitions
- Embodiments of the present invention are directed to audio signal processing and more particularly to removal of console noise in a device having a microphone located on a device console.
- consoles that include various user controls and inputs.
- many consumer electronic devices utilize a console that includes various user controls and inputs.
- a microphone is typically a conventional omni-directional microphone having no preferred listening direction.
- noise sources such as cooling fans, hard-disk drives, CD-ROM drives and digital video disk (DVD) drives.
- a microphone located on the console would pick up noise from these sources. Since these noise sources are often located quite close to the microphone(s) they can greatly interfere with desired sound inputs, e.g., user voice commands. To address this problem techniques for filtering out noise from these sources have been implemented in these devices.
- Embodiments of the invention are directed to reduction of noise in a device having a console with one or more microphones and a source of narrow band distributed noise located on the console.
- a microphone signal containing a broad band distributed desired sound and narrow band distributed noise is divided amongst a plurality of frequency bins. For each frequency bin, it is determined whether a portion of the signal within the frequency bin belongs to a narrow band distribution characteristic of the source of narrow band noise located on the console. Any frequency bins containing portions of the signal belonging to the narrow band distribution are filtered to reduce the narrow band noise.
- FIG. 1 is a schematic diagram of an electronic device according to an embodiment of the present invention.
- FIG. 2 is a flow diagram of a method for reduction of noise in a device of the type shown in FIG. 1.
- FIGs. 3A-3B are graphs of microphone signal as a function of frequency illustrating reduction of narrow band noise according to embodiments of the present invention.
- FIGs. 4A-4B are graphs of microphone signals for different microphones as a function of frequency illustrating reduction of narrow band noise according to alternative embodiments of the present invention.
- an electronic device 100 includes a console 102 having one or more microphones 104A, 104B.
- the term console generally refers to a stand-alone unit containing electronic components that perform computation and/or signal processing functions.
- the console may receive inputs from one or more input external devices, e.g., a joystick 106, and provide outputs to one or more output external devices such as a monitor 108.
- the console 102 may include a central processor unit 110 and memory 112.
- the console may include an optional fan 114 to provide cooling of the console components.
- the console 102 may be a console for a video game system, such as a Sony PlayStation®, a cable television set top box, a digital video recorder, such as a TiVo® digital video recorder available from TiVo Inc. of Alviso, California.
- a video game system such as a Sony PlayStation®
- a cable television set top box such as a cable television set top box
- a digital video recorder such as a TiVo® digital video recorder available from TiVo Inc. of Alviso, California.
- the processor unit 110 and memory 112 may be coupled to each other via a system bus 116.
- the microphones 104A, 104B may be coupled to the processor and/or memory through input/output (VO) elements 118.
- VO generally refers to any program, operation or device that transfers data to or from the console 100 and to or from a peripheral device. Every data transfer may be regarded as an output from one device and an input into another.
- the device 100 may include one or more additional peripheral units which may be internal to the console 102 or external to it.
- Peripheral devices include input-only devices, such as keyboards and mouses, output-only devices, such as printers as well as devices such as a writable CD-ROM that can act as both an input and an output device.
- peripheral device includes external devices, such as a mouse, keyboard, printer, monitor, microphone, game controller, camera, external Zip drive or scanner as well as internal devices, e.g., a disk drive 120 such as a CD-ROM drive, CD-R drive, hard disk drive or DVD drive, an internal modem other peripheral such as a flash memory reader/writer, hard drive.
- the console includes at least one source of narrow-band distributed noise such as the disk drive 120.
- Narrow band noise from the disk drive 120 may be filtered from digital signal data generated from microphone inputs X A OO, X B (O SO that desired sounds, e.g., voice, from a remote source 101 are not drowned out by the sound of the disk drive 120.
- the narrow band noise may be characterized by a gamma distribution.
- the desired sound from the source 101 is preferably characterized by a broad band probability density function distribution such as a Gaussian-distributed probability density function.
- the memory 112 may contain coded instructions 113 that can be executed by the processor 110 and/or data 115 that facilitate removal of the narrow band disk drive noise.
- the data 115 may include a distribution function generated from training data of many hours of recording of sounds from disk drive.
- the distribution function may be stored in the form of a lookup table.
- the coded instructions 113 may implement a method 200 for reducing narrow band noise in a device of the type shown in FIG. 1.
- a signal from one or more of the console microphone input signals 104A, 104B is divided into frequency bins, as indicated at 202.
- Dividing the signal into a plurality of frequency bins may include capturing a time-windowed portion of the signal (e.g., microphone signal X A (X)), converting the time- windowed portion to a frequency domain signal x(f) (e.g., using a fast Fourier transform) and dividing the frequency domain signal amongst the frequency bins.
- a time-windowed portion of the signal e.g., microphone signal X A (X)
- converting the time- windowed portion to a frequency domain signal x(f)
- x(f) e.g., using a fast Fourier transform
- approximately 32 ms of microphone data may be stored in a buffer for classification into frequency bins.
- each frequency bin it is determined whether a portion of the signal within the frequency bin belongs to a narrow band distribution characteristic of the narrow band disk drive noise as indicated at 204. Any frequency bins containing portions of the signal belonging to the narrow band distribution are filtered from the input signal and indicated at 206.
- the frequency domain signal x(f) may be regarded as a combination of a broadband signal 302 and a narrow band signal 304.
- each bin contains a value corresponding to a portion of the broadband signal 302 and a portion of the narrow band signal 304.
- the portion of the signal x(f) in a given frequency bin 306 due to the narrow band signal 304 may be estimated from the training data. This portion may be subtracted from the value in the frequency bin 306 to filter out the narrow band noise from that bin.
- the narrow band signal 304 may be estimated as follows. First narrow band signal samples may be collected in a large volume to train its distribution model. Distribution models are widely known to those of skill in the pattern recognition arts, such as speech modeling. The distribution model for the narrow band signal 304 is similar to those used in speech modeling with a few exceptions. Specifically, unlike speech, which is considered broadband with a Gaussian distribution, the narrow band noise on in the narrow band signal 304 has a "Gamma" distribution density function. The distribution model is known as a "Gamma- Mixture-Model". Speech applications, such as speaker/language identification, by comparison usually use a "Gaussian-Mixture-Model". The two models are quite similar. The underlying distribution function is the only significant difference.
- the model training procedure follows an "Estimate-Maximize” (EM) algorithm, which is widely available in speech modeling.
- EM Estimatimate-Maximize
- the EM algorithm is an iterative likelihood maximization method, which estimates a set of model parameters from a training data set.
- a feature vector is generated directly from a logarithm of power-spectrum.
- a speech model usually applies further compression, such as DCT or cepstrum-coeficient. This is because the signal of interest is narrow band, and band averaging that possibly has attenuation in broadband background is not desired.
- the model is utilized to estimate a narrow-band noise power spectrum density (PSD).
- PSD narrow-band noise power spectrum density
- An algorithm for such a model may proceed as follows:
- the signal x(t) is transformed from the time domain to the frequency domain.
- X(k) fft(x(t)), where k is a frequency index.
- a feature vector V(k) is obtained from the logarithm of power spectrum.
- V(k) log(S yy (k))
- feature Vector is a common term in pattern recognition. Essentially any pattern matching includes 1) a pre-trained model that defines the distribution in priori feature space, and 2) runtime observed feature vectors. The task is to match the feature vector against the model. Given a prior trained gamma ⁇ Model>, the narrow-band noise presence probability ⁇ P n (k)> may be obtained for this observed feature V(k).
- the narrow-band noise PSD is adaptively updated:
- the filtering may take advantage of the presence of two or more microphones 104A, 104B on the console 102. If there are two microphones 104A, 104B on the console 102 one of them (104B) may be closer to the disk drive than the other (104A). As a result there is a difference in the time of arrival of the noise from the disk drive 120 for the microphone input signals X A Q) and X ⁇ (t). The difference in time of arrival results in different frequency distributions for the input signals when they are frequency converted to X A (Q, X ⁇ (f) as illustrated in FIGs. 4A-4B.
- the frequency distribution of broadband sound from remote a sources will not be significantly different for X A (Q, X B (Q- However the frequency distribution for the narrow band signal 304A from microphone 104A will be frequency shifted relative to the frequency distribution 304B from microphone 104B.
- the narrow band noise contribution to the frequency bins 306 can be determined by generating a feature vector V(k) from the frequency domain signals X A (Q, X ⁇ (f) from the two microphones 104A, 104B.
- a first feature vector V(k,A) is generated from the power spectrum S yy (k,A) for microphone 104A:
- V(k,A) log(S yy (k,A))
- a second feature vector V(k,B) is generated from the power spectrum S yy (k,B) for microphone 104B:
- V(k,B) log(S yy (k,B))
- V(k) is then obtained from a simple concatenation of V(k,A) and V(k,B)
- V(k) [V(k,l), V(k,2)]
- Embodiments of the present invention may be used as presented herein or in combination with other user input mechanisms and notwithstanding mechanisms that track or profile the angular direction or volume of sound and/or mechanisms that track the position of the object actively or passively, mechanisms using machine vision, combinations thereof and where the object tracked may include ancillary controls or buttons that manipulate feedback to the system and where such feedback may include but is not limited light emission from light sources, sound distortion means, or other suitable transmitters and modulators as well as controls, buttons, pressure pad, etc. that may influence the transmission or modulation of the same, encode state, and/or transmit commands from or to a device, including devices that are tracked by the system and whether such devices are part of, interacting with or influencing a system used in connection with embodiments of the present invention.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Human Computer Interaction (AREA)
- Quality & Reliability (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
- Circuit For Audible Band Transducer (AREA)
- Telephone Function (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
Abstract
Applications Claiming Priority (11)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2006/017483 WO2006121896A2 (fr) | 2005-05-05 | 2006-05-04 | Ecoute selective de source sonore conjuguee a un traitement informatique interactif |
US11/381,727 US7697700B2 (en) | 2006-05-04 | 2006-05-04 | Noise removal for electronic device with far field microphone on console |
US11/381,729 US7809145B2 (en) | 2006-05-04 | 2006-05-04 | Ultra small microphone array |
US11/381,728 US7545926B2 (en) | 2006-05-04 | 2006-05-04 | Echo and noise cancellation |
US11/418,988 US8160269B2 (en) | 2003-08-27 | 2006-05-04 | Methods and apparatuses for adjusting a listening area for capturing sounds |
US11/381,725 US7783061B2 (en) | 2003-08-27 | 2006-05-04 | Methods and apparatus for the targeted sound detection |
US11/418,989 US8139793B2 (en) | 2003-08-27 | 2006-05-04 | Methods and apparatus for capturing audio signals based on a visual image |
US11/381,724 US8073157B2 (en) | 2003-08-27 | 2006-05-04 | Methods and apparatus for targeted sound detection and characterization |
US11/381,721 US8947347B2 (en) | 2003-08-27 | 2006-05-04 | Controlling actions in a video game unit |
US11/429,047 US8233642B2 (en) | 2003-08-27 | 2006-05-04 | Methods and apparatuses for capturing an audio signal based on a location of the signal |
PCT/US2007/065701 WO2007130766A2 (fr) | 2006-05-04 | 2007-03-30 | Suppression de bruit pour dispositif électronique équipé d'un microphone de champ lointain sur console |
Publications (2)
Publication Number | Publication Date |
---|---|
EP2012725A2 true EP2012725A2 (fr) | 2009-01-14 |
EP2012725A4 EP2012725A4 (fr) | 2011-10-12 |
Family
ID=56290936
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP07759872A Withdrawn EP2014132A4 (fr) | 2006-05-04 | 2007-03-30 | Annulation d'echo et de bruit |
EP07759884A Withdrawn EP2012725A4 (fr) | 2006-05-04 | 2007-03-30 | Suppression de bruit pour dispositif electronique equipe d'un microphone de champ lointain sur console |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP07759872A Withdrawn EP2014132A4 (fr) | 2006-05-04 | 2007-03-30 | Annulation d'echo et de bruit |
Country Status (3)
Country | Link |
---|---|
EP (2) | EP2014132A4 (fr) |
JP (3) | JP4476355B2 (fr) |
WO (2) | WO2007130766A2 (fr) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8738367B2 (en) | 2009-03-18 | 2014-05-27 | Nec Corporation | Speech signal processing device |
JP4964267B2 (ja) * | 2009-04-03 | 2012-06-27 | 有限会社ケプストラム | 適応フィルタ及びこれを有するエコーキャンセラ |
JP2010249939A (ja) * | 2009-04-13 | 2010-11-04 | Sony Corp | ノイズ低減装置、ノイズ判定方法 |
EP2858068A4 (fr) * | 2012-05-31 | 2016-02-24 | Toyota Motor Co Ltd | Dispositif de détection de source audio, dispositif de génération de modèle de bruit, dispositif de réduction de bruit, dispositif d'estimation de direction de source audio, dispositif de détection de véhicule s'approchant et procédé de réduction de bruit |
CN109166589B (zh) * | 2018-08-13 | 2024-08-20 | 深圳市腾讯网络信息技术有限公司 | 应用声音抑制方法、装置、介质以及设备 |
WO2021126670A1 (fr) * | 2019-12-18 | 2021-06-24 | Dolby Laboratories Licensing Corporation | Commande de taille de pas d'adaptation de filtre pour annulation d'écho |
CN113689871A (zh) * | 2020-05-19 | 2021-11-23 | 阿里巴巴集团控股有限公司 | 回声消除方法和装置 |
CN112017679B (zh) * | 2020-08-05 | 2024-01-26 | 海尔优家智能科技(北京)有限公司 | 用于自适应滤波器系数更新的方法及装置、设备 |
CN115472175A (zh) * | 2022-08-31 | 2022-12-13 | 海尔优家智能科技(北京)有限公司 | 音频资源的回声消除方法和装置、存储介质及电子装置 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4802227A (en) * | 1987-04-03 | 1989-01-31 | American Telephone And Telegraph Company | Noise reduction processing arrangement for microphone arrays |
US5550924A (en) * | 1993-07-07 | 1996-08-27 | Picturetel Corporation | Reduction of background noise for speech enhancement |
US6445801B1 (en) * | 1997-11-21 | 2002-09-03 | Sextant Avionique | Method of frequency filtering applied to noise suppression in signals implementing a wiener filter |
EP1445759A1 (fr) * | 2003-02-10 | 2004-08-11 | Siemens Aktiengesellschaft | Méthode adaptée à l'usager pour modéliser le bruit de fond en reconnaissance de parole |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3135937B2 (ja) * | 1991-05-16 | 2001-02-19 | 株式会社リコー | 雑音除去装置 |
JP3110201B2 (ja) * | 1993-04-16 | 2000-11-20 | 沖電気工業株式会社 | ノイズ除去装置 |
US5806025A (en) * | 1996-08-07 | 1998-09-08 | U S West, Inc. | Method and system for adaptive filtering of speech signals using signal-to-noise ratio to choose subband filter bank |
SE9700772D0 (sv) * | 1997-03-03 | 1997-03-03 | Ericsson Telefon Ab L M | A high resolution post processing method for a speech decoder |
DE19806015C2 (de) * | 1998-02-13 | 1999-12-23 | Siemens Ag | Verfahren zur Verbesserung der akustischen Rückhördämpfung in Freisprecheinrichtungen |
US6263078B1 (en) * | 1999-01-07 | 2001-07-17 | Signalworks, Inc. | Acoustic echo canceller with fast volume control compensation |
JP2002537586A (ja) * | 1999-02-18 | 2002-11-05 | アンドレア エレクトロニクス コーポレイション | 雑音を消去するためのシステム、方法及び装置 |
US6426979B1 (en) * | 1999-04-29 | 2002-07-30 | Legerity, Inc. | Adaptation control algorithm for echo cancellation using signal-value based analysis |
US6526139B1 (en) * | 1999-11-03 | 2003-02-25 | Tellabs Operations, Inc. | Consolidated noise injection in a voice processing system |
JP3358731B2 (ja) * | 2000-04-24 | 2002-12-24 | 株式会社富建設 | 介護装置 |
US7139401B2 (en) * | 2002-01-03 | 2006-11-21 | Hitachi Global Storage Technologies B.V. | Hard disk drive with self-contained active acoustic noise reduction |
JP2003284181A (ja) * | 2002-03-20 | 2003-10-03 | Matsushita Electric Ind Co Ltd | 集音装置 |
US6947549B2 (en) * | 2003-02-19 | 2005-09-20 | The Hong Kong Polytechnic University | Echo canceller |
US7885420B2 (en) * | 2003-02-21 | 2011-02-08 | Qnx Software Systems Co. | Wind noise suppression system |
JP4227529B2 (ja) * | 2004-01-06 | 2009-02-18 | パナソニック株式会社 | 周期性雑音抑圧装置 |
US7254535B2 (en) * | 2004-06-30 | 2007-08-07 | Motorola, Inc. | Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system |
DE602005020662D1 (de) * | 2004-10-13 | 2010-05-27 | Koninkl Philips Electronics Nv | Echolöschung |
-
2007
- 2007-03-30 WO PCT/US2007/065701 patent/WO2007130766A2/fr active Application Filing
- 2007-03-30 EP EP07759872A patent/EP2014132A4/fr not_active Withdrawn
- 2007-03-30 JP JP2009509908A patent/JP4476355B2/ja not_active Expired - Fee Related
- 2007-03-30 WO PCT/US2007/065686 patent/WO2007130765A2/fr active Application Filing
- 2007-03-30 JP JP2009509909A patent/JP4866958B2/ja not_active Expired - Fee Related
- 2007-03-30 EP EP07759884A patent/EP2012725A4/fr not_active Withdrawn
-
2010
- 2010-01-29 JP JP2010019147A patent/JP4833343B2/ja not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4802227A (en) * | 1987-04-03 | 1989-01-31 | American Telephone And Telegraph Company | Noise reduction processing arrangement for microphone arrays |
US5550924A (en) * | 1993-07-07 | 1996-08-27 | Picturetel Corporation | Reduction of background noise for speech enhancement |
US6445801B1 (en) * | 1997-11-21 | 2002-09-03 | Sextant Avionique | Method of frequency filtering applied to noise suppression in signals implementing a wiener filter |
EP1445759A1 (fr) * | 2003-02-10 | 2004-08-11 | Siemens Aktiengesellschaft | Méthode adaptée à l'usager pour modéliser le bruit de fond en reconnaissance de parole |
Non-Patent Citations (3)
Title |
---|
JONG WON SHIN ET AL: "Voice Activity Detection based on Generalized Gamma Distribution", 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - 18-23 MARCH 2005 - PHILADELPHIA, PA, USA, IEEE, PISCATAWAY, NJ, vol. 1, 18 March 2005 (2005-03-18), pages 781-784, XP010792154, DOI: DOI:10.1109/ICASSP.2005.1415230 ISBN: 978-0-7803-8874-1 * |
MARTIN R ED - INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS: "Speech enhancement using MMSE short time spectral estimation with gamma distributed speech priors", 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS. (ICASSP). ORLANDO, FL, MAY 13 - 17, 2002; [IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP)], NEW YORK, NY : IEEE, US, vol. 1, 13 May 2002 (2002-05-13), pages I-253, XP010804742, ISBN: 978-0-7803-7402-7 * |
See also references of WO2007130766A2 * |
Also Published As
Publication number | Publication date |
---|---|
WO2007130766A3 (fr) | 2008-09-04 |
WO2007130766A2 (fr) | 2007-11-15 |
WO2007130765A2 (fr) | 2007-11-15 |
EP2014132A2 (fr) | 2009-01-14 |
JP2010171985A (ja) | 2010-08-05 |
EP2014132A4 (fr) | 2013-01-02 |
JP4476355B2 (ja) | 2010-06-09 |
JP4833343B2 (ja) | 2011-12-07 |
JP4866958B2 (ja) | 2012-02-01 |
JP2009535997A (ja) | 2009-10-01 |
EP2012725A4 (fr) | 2011-10-12 |
JP2009535996A (ja) | 2009-10-01 |
WO2007130765A3 (fr) | 2008-12-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7697700B2 (en) | Noise removal for electronic device with far field microphone on console | |
WO2007130766A2 (fr) | Suppression de bruit pour dispositif électronique équipé d'un microphone de champ lointain sur console | |
US9286907B2 (en) | Smart rejecter for keyboard click noise | |
JP4376902B2 (ja) | 音声入力システム | |
CN102938254B (zh) | 一种语音信号增强系统和方法 | |
Martin | Speech enhancement based on minimum mean-square error estimation and supergaussian priors | |
US7295972B2 (en) | Method and apparatus for blind source separation using two sensors | |
JP5587396B2 (ja) | 信号分離のためのシステム、方法、および装置 | |
JP5452655B2 (ja) | 音声状態モデルを使用したマルチセンサ音声高品質化 | |
JP4897666B2 (ja) | 音声妨害を検出および除去する方法および装置 | |
US7065487B2 (en) | Speech recognition method, program and apparatus using multiple acoustic models | |
US8462969B2 (en) | Systems and methods for own voice recognition with adaptations for noise robustness | |
Mallawaarachchi et al. | Spectrogram denoising and automated extraction of the fundamental frequency variation of dolphin whistles | |
Gerkmann et al. | Spectral masking and filtering | |
CN104021798A (zh) | 用于通过具有可变频谱增益和可动态调制的硬度的算法对音频信号隔音的方法 | |
KR20190130533A (ko) | 음성 검출기를 구비한 보청기 및 그 방법 | |
JP6888627B2 (ja) | 情報処理装置、情報処理方法及びプログラム | |
Al-Karawi et al. | Model selection toward robustness speaker verification in reverberant conditions | |
KR20110061781A (ko) | 실시간 잡음 추정에 기반하여 잡음을 제거하는 음성 처리 장치 및 방법 | |
CN110858485B (zh) | 语音增强方法、装置、设备及存储介质 | |
Gomez et al. | Robustness to speaker position in distant-talking automatic speech recognition | |
KR101568282B1 (ko) | 클러스터 기반 손실 특징 복원 알고리즘을 위한 마스크 추정 방법 및 장치 | |
Li | Robust speaker recognition by means of acoustic transmission channel matching: An acoustic parameter estimation approach | |
Witkowski et al. | Speaker Recognition from Distance Using X-Vectors with Reverberation-Robust Features | |
CN118486318A (zh) | 一种户外直播环境杂音消除方法、介质及系统 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20081107 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC MT NL PL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL BA HR MK RS |
|
DAX | Request for extension of the european patent (deleted) | ||
A4 | Supplementary search report drawn up and despatched |
Effective date: 20110909 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G10L 21/02 20060101AFI20110905BHEP |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: SONY COMPUTER ENTERTAINMENT INCORPORATED |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: SONY COMPUTER ENTERTAINMENT INC. |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN |
|
18W | Application withdrawn |
Effective date: 20130905 |