DE602007014382D1 - Unterscheidung zwischen Vordergrundsprache und Hintergrundgeräuschen - Google Patents
Unterscheidung zwischen Vordergrundsprache und HintergrundgeräuschenInfo
- Publication number
- DE602007014382D1 DE602007014382D1 DE602007014382T DE602007014382T DE602007014382D1 DE 602007014382 D1 DE602007014382 D1 DE 602007014382D1 DE 602007014382 T DE602007014382 T DE 602007014382T DE 602007014382 T DE602007014382 T DE 602007014382T DE 602007014382 D1 DE602007014382 D1 DE 602007014382D1
- Authority
- DE
- Germany
- Prior art keywords
- distinction
- background noise
- foreground
- stochastic
- model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000002708 enhancing effect Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Circuit For Audible Band Transducer (AREA)
- Machine Translation (AREA)
- Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
- Details Of Television Scanning (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP07021933A EP2058797B1 (de) | 2007-11-12 | 2007-11-12 | Unterscheidung zwischen Vordergrundsprache und Hintergrundgeräuschen |
Publications (1)
Publication Number | Publication Date |
---|---|
DE602007014382D1 true DE602007014382D1 (de) | 2011-06-16 |
Family
ID=39015777
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE602007014382T Active DE602007014382D1 (de) | 2007-11-12 | 2007-11-12 | Unterscheidung zwischen Vordergrundsprache und Hintergrundgeräuschen |
Country Status (4)
Country | Link |
---|---|
US (1) | US8131544B2 (de) |
EP (1) | EP2058797B1 (de) |
AT (1) | ATE508452T1 (de) |
DE (1) | DE602007014382D1 (de) |
Families Citing this family (58)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8949120B1 (en) | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
JP4867516B2 (ja) * | 2006-08-01 | 2012-02-01 | ヤマハ株式会社 | 音声会議システム |
JP2009086581A (ja) * | 2007-10-03 | 2009-04-23 | Toshiba Corp | 音声認識の話者モデルを作成する装置およびプログラム |
US8355511B2 (en) * | 2008-03-18 | 2013-01-15 | Audience, Inc. | System and method for envelope-based acoustic echo cancellation |
US8521530B1 (en) | 2008-06-30 | 2013-08-27 | Audience, Inc. | System and method for enhancing a monaural audio signal |
EP2189976B1 (de) * | 2008-11-21 | 2012-10-24 | Nuance Communications, Inc. | Verfahren zur Adaption eines Codierungsbuches für Spracherkennung |
US8275148B2 (en) * | 2009-07-28 | 2012-09-25 | Fortemedia, Inc. | Audio processing apparatus and method |
KR101581885B1 (ko) * | 2009-08-26 | 2016-01-04 | 삼성전자주식회사 | 복소 스펙트럼 잡음 제거 장치 및 방법 |
CN102725715B (zh) * | 2009-10-20 | 2016-11-09 | 谱瑞科技股份有限公司 | 减少触控屏幕控制器中的耦合噪声影响的方法和设备 |
US9838784B2 (en) | 2009-12-02 | 2017-12-05 | Knowles Electronics, Llc | Directional audio capture |
US9008329B1 (en) * | 2010-01-26 | 2015-04-14 | Audience, Inc. | Noise reduction using multi-feature cluster tracker |
US8538035B2 (en) | 2010-04-29 | 2013-09-17 | Audience, Inc. | Multi-microphone robust noise suppression |
US8473287B2 (en) | 2010-04-19 | 2013-06-25 | Audience, Inc. | Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system |
US8781137B1 (en) | 2010-04-27 | 2014-07-15 | Audience, Inc. | Wind noise detection and suppression |
US9558755B1 (en) | 2010-05-20 | 2017-01-31 | Knowles Electronics, Llc | Noise suppression assisted automatic speech recognition |
US8447596B2 (en) * | 2010-07-12 | 2013-05-21 | Audience, Inc. | Monaural noise suppression based on computational auditory scene analysis |
WO2012108911A1 (en) | 2011-02-07 | 2012-08-16 | Cypress Semiconductor Corporation | Noise filtering devices, systems and methods for capacitance sensing devices |
CN102655006A (zh) * | 2011-03-03 | 2012-09-05 | 富泰华工业(深圳)有限公司 | 语音传输装置及其语音传输方法 |
US9224388B2 (en) | 2011-03-04 | 2015-12-29 | Qualcomm Incorporated | Sound recognition method and system |
US8849663B2 (en) * | 2011-03-21 | 2014-09-30 | The Intellisis Corporation | Systems and methods for segmenting and/or classifying an audio signal from transformed audio information |
US9142220B2 (en) | 2011-03-25 | 2015-09-22 | The Intellisis Corporation | Systems and methods for reconstructing an audio signal from transformed audio information |
US9323385B2 (en) | 2011-04-05 | 2016-04-26 | Parade Technologies, Ltd. | Noise detection for a capacitance sensing panel |
US9170322B1 (en) | 2011-04-05 | 2015-10-27 | Parade Technologies, Ltd. | Method and apparatus for automating noise reduction tuning in real time |
WO2012158156A1 (en) * | 2011-05-16 | 2012-11-22 | Google Inc. | Noise supression method and apparatus using multiple feature modeling for speech/noise likelihood |
KR101801327B1 (ko) * | 2011-07-29 | 2017-11-27 | 삼성전자주식회사 | 감정 정보 생성 장치, 감정 정보 생성 방법 및 감정 정보 기반 기능 추천 장치 |
US8620646B2 (en) * | 2011-08-08 | 2013-12-31 | The Intellisis Corporation | System and method for tracking sound pitch across an audio signal using harmonic envelope |
US9183850B2 (en) | 2011-08-08 | 2015-11-10 | The Intellisis Corporation | System and method for tracking sound pitch across an audio signal |
US8548803B2 (en) | 2011-08-08 | 2013-10-01 | The Intellisis Corporation | System and method of processing a sound signal including transforming the sound signal into a frequency-chirp domain |
WO2013057608A1 (en) * | 2011-10-17 | 2013-04-25 | Koninklijke Philips Electronics N.V. | A medical monitoring system based on sound analysis in a medical environment |
US20150287406A1 (en) * | 2012-03-23 | 2015-10-08 | Google Inc. | Estimating Speech in the Presence of Noise |
US9881616B2 (en) * | 2012-06-06 | 2018-01-30 | Qualcomm Incorporated | Method and systems having improved speech recognition |
TWI557722B (zh) * | 2012-11-15 | 2016-11-11 | 緯創資通股份有限公司 | 語音干擾的濾除方法、系統,與電腦可讀記錄媒體 |
CN103971685B (zh) * | 2013-01-30 | 2015-06-10 | 腾讯科技(深圳)有限公司 | 语音命令识别方法和系统 |
US9489965B2 (en) * | 2013-03-15 | 2016-11-08 | Sri International | Method and apparatus for acoustic signal characterization |
US9520138B2 (en) * | 2013-03-15 | 2016-12-13 | Broadcom Corporation | Adaptive modulation filtering for spectral feature enhancement |
US9570087B2 (en) * | 2013-03-15 | 2017-02-14 | Broadcom Corporation | Single channel suppression of interfering sources |
US9536540B2 (en) * | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
CN104143326B (zh) | 2013-12-03 | 2016-11-02 | 腾讯科技(深圳)有限公司 | 一种语音命令识别方法和装置 |
CN106797512B (zh) | 2014-08-28 | 2019-10-25 | 美商楼氏电子有限公司 | 多源噪声抑制的方法、系统和非瞬时计算机可读存储介质 |
US9978388B2 (en) | 2014-09-12 | 2018-05-22 | Knowles Electronics, Llc | Systems and methods for restoration of speech components |
TWI584275B (zh) * | 2014-11-25 | 2017-05-21 | 宏達國際電子股份有限公司 | 電子裝置和聲音信號的分析與播放方法 |
US9870785B2 (en) | 2015-02-06 | 2018-01-16 | Knuedge Incorporated | Determining features of harmonic signals |
US9922668B2 (en) | 2015-02-06 | 2018-03-20 | Knuedge Incorporated | Estimating fractional chirp rate with multiple frequency representations |
US9842611B2 (en) | 2015-02-06 | 2017-12-12 | Knuedge Incorporated | Estimating pitch using peak-to-peak distances |
CN105096121B (zh) * | 2015-06-25 | 2017-07-25 | 百度在线网络技术(北京)有限公司 | 声纹认证方法和装置 |
US20170150254A1 (en) * | 2015-11-19 | 2017-05-25 | Vocalzoom Systems Ltd. | System, device, and method of sound isolation and signal enhancement |
US9820042B1 (en) | 2016-05-02 | 2017-11-14 | Knowles Electronics, Llc | Stereo separation and directional suppression with omni-directional microphones |
CN105933323B (zh) * | 2016-06-01 | 2019-05-31 | 百度在线网络技术(北京)有限公司 | 声纹注册、认证方法及装置 |
US20180166073A1 (en) * | 2016-12-13 | 2018-06-14 | Ford Global Technologies, Llc | Speech Recognition Without Interrupting The Playback Audio |
US10558421B2 (en) | 2017-05-22 | 2020-02-11 | International Business Machines Corporation | Context based identification of non-relevant verbal communications |
US10356362B1 (en) * | 2018-01-16 | 2019-07-16 | Google Llc | Controlling focus of audio signals on speaker during videoconference |
US20230005488A1 (en) * | 2019-12-17 | 2023-01-05 | Sony Group Corporation | Signal processing device, signal processing method, program, and signal processing system |
US11274965B2 (en) | 2020-02-10 | 2022-03-15 | International Business Machines Corporation | Noise model-based converter with signal steps based on uncertainty |
CN113870879A (zh) * | 2020-06-12 | 2021-12-31 | 青岛海尔电冰箱有限公司 | 智能家电麦克风的共享方法、智能家电和可读存储介质 |
US11694692B2 (en) | 2020-11-11 | 2023-07-04 | Bank Of America Corporation | Systems and methods for audio enhancement and conversion |
CN113870871A (zh) * | 2021-08-19 | 2021-12-31 | 阿里巴巴达摩院(杭州)科技有限公司 | 音频处理方法、装置、存储介质、电子设备 |
CN115547308B (zh) * | 2022-09-01 | 2024-09-20 | 北京达佳互联信息技术有限公司 | 一种音频识别模型训练方法、音频识别方法、装置、电子设备及存储介质 |
CN118098260B (zh) * | 2024-03-26 | 2024-08-23 | 荣耀终端有限公司 | 一种语音信号处理方法及相关设备 |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5353376A (en) * | 1992-03-20 | 1994-10-04 | Texas Instruments Incorporated | System and method for improved speech acquisition for hands-free voice telecommunication in a noisy environment |
US6615170B1 (en) | 2000-03-07 | 2003-09-02 | International Business Machines Corporation | Model-based voice activity detection system and method using a log-likelihood ratio and pitch |
US6993481B2 (en) * | 2000-12-04 | 2006-01-31 | Global Ip Sound Ab | Detection of speech activity using feature model adaptation |
US7072834B2 (en) * | 2002-04-05 | 2006-07-04 | Intel Corporation | Adapting to adverse acoustic environment in speech processing using playback training data |
JP2005249816A (ja) * | 2004-03-01 | 2005-09-15 | Internatl Business Mach Corp <Ibm> | 信号強調装置、方法及びプログラム、並びに音声認識装置、方法及びプログラム |
JP2007093630A (ja) * | 2005-09-05 | 2007-04-12 | Advanced Telecommunication Research Institute International | 音声強調装置 |
CA2536976A1 (en) * | 2006-02-20 | 2007-08-20 | Diaphonics, Inc. | Method and apparatus for detecting speaker change in a voice transaction |
US20070239441A1 (en) * | 2006-03-29 | 2007-10-11 | Jiri Navratil | System and method for addressing channel mismatch through class specific transforms |
EP2022042B1 (de) * | 2006-05-16 | 2010-12-08 | Loquendo S.p.A. | Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache |
US9966085B2 (en) | 2006-12-30 | 2018-05-08 | Google Technology Holdings LLC | Method and noise suppression circuit incorporating a plurality of noise suppression techniques |
DE602007004733D1 (de) | 2007-10-10 | 2010-03-25 | Harman Becker Automotive Sys | Sprechererkennung |
-
2007
- 2007-11-12 DE DE602007014382T patent/DE602007014382D1/de active Active
- 2007-11-12 EP EP07021933A patent/EP2058797B1/de active Active
- 2007-11-12 AT AT07021933T patent/ATE508452T1/de not_active IP Right Cessation
-
2008
- 2008-11-12 US US12/269,837 patent/US8131544B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
EP2058797B1 (de) | 2011-05-04 |
US20090228272A1 (en) | 2009-09-10 |
US8131544B2 (en) | 2012-03-06 |
ATE508452T1 (de) | 2011-05-15 |
EP2058797A1 (de) | 2009-05-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE602007014382D1 (de) | Unterscheidung zwischen Vordergrundsprache und Hintergrundgeräuschen | |
DE602006005493D1 (de) | Sprachsteuerung von Fahrzeugelementen von außerhalb einer Fahrzeugkabine | |
DE602007004733D1 (de) | Sprechererkennung | |
ATE430975T1 (de) | Reduzierung von hintergrundrauschen in freisprechsystemen | |
ATE456130T1 (de) | Partielle sprachrekonstruktion | |
ATE425532T1 (de) | Modellbasierte verbesserung von sprachsignalen | |
ATE540398T1 (de) | Sprachaktivitätsdetektionseinrichtung und verfahren | |
DE602006018795D1 (de) | Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache | |
MX2009005159A (es) | Un metodo y un aparato para descodificar una señal de audio. | |
NZ562182A (en) | Method and apparatus for anti-sparseness filtering of a bandwidth extended speech prediction excitation signal | |
ATE420431T1 (de) | Grundfrequenzextraktion mit adaptivem filter | |
ATE550754T1 (de) | Verfahren und vorrichtung zur aktiven geräuschsminderung unter anwendung von wahrnehmungsmaskierung | |
DE602007004217D1 (de) | Schnelle Schätzung der Spektraldichte der Rauschleistung zur Sprachsignalverbesserung | |
ATE385027T1 (de) | Sprachverbesserung | |
ATE476835T1 (de) | Verfahren und vorrichtung zur verbesserung der audiorekonstruktion | |
GB202000883D0 (en) | An expressive text-to-speech system | |
ATE492875T1 (de) | Sprachanalysesystem | |
DE602006009927D1 (de) | Verfahren und System zur Bereitstellung eines Tonsignals mit erweiterter Bandbreite | |
ATE422789T1 (de) | Mikrofoneinrichtung mit orientierungssensor und entsprechendes verfahren zum betreiben der mikrofoneinrichtung | |
DE602007009731D1 (de) | Verfahren zur rückkopplungslöschung in einem hörgerät und hörgerät | |
MX2016004923A (es) | Concepto para codificar una señal de audio y decodificar una señal de audio usando informacion de conformacion espectral relacionada con la voz. | |
WO2013132348A3 (en) | Formant based speech reconstruction from noisy signals | |
JP1719852S (ja) | 車載用スピーカー | |
DK2148530T3 (da) | Hørehjælp med UV-føler og fremgangsmåde til drift heraf | |
DK1945000T3 (da) | Fremgangsmåde til reduktion af forstyrrelser og et tilsvarende akustisk anlæg |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
R097 | No opposition filed against granted patent, or epo opposition proceedings concluded without decision |
Ref document number: 2058797 Country of ref document: EP |
|
R082 | Change of representative |
Ref document number: 2058797 Country of ref document: EP Representative=s name: GRUENECKER, KINKELDEY, STOCKMAIR & SCHWANHAEUSSER, |