CN1591574B - Method and system for reducing noise in a speech signal - Google Patents
Method and system for reducing noise in a speech signal
- Publication number
- CN1591574B (application number CN200410068536.3A)
- Authority
- CN
- China
- Prior art keywords
- harmonic component
- noise
- component
- voice signal
- scaling
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 27
- 230000009467 reduction Effects 0.000 claims description 23
- 238000001228 spectrum Methods 0.000 claims description 20
- 239000013598 vector Substances 0.000 claims description 20
- 230000014509 gene expression Effects 0.000 claims description 6
- 238000012549 training Methods 0.000 description 12
- 238000004891 communication Methods 0.000 description 10
- 238000012360 testing method Methods 0.000 description 9
- 238000010586 diagram Methods 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 8
- 230000008569 process Effects 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 4
- 230000002093 peripheral effect Effects 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 230000003287 optical effect Effects 0.000 description 3
- 230000009897 systematic effect Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 238000000354 decomposition reaction Methods 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 230000005055 memory storage Effects 0.000 description 2
- 238000003909 pattern recognition Methods 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000000802 evaporation-induced self-assembly Methods 0.000 description 1
- 238000013100 final test Methods 0.000 description 1
- 238000009432 framing Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 210000001260 vocal cord Anatomy 0.000 description 1
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
Landscapes
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Noise Elimination (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Telephonic Communication Services (AREA)
Claims (11)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/647,586 US7516067B2 (en) | 2003-08-25 | 2003-08-25 | Method and apparatus using harmonic-model-based front end for robust speech recognition |
US10/647,586 | 2003-08-25 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1591574A (zh) | 2005-03-09 |
CN1591574B (zh) | 2010-06-23 |
Family
ID=34104651
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200410068536.3A Expired - Fee Related CN1591574B (zh) | 2003-08-25 | 2004-08-25 | Method and system for reducing noise in a speech signal |
Country Status (7)
Country | Link |
---|---|
US (1) | US7516067B2 (zh) |
EP (1) | EP1511011B1 (zh) |
JP (1) | JP4731855B2 (zh) |
KR (1) | KR101087319B1 (zh) |
CN (1) | CN1591574B (zh) |
AT (1) | ATE347162T1 (zh) |
DE (1) | DE602004003439T2 (zh) |
Families Citing this family (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7447630B2 (en) * | 2003-11-26 | 2008-11-04 | Microsoft Corporation | Method and apparatus for multi-sensory speech enhancement |
KR100744352B1 (ko) * | 2005-08-01 | 2007-07-30 | 삼성전자주식회사 | Method and apparatus for extracting voiced/unvoiced classification information using harmonic components of a speech signal |
US9185487B2 (en) | 2006-01-30 | 2015-11-10 | Audience, Inc. | System and method for providing noise suppression utilizing null processing noise subtraction |
US8005671B2 (en) * | 2006-12-04 | 2011-08-23 | Qualcomm Incorporated | Systems and methods for dynamic normalization to reduce loss in precision for low-level signals |
JP5089295B2 (ja) * | 2007-08-31 | 2012-12-05 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Speech processing system, method, and program |
KR100919223B1 (ko) * | 2007-09-19 | 2009-09-28 | 한국전자통신연구원 | Method and apparatus for speech recognition in noisy environments using subband uncertainty information |
US8306817B2 (en) * | 2008-01-08 | 2012-11-06 | Microsoft Corporation | Speech recognition with non-linear noise reduction on Mel-frequency cepstra |
JP5640238B2 (ja) * | 2008-02-28 | 2014-12-17 | 株式会社通信放送国際研究所 | Singular point signal processing system and program therefor |
US8473287B2 (en) | 2010-04-19 | 2013-06-25 | Audience, Inc. | Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system |
US8538035B2 (en) | 2010-04-29 | 2013-09-17 | Audience, Inc. | Multi-microphone robust noise suppression |
US8798290B1 (en) | 2010-04-21 | 2014-08-05 | Audience, Inc. | Systems and methods for adaptive signal equalization |
US8781137B1 (en) | 2010-04-27 | 2014-07-15 | Audience, Inc. | Wind noise detection and suppression |
US9558755B1 (en) | 2010-05-20 | 2017-01-31 | Knowles Electronics, Llc | Noise suppression assisted automatic speech recognition |
US9245538B1 (en) * | 2010-05-20 | 2016-01-26 | Audience, Inc. | Bandwidth enhancement of speech signals assisted by noise reduction |
US8447596B2 (en) * | 2010-07-12 | 2013-05-21 | Audience, Inc. | Monaural noise suppression based on computational auditory scene analysis |
JP6064600B2 (ja) * | 2010-11-25 | 2017-01-25 | 日本電気株式会社 | Signal processing device, signal processing method, and signal processing program |
FR2980620A1 (fr) * | 2011-09-23 | 2013-03-29 | France Telecom | Processing for improving the quality of decoded audio-frequency signals |
US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
WO2016033364A1 (en) | 2014-08-28 | 2016-03-03 | Audience, Inc. | Multi-sourced noise suppression |
US9953646B2 (en) | 2014-09-02 | 2018-04-24 | Belleau Technologies | Method and system for dynamic speech recognition and tracking of prewritten script |
CA2998689C (en) * | 2015-09-25 | 2021-10-26 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Encoder and method for encoding an audio signal with reduced background noise using linear predictive coding |
WO2017143334A1 (en) * | 2016-02-19 | 2017-08-24 | New York University | Method and system for multi-talker babble noise reduction using q-factor based signal decomposition |
CN108175436A (zh) * | 2017-12-28 | 2018-06-19 | 北京航空航天大学 | Intelligent automatic recognition method for bowel sounds |
US11545143B2 (en) | 2021-05-18 | 2023-01-03 | Boris Fridman-Mintz | Recognition or synthesis of human-uttered harmonic sounds |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1152776A (zh) * | 1995-10-26 | 1997-06-25 | 索尼公司 | Method and apparatus for reproducing speech signals, decoding speech, and synthesizing speech |
US5913187A (en) * | 1997-08-29 | 1999-06-15 | Nortel Networks Corporation | Nonlinear filter for noise suppression in linear prediction speech processing devices |
US6029128A (en) * | 1995-06-16 | 2000-02-22 | Nokia Mobile Phones Ltd. | Speech synthesizer |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH06289897A (ja) * | 1993-03-31 | 1994-10-18 | Sony Corp | Speech signal processing device |
US5701390A (en) | 1995-02-22 | 1997-12-23 | Digital Voice Systems, Inc. | Synthesis of MBE-based coded speech using regenerated phase information |
JP3591068B2 (ja) * | 1995-06-30 | 2004-11-17 | ソニー株式会社 | Method for reducing noise in a speech signal |
JPH0944186A (ja) * | 1995-07-31 | 1997-02-14 | Matsushita Electric Ind Co Ltd | Noise suppression device |
JPH09152891A (ja) * | 1995-11-28 | 1997-06-10 | Takayoshi Hirata | Method for removing quasi-periodic noise using inharmonic period detection |
US6453285B1 (en) * | 1998-08-21 | 2002-09-17 | Polycom, Inc. | Speech activity detector for use in noise reduction system, and methods therefor |
US6253171B1 (en) * | 1999-02-23 | 2001-06-26 | Comsat Corporation | Method of determining the voicing probability of speech signals |
US6529868B1 (en) * | 2000-03-28 | 2003-03-04 | Tellabs Operations, Inc. | Communication system noise cancellation power signal calculation techniques |
TW466471B (en) * | 2000-04-07 | 2001-12-01 | Ind Tech Res Inst | Method for performing noise adaptation in voice recognition unit |
US20020039425A1 (en) * | 2000-07-19 | 2002-04-04 | Burnett Gregory C. | Method and apparatus for removing noise from electronic signals |
US7020605B2 (en) * | 2000-09-15 | 2006-03-28 | Mindspeed Technologies, Inc. | Speech coding system with time-domain noise attenuation |
JP3586205B2 (ja) * | 2001-02-22 | 2004-11-10 | 日本電信電話株式会社 | Speech spectrum improvement method, speech spectrum improvement device, speech spectrum improvement program, and storage medium storing the program |
US7120580B2 (en) * | 2001-08-15 | 2006-10-10 | Sri International | Method and apparatus for recognizing speech in a noisy environment |
US6952482B2 (en) * | 2001-10-02 | 2005-10-04 | Siemens Corporation Research, Inc. | Method and apparatus for noise filtering |
US7447630B2 (en) * | 2003-11-26 | 2008-11-04 | Microsoft Corporation | Method and apparatus for multi-sensory speech enhancement |
US7464029B2 (en) * | 2005-07-22 | 2008-12-09 | Qualcomm Incorporated | Robust separation of speech signals in a noisy environment |
KR101414233B1 (ko) * | 2007-01-05 | 2014-07-02 | 삼성전자 주식회사 | Apparatus and method for improving the intelligibility of a speech signal |
- 2003
  - 2003-08-25 US US10/647,586 patent/US7516067B2/en not_active Expired - Fee Related
- 2004
  - 2004-07-23 AT AT04103533T patent/ATE347162T1/de not_active IP Right Cessation
  - 2004-07-23 EP EP04103533A patent/EP1511011B1/en active Active
  - 2004-07-23 DE DE602004003439T patent/DE602004003439T2/de active Active
  - 2004-08-19 JP JP2004239995A patent/JP4731855B2/ja not_active Expired - Fee Related
  - 2004-08-24 KR KR1020040066834A patent/KR101087319B1/ko active IP Right Grant
  - 2004-08-25 CN CN200410068536.3A patent/CN1591574B/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
JP4731855B2 (ja) | 2011-07-27 |
DE602004003439D1 (de) | 2007-01-11 |
US20050049857A1 (en) | 2005-03-03 |
JP2005070779A (ja) | 2005-03-17 |
DE602004003439T2 (de) | 2007-03-29 |
CN1591574A (zh) | 2005-03-09 |
EP1511011B1 (en) | 2006-11-29 |
KR20050022371A (ko) | 2005-03-07 |
EP1511011A3 (en) | 2005-04-13 |
KR101087319B1 (ko) | 2011-11-25 |
EP1511011A2 (en) | 2005-03-02 |
US7516067B2 (en) | 2009-04-07 |
ATE347162T1 (de) | 2006-12-15 |
Similar Documents
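Publication | Title | Publication Date |
---|---|---|
CN1591574B (zh) | Method and system for reducing noise in a speech signal | |
CN101887728B (zh) | Multi-sensory speech enhancement method | |
CN1584984B (zh) | Noise reduction method using instantaneous signal-to-noise ratio as the principal quantity for optimal estimation | |
CN110457432B (zh) | Interview scoring method, apparatus, device, and storage medium | |
CN101385074B (zh) | Speaker verification | |
CN100583243C (zh) | Method and apparatus for multi-sensor speech enhancement | |
CN101199006B (zh) | Multi-sensory speech enhancement method and system using a clean-speech prior | |
CN101606191B (zh) | Multi-sensory speech enhancement using a speech-state model | |
CN100589180C (zh) | Speech recognition method using multimodal variational inference with switching state space models | |
CN1419184A (zh) | Method and apparatus for adapting a class entity dictionary used with a language model | |
MXPA04002919A (es) | Method of noise estimation using incremental Bayes learning. | |
CN103189913A (zh) | Method, apparatus, and machine-readable storage medium for decomposing a multichannel audio signal | |
US6990447B2 (en) | Method and apparatus for denoising and deverberation using variational inference and strong speech models | |
CN100565671C (zh) | Vocal tract resonance tracking method | |
CN113470698B (zh) | Speaker change point detection method, apparatus, device, and storage medium | |
CN105224844A (zh) | Verification method, system, and apparatus | |
US20070055519A1 (en) | Robust bandwith extension of narrowband signals | |
CN102568484B (zh) | Warped spectral and fine estimate audio encoding | |
JP2002140093A (ja) | Noise reduction method using acoustic-space segmentation, correction, and scaling vectors in the domain of noisy speech | |
CN1624765A (zh) | Method and apparatus for continuous-valued vocal tract resonance tracking using piecewise linear approximations | |
Bouchakour et al. | Noise-robust speech recognition in mobile network based on convolution neural networks | |
Zouhir et al. | Power Normalized Gammachirp Cepstral (PNGC) coefficients-based approach for robust speaker recognition | |
CN112133279A (zh) | In-vehicle information broadcasting method, apparatus, and terminal device | |
CN117649846B (zh) | Speech recognition model generation method, speech recognition method, device, and medium | |
Painter et al. | A MATLAB software tool for the introduction of speech coding fundamentals in a DSP course | |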
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
ASS | Succession or assignment of patent right | Owner name: MICROSOFT TECHNOLOGY LICENSING LLC; Free format text: FORMER OWNER: MICROSOFT CORP.; Effective date: 20150515 |
C41 | Transfer of patent application or patent right or utility model | |
TR01 | Transfer of patent right | Effective date of registration: 20150515; Address after: Washington State; Patentee after: Microsoft Technology Licensing LLC; Address before: Washington State; Patentee before: Microsoft Corp. |
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 20100623; Termination date: 20200825 |