CN111226277A - 语音增强方法及装置 - Google Patents
语音增强方法及装置 Download PDFInfo
- Publication number
- CN111226277A CN111226277A CN201880067882.XA CN201880067882A CN111226277A CN 111226277 A CN111226277 A CN 111226277A CN 201880067882 A CN201880067882 A CN 201880067882A CN 111226277 A CN111226277 A CN 111226277A
- Authority
- CN
- China
- Prior art keywords
- power spectrum
- noise
- spectral
- power
- user
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 120
- 238000001228 spectrum Methods 0.000 claims abstract description 686
- 230000003595 spectral effect Effects 0.000 claims abstract description 527
- 230000007613 environmental effect Effects 0.000 claims abstract description 52
- 238000012545 processing Methods 0.000 claims abstract description 34
- 230000006870 function Effects 0.000 claims description 80
- 230000015654 memory Effects 0.000 claims description 16
- 230000005236 sound signal Effects 0.000 claims description 16
- 230000009467 reduction Effects 0.000 abstract description 9
- 230000008569 process Effects 0.000 description 27
- 238000010586 diagram Methods 0.000 description 20
- 238000004422 calculation algorithm Methods 0.000 description 19
- 238000004891 communication Methods 0.000 description 12
- 230000003044 adaptive effect Effects 0.000 description 11
- 238000004364 calculation method Methods 0.000 description 9
- 230000000694 effects Effects 0.000 description 8
- 230000000875 corresponding effect Effects 0.000 description 5
- 238000010183 spectrum analysis Methods 0.000 description 5
- 230000009286 beneficial effect Effects 0.000 description 4
- 238000004590 computer program Methods 0.000 description 4
- 238000005457 optimization Methods 0.000 description 4
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 230000009977 dual effect Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 238000011410 subtraction method Methods 0.000 description 3
- 238000010276 construction Methods 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Mathematical Physics (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
一种语音增强方法及装置,方法包括:根据带噪语音信号的功率谱以及噪声信号的功率谱,确定第一谱减参数(S201);根据第一谱减参数以及参考功率谱确定第二谱减参数(S202);根据噪声信号的功率谱和第二谱减参数对带噪语音信号进行谱减处理(S203);其中,参考功率谱包括:用户语音预测功率谱和/或环境噪声预测功率。通过考虑到终端设备的用户语音功率谱特性和/或用户所处环境噪声功率谱特性的规律性,对第一谱减参数进行优化处理得到第二谱减参数,以便根据优化后的第二谱减参数对带噪语音信号进行谱减处理,提高了去噪后的语音信号的可懂度和自然度,从而提高了降噪性能。
Description
PCT国内申请,说明书已公开。
Claims (24)
- PCT国内申请,权利要求书已公开。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711368189X | 2017-12-18 | ||
CN201711368189 | 2017-12-18 | ||
PCT/CN2018/073281 WO2019119593A1 (zh) | 2017-12-18 | 2018-01-18 | 语音增强方法及装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111226277A true CN111226277A (zh) | 2020-06-02 |
CN111226277B CN111226277B (zh) | 2022-12-27 |
Family
ID=66993022
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201880067882.XA Active CN111226277B (zh) | 2017-12-18 | 2018-01-18 | 语音增强方法及装置 |
Country Status (3)
Country | Link |
---|---|
US (1) | US11164591B2 (zh) |
CN (1) | CN111226277B (zh) |
WO (1) | WO2019119593A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111986693A (zh) * | 2020-08-10 | 2020-11-24 | 北京小米松果电子有限公司 | 音频信号的处理方法及装置、终端设备和存储介质 |
CN113793620A (zh) * | 2021-11-17 | 2021-12-14 | 深圳市北科瑞声科技股份有限公司 | 基于场景分类的语音降噪方法、装置、设备及存储介质 |
CN116705013A (zh) * | 2023-07-28 | 2023-09-05 | 腾讯科技(深圳)有限公司 | 语音唤醒词的检测方法、装置、存储介质和电子设备 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050071156A1 (en) * | 2003-09-30 | 2005-03-31 | Intel Corporation | Method for spectral subtraction in speech enhancement |
US20050288923A1 (en) * | 2004-06-25 | 2005-12-29 | The Hong Kong University Of Science And Technology | Speech enhancement by noise masking |
CN104200811A (zh) * | 2014-08-08 | 2014-12-10 | 华迪计算机集团有限公司 | 对语音信号进行自适应谱减消噪处理的方法和装置 |
CN104269178A (zh) * | 2014-08-08 | 2015-01-07 | 华迪计算机集团有限公司 | 对语音信号进行自适应谱减和小波包消噪处理的方法和装置 |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4897878A (en) * | 1985-08-26 | 1990-01-30 | Itt Corporation | Noise compensation in speech recognition apparatus |
US6775652B1 (en) * | 1998-06-30 | 2004-08-10 | At&T Corp. | Speech recognition over lossy transmission systems |
US7103540B2 (en) * | 2002-05-20 | 2006-09-05 | Microsoft Corporation | Method of pattern recognition using noise reduction uncertainty |
US20040078199A1 (en) * | 2002-08-20 | 2004-04-22 | Hanoh Kremer | Method for auditory based noise reduction and an apparatus for auditory based noise reduction |
US7133825B2 (en) * | 2003-11-28 | 2006-11-07 | Skyworks Solutions, Inc. | Computationally efficient background noise suppressor for speech coding and speech recognition |
JP2008512888A (ja) * | 2004-09-07 | 2008-04-24 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | 改善した雑音抑圧を有する電話装置 |
KR100745977B1 (ko) * | 2005-09-26 | 2007-08-06 | 삼성전자주식회사 | 음성 구간 검출 장치 및 방법 |
CN102436820B (zh) * | 2010-09-29 | 2013-08-28 | 华为技术有限公司 | 高频带信号编码方法及装置、高频带信号解码方法及装置 |
US9589580B2 (en) * | 2011-03-14 | 2017-03-07 | Cochlear Limited | Sound processing based on a confidence measure |
CN103730126B (zh) | 2012-10-16 | 2017-04-05 | 联芯科技有限公司 | 噪声抑制方法和噪声抑制器 |
CN104252863A (zh) | 2013-06-28 | 2014-12-31 | 上海通用汽车有限公司 | 车载收音机的音频降噪处理系统及方法 |
WO2015092943A1 (en) * | 2013-12-17 | 2015-06-25 | Sony Corporation | Electronic devices and methods for compensating for environmental noise in text-to-speech applications |
US9552829B2 (en) * | 2014-05-01 | 2017-01-24 | Bellevue Investments Gmbh & Co. Kgaa | System and method for low-loss removal of stationary and non-stationary short-time interferences |
US9818084B1 (en) * | 2015-12-09 | 2017-11-14 | Impinj, Inc. | RFID loss-prevention based on transition risk |
CN107393550B (zh) * | 2017-07-14 | 2021-03-19 | 深圳永顺智信息科技有限公司 | 语音处理方法及装置 |
US10991355B2 (en) * | 2019-02-18 | 2021-04-27 | Bose Corporation | Dynamic sound masking based on monitoring biosignals and environmental noises |
-
2018
- 2018-01-18 CN CN201880067882.XA patent/CN111226277B/zh active Active
- 2018-01-18 WO PCT/CN2018/073281 patent/WO2019119593A1/zh active Application Filing
- 2018-01-18 US US16/645,677 patent/US11164591B2/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050071156A1 (en) * | 2003-09-30 | 2005-03-31 | Intel Corporation | Method for spectral subtraction in speech enhancement |
US20050288923A1 (en) * | 2004-06-25 | 2005-12-29 | The Hong Kong University Of Science And Technology | Speech enhancement by noise masking |
CN104200811A (zh) * | 2014-08-08 | 2014-12-10 | 华迪计算机集团有限公司 | 对语音信号进行自适应谱减消噪处理的方法和装置 |
CN104269178A (zh) * | 2014-08-08 | 2015-01-07 | 华迪计算机集团有限公司 | 对语音信号进行自适应谱减和小波包消噪处理的方法和装置 |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111986693A (zh) * | 2020-08-10 | 2020-11-24 | 北京小米松果电子有限公司 | 音频信号的处理方法及装置、终端设备和存储介质 |
CN113793620A (zh) * | 2021-11-17 | 2021-12-14 | 深圳市北科瑞声科技股份有限公司 | 基于场景分类的语音降噪方法、装置、设备及存储介质 |
CN113793620B (zh) * | 2021-11-17 | 2022-03-08 | 深圳市北科瑞声科技股份有限公司 | 基于场景分类的语音降噪方法、装置、设备及存储介质 |
CN116705013A (zh) * | 2023-07-28 | 2023-09-05 | 腾讯科技(深圳)有限公司 | 语音唤醒词的检测方法、装置、存储介质和电子设备 |
CN116705013B (zh) * | 2023-07-28 | 2023-10-10 | 腾讯科技(深圳)有限公司 | 语音唤醒词的检测方法、装置、存储介质和电子设备 |
Also Published As
Publication number | Publication date |
---|---|
US11164591B2 (en) | 2021-11-02 |
CN111226277B (zh) | 2022-12-27 |
US20200279573A1 (en) | 2020-09-03 |
WO2019119593A1 (zh) | 2019-06-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109671433B (zh) | 一种关键词的检测方法以及相关装置 | |
US9978388B2 (en) | Systems and methods for restoration of speech components | |
EP3164871B1 (en) | User environment aware acoustic noise reduction | |
EP3127114B1 (en) | Situation dependent transient suppression | |
US9668048B2 (en) | Contextual switching of microphones | |
WO2019100500A1 (zh) | 语音信号降噪方法及设备 | |
CN111226277B (zh) | 语音增强方法及装置 | |
CN106165015B (zh) | 用于促进基于加水印的回声管理的装置和方法 | |
CN104067341A (zh) | 在存在背景噪声的情况下的语音活动检测 | |
CN107993672B (zh) | 频带扩展方法及装置 | |
CN111128221A (zh) | 一种音频信号处理方法、装置、终端及存储介质 | |
JP2020115206A (ja) | システム及び方法 | |
CN111883182B (zh) | 人声检测方法、装置、设备及存储介质 | |
CN109756818B (zh) | 双麦克风降噪方法、装置、存储介质及电子设备 | |
CN104575509A (zh) | 语音增强处理方法及装置 | |
US20150325252A1 (en) | Method and device for eliminating noise, and mobile terminal | |
US20170206898A1 (en) | Systems and methods for assisting automatic speech recognition | |
US20180277134A1 (en) | Key Click Suppression | |
CN116343765A (zh) | 自动语境绑定领域特定话音识别的方法和系统 | |
CN113707170A (zh) | 风噪声抑制方法、电子设备和存储介质 | |
CN114220430A (zh) | 多音区语音交互方法、装置、设备以及存储介质 | |
CN112992167A (zh) | 音频信号的处理方法、装置及电子设备 | |
CN112309418A (zh) | 一种抑制风噪声的方法及装置 | |
CN111724808A (zh) | 音频信号处理方法、装置、终端及存储介质 | |
US9564983B1 (en) | Enablement of a private phone conversation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |