CN111226277A - 语音增强方法及装置 - Google Patents

语音增强方法及装置 Download PDF

Info

Publication number
CN111226277A
CN111226277A CN201880067882.XA CN201880067882A CN111226277A CN 111226277 A CN111226277 A CN 111226277A CN 201880067882 A CN201880067882 A CN 201880067882A CN 111226277 A CN111226277 A CN 111226277A
Authority
CN
China
Prior art keywords
power spectrum
noise
spectral
power
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201880067882.XA
Other languages
English (en)
Other versions
CN111226277B (zh
Inventor
胡伟湘
苗磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of CN111226277A publication Critical patent/CN111226277A/zh
Application granted granted Critical
Publication of CN111226277B publication Critical patent/CN111226277B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

一种语音增强方法及装置,方法包括:根据带噪语音信号的功率谱以及噪声信号的功率谱,确定第一谱减参数(S201);根据第一谱减参数以及参考功率谱确定第二谱减参数(S202);根据噪声信号的功率谱和第二谱减参数对带噪语音信号进行谱减处理(S203);其中,参考功率谱包括:用户语音预测功率谱和/或环境噪声预测功率。通过考虑到终端设备的用户语音功率谱特性和/或用户所处环境噪声功率谱特性的规律性,对第一谱减参数进行优化处理得到第二谱减参数,以便根据优化后的第二谱减参数对带噪语音信号进行谱减处理,提高了去噪后的语音信号的可懂度和自然度,从而提高了降噪性能。

Description

PCT国内申请,说明书已公开。

Claims (24)

  1. PCT国内申请,权利要求书已公开。
CN201880067882.XA 2017-12-18 2018-01-18 语音增强方法及装置 Active CN111226277B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN201711368189X 2017-12-18
CN201711368189 2017-12-18
PCT/CN2018/073281 WO2019119593A1 (zh) 2017-12-18 2018-01-18 语音增强方法及装置

Publications (2)

Publication Number Publication Date
CN111226277A true CN111226277A (zh) 2020-06-02
CN111226277B CN111226277B (zh) 2022-12-27

Family

ID=66993022

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880067882.XA Active CN111226277B (zh) 2017-12-18 2018-01-18 语音增强方法及装置

Country Status (3)

Country Link
US (1) US11164591B2 (zh)
CN (1) CN111226277B (zh)
WO (1) WO2019119593A1 (zh)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111986693A (zh) * 2020-08-10 2020-11-24 北京小米松果电子有限公司 音频信号的处理方法及装置、终端设备和存储介质
CN113793620A (zh) * 2021-11-17 2021-12-14 深圳市北科瑞声科技股份有限公司 基于场景分类的语音降噪方法、装置、设备及存储介质
CN116705013A (zh) * 2023-07-28 2023-09-05 腾讯科技(深圳)有限公司 语音唤醒词的检测方法、装置、存储介质和电子设备

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050071156A1 (en) * 2003-09-30 2005-03-31 Intel Corporation Method for spectral subtraction in speech enhancement
US20050288923A1 (en) * 2004-06-25 2005-12-29 The Hong Kong University Of Science And Technology Speech enhancement by noise masking
CN104200811A (zh) * 2014-08-08 2014-12-10 华迪计算机集团有限公司 对语音信号进行自适应谱减消噪处理的方法和装置
CN104269178A (zh) * 2014-08-08 2015-01-07 华迪计算机集团有限公司 对语音信号进行自适应谱减和小波包消噪处理的方法和装置

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4897878A (en) * 1985-08-26 1990-01-30 Itt Corporation Noise compensation in speech recognition apparatus
US6775652B1 (en) * 1998-06-30 2004-08-10 At&T Corp. Speech recognition over lossy transmission systems
US7103540B2 (en) * 2002-05-20 2006-09-05 Microsoft Corporation Method of pattern recognition using noise reduction uncertainty
US20040078199A1 (en) * 2002-08-20 2004-04-22 Hanoh Kremer Method for auditory based noise reduction and an apparatus for auditory based noise reduction
US7133825B2 (en) * 2003-11-28 2006-11-07 Skyworks Solutions, Inc. Computationally efficient background noise suppressor for speech coding and speech recognition
JP2008512888A (ja) * 2004-09-07 2008-04-24 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 改善した雑音抑圧を有する電話装置
KR100745977B1 (ko) * 2005-09-26 2007-08-06 삼성전자주식회사 음성 구간 검출 장치 및 방법
CN102436820B (zh) * 2010-09-29 2013-08-28 华为技术有限公司 高频带信号编码方法及装置、高频带信号解码方法及装置
US9589580B2 (en) * 2011-03-14 2017-03-07 Cochlear Limited Sound processing based on a confidence measure
CN103730126B (zh) 2012-10-16 2017-04-05 联芯科技有限公司 噪声抑制方法和噪声抑制器
CN104252863A (zh) 2013-06-28 2014-12-31 上海通用汽车有限公司 车载收音机的音频降噪处理系统及方法
WO2015092943A1 (en) * 2013-12-17 2015-06-25 Sony Corporation Electronic devices and methods for compensating for environmental noise in text-to-speech applications
US9552829B2 (en) * 2014-05-01 2017-01-24 Bellevue Investments Gmbh & Co. Kgaa System and method for low-loss removal of stationary and non-stationary short-time interferences
US9818084B1 (en) * 2015-12-09 2017-11-14 Impinj, Inc. RFID loss-prevention based on transition risk
CN107393550B (zh) * 2017-07-14 2021-03-19 深圳永顺智信息科技有限公司 语音处理方法及装置
US10991355B2 (en) * 2019-02-18 2021-04-27 Bose Corporation Dynamic sound masking based on monitoring biosignals and environmental noises

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050071156A1 (en) * 2003-09-30 2005-03-31 Intel Corporation Method for spectral subtraction in speech enhancement
US20050288923A1 (en) * 2004-06-25 2005-12-29 The Hong Kong University Of Science And Technology Speech enhancement by noise masking
CN104200811A (zh) * 2014-08-08 2014-12-10 华迪计算机集团有限公司 对语音信号进行自适应谱减消噪处理的方法和装置
CN104269178A (zh) * 2014-08-08 2015-01-07 华迪计算机集团有限公司 对语音信号进行自适应谱减和小波包消噪处理的方法和装置

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111986693A (zh) * 2020-08-10 2020-11-24 北京小米松果电子有限公司 音频信号的处理方法及装置、终端设备和存储介质
CN113793620A (zh) * 2021-11-17 2021-12-14 深圳市北科瑞声科技股份有限公司 基于场景分类的语音降噪方法、装置、设备及存储介质
CN113793620B (zh) * 2021-11-17 2022-03-08 深圳市北科瑞声科技股份有限公司 基于场景分类的语音降噪方法、装置、设备及存储介质
CN116705013A (zh) * 2023-07-28 2023-09-05 腾讯科技(深圳)有限公司 语音唤醒词的检测方法、装置、存储介质和电子设备
CN116705013B (zh) * 2023-07-28 2023-10-10 腾讯科技(深圳)有限公司 语音唤醒词的检测方法、装置、存储介质和电子设备

Also Published As

Publication number Publication date
US11164591B2 (en) 2021-11-02
CN111226277B (zh) 2022-12-27
US20200279573A1 (en) 2020-09-03
WO2019119593A1 (zh) 2019-06-27

Similar Documents

Publication Publication Date Title
CN109671433B (zh) 一种关键词的检测方法以及相关装置
US9978388B2 (en) Systems and methods for restoration of speech components
EP3164871B1 (en) User environment aware acoustic noise reduction
EP3127114B1 (en) Situation dependent transient suppression
US9668048B2 (en) Contextual switching of microphones
WO2019100500A1 (zh) 语音信号降噪方法及设备
CN111226277B (zh) 语音增强方法及装置
CN106165015B (zh) 用于促进基于加水印的回声管理的装置和方法
CN104067341A (zh) 在存在背景噪声的情况下的语音活动检测
CN107993672B (zh) 频带扩展方法及装置
CN111128221A (zh) 一种音频信号处理方法、装置、终端及存储介质
JP2020115206A (ja) システム及び方法
CN111883182B (zh) 人声检测方法、装置、设备及存储介质
CN109756818B (zh) 双麦克风降噪方法、装置、存储介质及电子设备
CN104575509A (zh) 语音增强处理方法及装置
US20150325252A1 (en) Method and device for eliminating noise, and mobile terminal
US20170206898A1 (en) Systems and methods for assisting automatic speech recognition
US20180277134A1 (en) Key Click Suppression
CN116343765A (zh) 自动语境绑定领域特定话音识别的方法和系统
CN113707170A (zh) 风噪声抑制方法、电子设备和存储介质
CN114220430A (zh) 多音区语音交互方法、装置、设备以及存储介质
CN112992167A (zh) 音频信号的处理方法、装置及电子设备
CN112309418A (zh) 一种抑制风噪声的方法及装置
CN111724808A (zh) 音频信号处理方法、装置、终端及存储介质
US9564983B1 (en) Enablement of a private phone conversation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant