CN106571146B - Noise signal determining method, and voice de-noising method and apparatus - Google Patents

Noise signal determining method, and voice de-noising method and apparatus

Info

Publication number
CN106571146B
Authority
CN
China
Prior art keywords
signal
variance
frame
frame signal
segment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510670697.8A
Other languages
English (en)
Chinese (zh)
Other versions
CN106571146A (zh)
Inventor
杜志军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced New Technologies Co Ltd
Advantageous New Technologies Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to CN201510670697.8A (CN106571146B)
Application filed by Alibaba Group Holding Ltd
Priority to PL16854895T (PL3364413T3)
Priority to EP16854895.6A (EP3364413B1)
Priority to ES16854895T (ES2807529T3)
Priority to PCT/CN2016/101444 (WO2017063516A1)
Priority to KR1020187013177A (KR102208855B1)
Priority to JP2018519388A (JP6784758B2)
Priority to SG11201803004YA
Priority to SG10202005490WA
Publication of CN106571146A
Priority to US15/951,928 (US10796713B2)
Application granted
Publication of CN106571146B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L25/84 Detection of presence or absence of voice signals for discriminating voice from noise
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L21/0232 Processing in the frequency domain
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0324 Details of processing therefor
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L2021/02168 Noise filtering characterised by the method used for estimating noise the estimation exclusively taking place during speech pauses
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L2025/783 Detection of presence or absence of voice signals based on threshold decision

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Telephone Function (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Telephonic Communication Services (AREA)
  • Noise Elimination (AREA)
  • Mobile Radio Communication Systems (AREA)
CN201510670697.8A 2015-10-13 2015-10-13 Noise signal determining method, and voice de-noising method and apparatus Active CN106571146B (zh)

Priority Applications (10)

Application Number Publication Number Priority Date Filing Date Title
CN201510670697.8A CN106571146B (zh) 2015-10-13 2015-10-13 Noise signal determining method, and voice de-noising method and apparatus
SG10202005490WA SG10202005490WA (en) 2015-10-13 2016-10-08 Noise signal determining method and apparatus and voice denoising method and apparatus
ES16854895T ES2807529T3 (es) 2015-10-13 2016-10-08 Method for determining a noise signal and apparatus thereof
PCT/CN2016/101444 WO2017063516A1 (zh) 2015-10-13 2016-10-08 Noise signal determining method, and voice de-noising method and apparatus
KR1020187013177A KR102208855B1 (ko) 2015-10-13 2016-10-08 Noise signal determining method and apparatus, and voice denoising method and apparatus
JP2018519388A JP6784758B2 (ja) 2015-10-13 2016-10-08 Noise signal determining method and apparatus, and voice denoising method and apparatus
PL16854895T PL3364413T3 (pl) 2015-10-13 2016-10-08 Method of determining noise signal and apparatus intended therefor
EP16854895.6A EP3364413B1 (en) 2015-10-13 2016-10-08 Method of determining noise signal and apparatus thereof
SG11201803004YA SG11201803004YA (en) 2015-10-13 2016-10-08 Noise signal determining method and apparatus and voice denoising method and apparatus
US15/951,928 US10796713B2 (en) 2015-10-13 2018-04-12 Identification of noise signal for voice denoising device

Applications Claiming Priority (1)

Application Number Publication Number Priority Date Filing Date Title
CN201510670697.8A CN106571146B (zh) 2015-10-13 2015-10-13 Noise signal determining method, and voice de-noising method and apparatus

Publications (2)

Publication Number Publication Date
CN106571146A (zh) 2017-04-19
CN106571146B (zh) 2019-10-15

Family

ID=58508605

Family Applications (1)

Application Number Status Publication Number Priority Date Filing Date Title
CN201510670697.8A Active CN106571146B (zh) 2015-10-13 2015-10-13 Noise signal determining method, and voice de-noising method and apparatus

Country Status (9)

Country Link
US (1) US10796713B2 (en)
EP (1) EP3364413B1 (en)
JP (1) JP6784758B2 (en)
KR (1) KR102208855B1 (en)
CN (1) CN106571146B (en)
ES (1) ES2807529T3 (en)
PL (1) PL3364413T3 (en)
SG (2) SG11201803004YA (en)
WO (1) WO2017063516A1 (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10504538B2 (en) * 2017-06-01 2019-12-10 Sorenson Ip Holdings, Llc Noise reduction by application of two thresholds in each frequency band in audio signals
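KR102096533B1 (ko) * 2018-09-03 2020-04-02 국방과학연구소 Method and apparatus for detecting a voice section
CN110689901B (zh) * 2019-09-09 2022-06-28 苏州臻迪智能科技有限公司 Voice noise reduction method and apparatus, electronic device, and readable storage medium
JP7331588B2 (ja) * 2019-09-26 2023-08-23 ヤマハ株式会社 Information processing method, estimation model construction method, information processing device, estimation model construction device, and program
JP7012917B2 (ja) * 2019-12-13 2022-01-28 三菱電機株式会社 Information processing device, detection method, and detection program
KR102784793B1 (ko) 2020-08-06 2025-03-21 라인플러스 주식회사 Method and apparatus for noise removal based on time and frequency analysis using deep learning
WO2022141364A1 (zh) * 2020-12-31 2022-07-07 深圳市韶音科技有限公司 Method and system for generating audio
CN112967738B (zh) * 2021-02-01 2024-06-14 腾讯音乐娱乐科技(深圳)有限公司 Human voice detection method and apparatus, electronic device, and computer-readable storage medium
CN115249484A (zh) * 2021-04-27 2022-10-28 大众问问(北京)信息科技有限公司 Voice signal processing method and apparatus, computer device, and storage medium
US20240257823A1 (en) * 2023-01-30 2024-08-01 MIXHalo Corp. Systems and methods for remote real-time audio monitoring
CN119865647A (zh) * 2024-12-23 2025-04-22 海信视像科技股份有限公司 Display device, server, and audio noise reduction and model training methods thereof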

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2031583A1 (en) * 2007-08-31 2009-03-04 Harman Becker Automotive Systems GmbH Fast estimation of spectral noise power density for speech signal enhancement
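CN101968957A (zh) * 2010-10-28 2011-02-09 哈尔滨工程大学 Voice detection method under noisy conditions
CN102314883A (zh) * 2010-06-30 2012-01-11 比亚迪股份有限公司 Method for judging musical noise and voice denoising method
CN103632677A (zh) * 2013-11-27 2014-03-12 腾讯科技(成都)有限公司 Noisy voice signal processing method, apparatus, and server
CN103903629A (zh) * 2012-12-28 2014-07-02 联芯科技有限公司 Noise estimation method and apparatus based on hidden Markov chain model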

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
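JP2966452B2 (ja) * 1989-12-11 1999-10-25 三洋電機株式会社 Noise removal system for voice recognition device
JPH0836400A (ja) * 1994-07-25 1996-02-06 Kokusai Electric Co Ltd Voice state determination circuit
US6529868B1 (en) * 2000-03-28 2003-03-04 Tellabs Operations, Inc. Communication system noise cancellation power signal calculation techniques
US7299173B2 (en) * 2002-01-30 2007-11-20 Motorola Inc. Method and apparatus for speech detection using time-frequency variance
CN101197130B (zh) 2006-12-07 2011-05-18 华为技术有限公司 Voice activity detection method and voice activity detector
CN101627428A (zh) * 2007-03-06 2010-01-13 日本电气株式会社 Noise suppression method, device, and program
JP2009216733A (ja) * 2008-03-06 2009-09-24 Nippon Telegr & Teleph Corp <Ntt> Filter estimation device, signal enhancement device, filter estimation method, signal enhancement method, program, and recording medium
JP4327886B1 (ja) 2008-05-30 2009-09-09 株式会社東芝 Sound quality correction device, sound quality correction method, and sound quality correction program
CN102792373B (zh) * 2010-03-09 2014-05-07 三菱电机株式会社 Noise suppression device
CN101853661B (zh) * 2010-05-14 2012-05-30 中国科学院声学研究所 Noise spectrum estimation and voice activity detection method based on unsupervised learning
JP4937393B2 (ja) 2010-09-17 2012-05-23 株式会社東芝 Sound quality correction device and voice correction method
CN102800322B (zh) * 2011-05-27 2014-03-26 中国科学院声学研究所 Noise power spectrum estimation and voice activity detection method
CN103489446B (zh) * 2013-10-10 2016-01-06 福州大学 Birdsong recognition method based on adaptive energy detection in complex environments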

Also Published As

Publication number Publication date
SG11201803004YA (en) 2018-05-30
EP3364413B1 (en) 2020-06-10
US10796713B2 (en) 2020-10-06
SG10202005490WA (en) 2020-07-29
PL3364413T3 (pl) 2020-10-19
CN106571146A (zh) 2017-04-19
ES2807529T3 (es) 2021-02-23
WO2017063516A1 (zh) 2017-04-20
EP3364413A4 (en) 2019-06-26
EP3364413A1 (en) 2018-08-22
JP6784758B2 (ja) 2020-11-11
US20180293997A1 (en) 2018-10-11
KR20180067608A (ko) 2018-06-20
KR102208855B1 (ko) 2021-01-29
JP2018534618A (ja) 2018-11-22

Similar Documents

Publication Publication Date Title
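CN106571146B (zh) Noise signal determining method, and voice de-noising method and apparatus
Guan et al. Does country-level R&D efficiency benefit from the collaboration network structure?
US10026418B2 (en) Abnormal frame detection method and apparatus
WO2020173133A1 (zh) Training method for emotion recognition model, emotion recognition method, apparatus, device, and storage medium
CN117373487B (zh) Audio-based equipment fault detection method and apparatus, and related device
CN111968670B (zh) Audio recognition method and apparatus
CN105788603A (zh) Audio recognition method and system based on empirical mode decomposition
CN110706693A (zh) Voice endpoint determination method and apparatus, storage medium, and electronic device
CN110782915A (zh) Waveform music component separation method based on deep learning
BR112014009647B1 (pt) Noise attenuation apparatus and noise attenuation method
CN108922514B (zh) Robust feature extraction method based on low-frequency log spectrum
AU2015271580B2 (en) Method for processing speech/audio signal and apparatus
CN110708370B (zh) Data processing method and terminal
CN105355206B (zh) Voiceprint feature extraction method and electronic device
CN112055284B (zh) Echo cancellation method, and neural network training method, apparatus, medium, and device
CN114678040B (zh) Voice consistency detection method, apparatus, device, and storage medium
EP2382623B1 (en) Aligning scheme for audio signals
CN107481732B (zh) Noise reduction method and apparatus in spoken language assessment, and terminal device
CN106340310B (zh) Voice detection method and apparatus
CN110491413A (zh) Audio content consistency monitoring method and system based on Siamese network
EP2840570A1 (en) Enhanced estimation of at least one target signal
CN110097893B (zh) Audio signal conversion method and apparatus
CN110097888B (zh) Human voice enhancement method, apparatus, and device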
HK1235538A (en) Noise signal determining method, and voice de-noising method and apparatus
HK1235538A1 (en) Noise signal determining method, and voice de-noising method and apparatus

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1235538

Country of ref document: HK

GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 2020-09-21

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman, British Islands

Patentee after: Innovative advanced technology Co.,Ltd.

Address before: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman, British Islands

Patentee before: Advanced innovation technology Co.,Ltd.

Effective date of registration: 2020-09-21

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman, British Islands

Patentee after: Advanced innovation technology Co.,Ltd.

Address before: A four-storey 847 mailbox in Grand Cayman Capital Building, British Cayman Islands

Patentee before: Alibaba Group Holding Ltd.