KR102208855B1 - 노이즈 신호 결정 방법과 장치, 및 음성 노이즈 제거 방법과 장치 - Google Patents

노이즈 신호 결정 방법과 장치, 및 음성 노이즈 제거 방법과 장치 Download PDF

Info

Publication number
KR102208855B1
KR102208855B1 KR1020187013177A KR20187013177A KR102208855B1 KR 102208855 B1 KR102208855 B1 KR 102208855B1 KR 1020187013177 A KR1020187013177 A KR 1020187013177A KR 20187013177 A KR20187013177 A KR 20187013177A KR 102208855 B1 KR102208855 B1 KR 102208855B1
Authority
KR
South Korea
Prior art keywords
signal
noise
variance
frame
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
KR1020187013177A
Other languages
English (en)
Korean (ko)
Other versions
KR20180067608A (ko
Inventor
지준 두
Original Assignee
어드밴스드 뉴 테크놀로지스 씨오., 엘티디.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 어드밴스드 뉴 테크놀로지스 씨오., 엘티디. filed Critical 어드밴스드 뉴 테크놀로지스 씨오., 엘티디.
Publication of KR20180067608A publication Critical patent/KR20180067608A/ko
Application granted granted Critical
Publication of KR102208855B1 publication Critical patent/KR102208855B1/ko
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0324Details of processing therefor
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02168Noise filtering characterised by the method used for estimating noise the estimation exclusively taking place during speech pauses
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)
  • Noise Elimination (AREA)
  • Mobile Radio Communication Systems (AREA)
KR1020187013177A 2015-10-13 2016-10-08 노이즈 신호 결정 방법과 장치, 및 음성 노이즈 제거 방법과 장치 Active KR102208855B1 (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN201510670697.8A CN106571146B (zh) 2015-10-13 2015-10-13 噪音信号确定方法、语音去噪方法及装置
CN201510670697.8 2015-10-13
PCT/CN2016/101444 WO2017063516A1 (zh) 2015-10-13 2016-10-08 噪音信号确定方法、语音去噪方法及装置

Publications (2)

Publication Number Publication Date
KR20180067608A KR20180067608A (ko) 2018-06-20
KR102208855B1 true KR102208855B1 (ko) 2021-01-29

Family

ID=58508605

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020187013177A Active KR102208855B1 (ko) 2015-10-13 2016-10-08 노이즈 신호 결정 방법과 장치, 및 음성 노이즈 제거 방법과 장치

Country Status (9)

Country Link
US (1) US10796713B2 (enExample)
EP (1) EP3364413B1 (enExample)
JP (1) JP6784758B2 (enExample)
KR (1) KR102208855B1 (enExample)
CN (1) CN106571146B (enExample)
ES (1) ES2807529T3 (enExample)
PL (1) PL3364413T3 (enExample)
SG (2) SG10202005490WA (enExample)
WO (1) WO2017063516A1 (enExample)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10504538B2 (en) * 2017-06-01 2019-12-10 Sorenson Ip Holdings, Llc Noise reduction by application of two thresholds in each frequency band in audio signals
KR102096533B1 (ko) * 2018-09-03 2020-04-02 국방과학연구소 음성 구간을 검출하는 방법 및 장치
CN110689901B (zh) * 2019-09-09 2022-06-28 苏州臻迪智能科技有限公司 语音降噪的方法、装置、电子设备及可读存储介质
JP7331588B2 (ja) * 2019-09-26 2023-08-23 ヤマハ株式会社 情報処理方法、推定モデル構築方法、情報処理装置、推定モデル構築装置およびプログラム
EP4060662B1 (en) * 2019-12-13 2025-12-03 Mitsubishi Electric Corporation Information processing device, detection method, and detection program
KR102784793B1 (ko) 2020-08-06 2025-03-21 라인플러스 주식회사 딥러닝을 이용한 시간 및 주파수 분석 기반의 노이즈 제거 방법 및 장치
CN116134834A (zh) * 2020-12-31 2023-05-16 深圳市韶音科技有限公司 生成音频的方法和系统
CN112967738B (zh) * 2021-02-01 2024-06-14 腾讯音乐娱乐科技(深圳)有限公司 人声检测方法、装置及电子设备和计算机可读存储介质
CN115249484A (zh) * 2021-04-27 2022-10-28 大众问问(北京)信息科技有限公司 语音信号处理方法、装置、计算机设备和存储介质
US20240257823A1 (en) * 2023-01-30 2024-08-01 MIXHalo Corp. Systems and methods for remote real-time audio monitoring
CN119865647A (zh) * 2024-12-23 2025-04-22 海信视像科技股份有限公司 一种显示设备、服务器及其音频降噪和模型训练方法

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101968957A (zh) 2010-10-28 2011-02-09 哈尔滨工程大学 一种噪声条件下的语音检测方法
EP2546831A1 (en) 2010-03-09 2013-01-16 Mitsubishi Electric Corporation Noise suppression device

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2966452B2 (ja) * 1989-12-11 1999-10-25 三洋電機株式会社 音声認識装置の雑音除去システム
JPH0836400A (ja) * 1994-07-25 1996-02-06 Kokusai Electric Co Ltd 音声状態判定回路
US6529868B1 (en) * 2000-03-28 2003-03-04 Tellabs Operations, Inc. Communication system noise cancellation power signal calculation techniques
US7299173B2 (en) * 2002-01-30 2007-11-20 Motorola Inc. Method and apparatus for speech detection using time-frequency variance
CN101197130B (zh) 2006-12-07 2011-05-18 华为技术有限公司 声音活动检测方法和声音活动检测器
CN101627428A (zh) * 2007-03-06 2010-01-13 日本电气株式会社 抑制杂音的方法、装置以及程序
EP2031583B1 (en) * 2007-08-31 2010-01-06 Harman Becker Automotive Systems GmbH Fast estimation of spectral noise power density for speech signal enhancement
JP2009216733A (ja) * 2008-03-06 2009-09-24 Nippon Telegr & Teleph Corp <Ntt> フィルタ推定装置、信号強調装置、フィルタ推定方法、信号強調方法、プログラム、記録媒体
JP4327886B1 (ja) 2008-05-30 2009-09-09 株式会社東芝 音質補正装置、音質補正方法及び音質補正用プログラム
CN101853661B (zh) * 2010-05-14 2012-05-30 中国科学院声学研究所 基于非监督学习的噪声谱估计与语音活动度检测方法
CN102314883B (zh) * 2010-06-30 2013-08-21 比亚迪股份有限公司 一种判断音乐噪声的方法以及语音消噪方法
JP4937393B2 (ja) 2010-09-17 2012-05-23 株式会社東芝 音質補正装置及び音声補正方法
CN102800322B (zh) * 2011-05-27 2014-03-26 中国科学院声学研究所 一种噪声功率谱估计与语音活动性检测方法
CN103903629B (zh) * 2012-12-28 2017-02-15 联芯科技有限公司 基于隐马尔科夫链模型的噪声估计方法和装置
CN103489446B (zh) * 2013-10-10 2016-01-06 福州大学 复杂环境下基于自适应能量检测的鸟鸣识别方法
CN103632677B (zh) * 2013-11-27 2016-09-28 腾讯科技(成都)有限公司 带噪语音信号处理方法、装置及服务器

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2546831A1 (en) 2010-03-09 2013-01-16 Mitsubishi Electric Corporation Noise suppression device
CN101968957A (zh) 2010-10-28 2011-02-09 哈尔滨工程大学 一种噪声条件下的语音检测方法

Also Published As

Publication number Publication date
SG10202005490WA (en) 2020-07-29
JP2018534618A (ja) 2018-11-22
SG11201803004YA (en) 2018-05-30
CN106571146B (zh) 2019-10-15
EP3364413A1 (en) 2018-08-22
US10796713B2 (en) 2020-10-06
JP6784758B2 (ja) 2020-11-11
KR20180067608A (ko) 2018-06-20
CN106571146A (zh) 2017-04-19
PL3364413T3 (pl) 2020-10-19
WO2017063516A1 (zh) 2017-04-20
US20180293997A1 (en) 2018-10-11
ES2807529T3 (es) 2021-02-23
EP3364413A4 (en) 2019-06-26
EP3364413B1 (en) 2020-06-10

Similar Documents

Publication Publication Date Title
KR102208855B1 (ko) 노이즈 신호 결정 방법과 장치, 및 음성 노이즈 제거 방법과 장치
JP6793706B2 (ja) 音声信号を検出するための方法および装置
EP2828856B1 (en) Audio classification using harmonicity estimation
JP2018534618A5 (enExample)
CN110706693B (zh) 语音端点的确定方法及装置、存储介质、电子装置
US8891786B1 (en) Selective notch filtering for howling suppression
CN103903633B (zh) 检测语音信号的方法和装置
US9997168B2 (en) Method and apparatus for signal extraction of audio signal
CN110609088B (zh) 一种支柱绝缘子故障识别方法及装置
CN106850511B (zh) 识别访问攻击的方法及装置
US9213703B1 (en) Pitch shift and time stretch resistant audio matching
Patti et al. Methods for classification of nocturnal migratory bird vocalizations using Pseudo Wigner-Ville Transform
US9978393B1 (en) System and method for automatically removing noise defects from sound recordings
US20150317997A1 (en) System and method for low-loss removal of stationary and non-stationary short-time interferences
US10319201B2 (en) Systems and methods for hierarchical acoustic detection of security threats
GB2580821A (en) Analysing speech signals
US9129015B1 (en) Min/max filter for audio matching
HK1235538A1 (en) Noise signal determining method, and voice de-noising method and apparatus
HK1235538A (en) Noise signal determining method, and voice de-noising method and apparatus
HK1235538B (zh) 噪音信号确定方法、语音去噪方法及装置
US9269370B2 (en) Adaptive speech filter for attenuation of ambient noise
Sharan et al. Subband spectral histogram feature for improved sound recognition in low SNR conditions
CN119478776A (zh) 一种输电线路杆塔微风振动视频分析方法及预警装置
Miao et al. Mixing vector construction for single channel semi-blind source separation using Empirical Mode Decomposition
JP2016045456A5 (enExample)

Legal Events

Date Code Title Description
PA0105 International application

Patent event date: 20180509

Patent event code: PA01051R01D

Comment text: International Patent Application

PG1501 Laying open of application
PA0201 Request for examination

Patent event code: PA02012R01D

Patent event date: 20191227

Comment text: Request for Examination of Application

A302 Request for accelerated examination
PA0302 Request for accelerated examination

Patent event date: 20200518

Patent event code: PA03022R01D

Comment text: Request for Accelerated Examination

E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

Comment text: Notification of reason for refusal

Patent event date: 20200917

Patent event code: PE09021S01D

N231 Notification of change of applicant
PN2301 Change of applicant

Patent event date: 20201204

Comment text: Notification of Change of Applicant

Patent event code: PN23011R01D

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

Patent event code: PE07011S01D

Comment text: Decision to Grant Registration

Patent event date: 20210108

PR0701 Registration of establishment

Comment text: Registration of Establishment

Patent event date: 20210122

Patent event code: PR07011E01D

PR1002 Payment of registration fee

Payment date: 20210125

End annual number: 3

Start annual number: 1

PG1601 Publication of registration
PR1001 Payment of annual fee

Payment date: 20240103

Start annual number: 4

End annual number: 4

PR1001 Payment of annual fee

Payment date: 20241219

Start annual number: 5

End annual number: 5