CN112955951A - 语音端点检测方法、装置、存储介质及电子设备 - Google Patents

语音端点检测方法、装置、存储介质及电子设备 Download PDF

Info

Publication number
CN112955951A
CN112955951A CN201880097699.4A CN201880097699A CN112955951A CN 112955951 A CN112955951 A CN 112955951A CN 201880097699 A CN201880097699 A CN 201880097699A CN 112955951 A CN112955951 A CN 112955951A
Authority
CN
China
Prior art keywords
noise
signal
frame
frequency domain
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201880097699.4A
Other languages
English (en)
Inventor
陈岩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oppo Mobile Telecommunications Corp Ltd
Shenzhen Huantai Technology Co Ltd
Original Assignee
Guangdong Oppo Mobile Telecommunications Corp Ltd
Shenzhen Huantai Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Oppo Mobile Telecommunications Corp Ltd, Shenzhen Huantai Technology Co Ltd filed Critical Guangdong Oppo Mobile Telecommunications Corp Ltd
Publication of CN112955951A publication Critical patent/CN112955951A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephone Function (AREA)

Abstract

一种语音端点检测方法、装置、存储介质及电子设备,该方法包括:获取含噪语音信号(101);对所述含噪语音信号进行降噪处理,得到降噪语音信号(102);计算所述降噪语音信号的谱熵比值,并计算所述降噪语音信号的短时能量(103);根据所述降噪语音信号的谱熵比值和所述降噪语音信号的短时能量进行语音端点检测(104)。

Description

PCT国内申请,说明书已公开。

Claims (20)

  1. PCT国内申请,权利要求书已公开。
CN201880097699.4A 2018-11-15 2018-11-15 语音端点检测方法、装置、存储介质及电子设备 Pending CN112955951A (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/115601 WO2020097841A1 (zh) 2018-11-15 2018-11-15 语音端点检测方法、装置、存储介质及电子设备

Publications (1)

Publication Number Publication Date
CN112955951A true CN112955951A (zh) 2021-06-11

Family

ID=70731178

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880097699.4A Pending CN112955951A (zh) 2018-11-15 2018-11-15 语音端点检测方法、装置、存储介质及电子设备

Country Status (2)

Country Link
CN (1) CN112955951A (zh)
WO (1) WO2020097841A1 (zh)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2012215600A (ja) * 2011-03-31 2012-11-08 Oki Electric Ind Co Ltd 音声区間判定装置、音声区間判定方法、及びプログラム
CN104810024A (zh) * 2014-01-28 2015-07-29 上海力声特医学科技有限公司 一种双路麦克风语音降噪处理方法及系统
CN105023572A (zh) * 2014-04-16 2015-11-04 王景芳 一种含噪语音端点鲁棒检测方法
CN105825871A (zh) * 2016-03-16 2016-08-03 大连理工大学 一种无前导静音段语音的端点检测方法
CN106653062A (zh) * 2017-02-17 2017-05-10 重庆邮电大学 一种低信噪比环境下基于谱熵改进的语音端点检测方法
CN107731223A (zh) * 2017-11-22 2018-02-23 腾讯科技(深圳)有限公司 语音活性检测方法、相关装置和设备
CN107910017A (zh) * 2017-12-19 2018-04-13 河海大学 一种带噪语音端点检测中阈值设定的方法
CN108428456A (zh) * 2018-03-29 2018-08-21 浙江凯池电子科技有限公司 语音降噪算法

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4587160B2 (ja) * 2004-03-26 2010-11-24 キヤノン株式会社 信号処理装置および方法
CN101599269B (zh) * 2009-07-02 2011-07-20 中国农业大学 语音端点检测方法及装置

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2012215600A (ja) * 2011-03-31 2012-11-08 Oki Electric Ind Co Ltd 音声区間判定装置、音声区間判定方法、及びプログラム
CN104810024A (zh) * 2014-01-28 2015-07-29 上海力声特医学科技有限公司 一种双路麦克风语音降噪处理方法及系统
CN105023572A (zh) * 2014-04-16 2015-11-04 王景芳 一种含噪语音端点鲁棒检测方法
CN105825871A (zh) * 2016-03-16 2016-08-03 大连理工大学 一种无前导静音段语音的端点检测方法
CN106653062A (zh) * 2017-02-17 2017-05-10 重庆邮电大学 一种低信噪比环境下基于谱熵改进的语音端点检测方法
CN107731223A (zh) * 2017-11-22 2018-02-23 腾讯科技(深圳)有限公司 语音活性检测方法、相关装置和设备
CN107910017A (zh) * 2017-12-19 2018-04-13 河海大学 一种带噪语音端点检测中阈值设定的方法
CN108428456A (zh) * 2018-03-29 2018-08-21 浙江凯池电子科技有限公司 语音降噪算法

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
阴法明,唐於烽: "基于深度置信网络的语音增强算法", 电子电器, vol. 41, no. 5, pages 1325 - 1329 *

Also Published As

Publication number Publication date
WO2020097841A1 (zh) 2020-05-22

Similar Documents

Publication Publication Date Title
EP3828885B1 (en) Voice denoising method and apparatus, computing device and computer readable storage medium
US11056130B2 (en) Speech enhancement method and apparatus, device and storage medium
US10504539B2 (en) Voice activity detection systems and methods
WO2019101123A1 (zh) 语音活性检测方法、相关装置和设备
CN110021307B (zh) 音频校验方法、装置、存储介质及电子设备
CN108831500A (zh) 语音增强方法、装置、计算机设备及存储介质
CN111445919B (zh) 结合ai模型的语音增强方法、系统、电子设备和介质
CN109616098B (zh) 基于频域能量的语音端点检测方法和装置
US20140214418A1 (en) Sound processing device and sound processing method
CN112004177B (zh) 一种啸叫检测方法、麦克风音量调节方法及存储介质
US9374651B2 (en) Sensitivity calibration method and audio device
CN109346062B (zh) 语音端点检测方法及装置
CN110875049B (zh) 语音信号的处理方法及装置
CN110648687B (zh) 一种活动语音检测方法及系统
CN110556125B (zh) 基于语音信号的特征提取方法、设备及计算机存储介质
US10839820B2 (en) Voice processing method, apparatus, device and storage medium
CN110503973B (zh) 音频信号瞬态噪音抑制方法、系统以及存储介质
WO2022218254A1 (zh) 语音信号增强方法、装置及电子设备
US11594239B1 (en) Detection and removal of wind noise
CN109102823B (zh) 一种基于子带谱熵的语音增强方法
CN113160846B (zh) 噪声抑制方法和电子设备
WO2017128910A1 (zh) 一种语音出现概率的确定方法、装置及电子设备
CN110556128B (zh) 一种语音活动性检测方法、设备及计算机可读存储介质
CN116959495A (zh) 一种语音信号信噪比估计方法、系统
CN112955951A (zh) 语音端点检测方法、装置、存储介质及电子设备

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination