SG11201903320XA - Voice signal detection method and apparatus - Google Patents

Voice signal detection method and apparatus

Info

Publication number
SG11201903320XA
Authority
SG
Singapore
Prior art keywords
voice signal
detection method
signal detection
short
audio signal
Prior art date
Application number
SG11201903320XA
Other languages
English (en)
Inventor
Lei Jiao
Yanchu Guan
Xiaodong Zeng
Feng Lin
Original Assignee
Alibaba Group Holding Ltd
Priority date
Filing date
Publication date
Family has litigation
First worldwide family litigation filed: https://patents.darts-ip.com/?family=59176496&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=SG11201903320X(A) ("Global patent litigation dataset" by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.)
Application filed by Alibaba Group Holding Ltd
Publication of SG11201903320XA

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L25/87 Detection of discrete points within a voice signal
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L25/84 Detection of presence or absence of voice signals for discriminating voice from noise
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L2025/783 Detection of presence or absence of voice signals based on threshold decision

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephone Function (AREA)
  • Circuits Of Receivers In General (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Time-Division Multiplex Systems (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
  • Electric Clocks (AREA)
SG11201903320XA 2016-10-12 2017-09-26 Voice signal detection method and apparatus SG11201903320XA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610890946.9A CN106887241A (zh) 2016-10-12 2016-10-12 Voice signal detection method and apparatus
PCT/CN2017/103489 WO2018068636A1 (zh) 2016-10-12 2017-09-26 Voice signal detection method and apparatus

Publications (1)

Publication Number Publication Date
SG11201903320XA (en) 2019-05-30

Family

ID=59176496

Family Applications (1)

Application Number Priority Date Filing Date Title
SG11201903320XA SG11201903320XA (en) 2016-10-12 2017-09-26 Voice signal detection method and apparatus

Country Status (10)

Country Link
US (1) US10706874B2
EP (1) EP3528251B1
JP (2) JP6859499B2
KR (1) KR102214888B1
CN (1) CN106887241A
MY (1) MY201634A
PH (1) PH12019500784B1
SG (1) SG11201903320XA
TW (1) TWI654601B
WO (1) WO2018068636A1

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
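CN106887241A (zh) * 2016-10-12 2017-06-23 阿里巴巴集团控股有限公司 Voice signal detection method and apparatus
CN107957918B (zh) * 2016-10-14 2019-05-10 腾讯科技(深圳)有限公司 Data recovery method and apparatus
CN108257616A (zh) * 2017-12-05 2018-07-06 苏州车萝卜汽车电子科技有限公司 Method and apparatus for detecting human-machine dialogue
CN108305639B (zh) * 2018-05-11 2021-03-09 南京邮电大学 Speech emotion recognition method, computer-readable storage medium, and terminal
CN108682432B (zh) * 2018-05-11 2021-03-16 南京邮电大学 Speech emotion recognition apparatus
CN108847217A (zh) * 2018-05-31 2018-11-20 平安科技(深圳)有限公司 Speech segmentation method and apparatus, computer device, and storage medium
CN109545193B (zh) * 2018-12-18 2023-03-14 百度在线网络技术(北京)有限公司 Method and apparatus for generating a model
CN110225444A (zh) * 2019-06-14 2019-09-10 四川长虹电器股份有限公司 Fault detection method for a microphone array system and detection system thereof
CN111724783B (zh) * 2020-06-24 2023-10-17 北京小米移动软件有限公司 Wake-up method and apparatus for a smart device, smart device, and medium
CN113270118B (zh) * 2021-05-14 2024-02-13 杭州网易智企科技有限公司 Voice activity detection method and apparatus, storage medium, and electronic device
CN116612775A (zh) * 2022-02-09 2023-08-18 宸芯科技股份有限公司 Noise elimination method and apparatus, electronic device, and medium
CN114792530B (zh) * 2022-04-26 2025-07-04 美的集团(上海)有限公司 Voice data processing method and apparatus, electronic device, and storage medium
CN114898774B (zh) * 2022-05-06 2025-06-13 钉钉(中国)信息技术有限公司 Method and apparatus for detecting audio dropouts
CN116863947A (zh) * 2023-07-27 2023-10-10 海纳科德(湖北)科技有限公司 Method and system for recognizing emotion from pet voice signals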

Family Cites Families (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3297346B2 (ja) * 1997-04-30 2002-07-02 沖電気工業株式会社 Voice detection device
TW333610B (en) 1997-10-16 1998-06-11 Winbond Electronics Corp The phonetic detecting apparatus and its detecting method
US6480823B1 (en) 1998-03-24 2002-11-12 Matsushita Electric Industrial Co., Ltd. Speech detection for noisy conditions
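JP3266124B2 (ja) * 1999-01-07 2002-03-18 ヤマハ株式会社 Device for detecting similar waveforms in an analog signal and device for time-axis expansion and compression of the same signal
KR100463657B1 (ko) * 2002-11-30 2004-12-29 삼성전자주식회사 Apparatus and method for detecting voice sections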
US7715447B2 (en) 2003-12-23 2010-05-11 Intel Corporation Method and system for tone detection
CN101625860B (zh) * 2008-07-10 2012-07-04 新奥特(北京)视频技术有限公司 Adaptive background noise adjustment method for voice endpoint detection
WO2010061505A1 (ja) 2008-11-27 2010-06-03 日本電気株式会社 Uttered speech detection device
CN101494049B (zh) * 2009-03-11 2011-07-27 北京邮电大学 Method for extracting audio feature parameters for use in an audio monitoring system
ES2371619B1 (es) 2009-10-08 2012-08-08 Telefónica, S.A. Method for detecting voice segments.
CN104485118A (zh) 2009-10-19 2015-04-01 瑞典爱立信有限公司 Detector and method for voice activity detection
KR101666521B1 (ko) * 2010-01-08 2016-10-14 삼성전자 주식회사 Method and apparatus for detecting the pitch period of an input signal
US20130090926A1 (en) 2011-09-16 2013-04-11 Qualcomm Incorporated Mobile device context information using speech detection
CN102568457A (zh) * 2011-12-23 2012-07-11 深圳市万兴软件有限公司 Music synthesis method and apparatus based on humming input
US9351089B1 (en) * 2012-03-14 2016-05-24 Amazon Technologies, Inc. Audio tap detection
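JP5772739B2 (ja) * 2012-06-21 2015-09-02 ヤマハ株式会社 Voice processing device
CN103544961B (zh) * 2012-07-10 2017-12-19 中兴通讯股份有限公司 Voice signal processing method and apparatus
CN107195313B (zh) * 2012-08-31 2021-02-09 瑞典爱立信有限公司 Method and device for voice activity detection
CN103117067B (zh) * 2013-01-19 2015-07-15 渤海大学 Voice endpoint detection method under low signal-to-noise ratio
CN103177722B (zh) * 2013-03-08 2016-04-20 北京理工大学 Song retrieval method based on timbre similarity
CN103198838A (zh) * 2013-03-29 2013-07-10 苏州皓泰视频技术有限公司 Abnormal sound monitoring method and monitoring device for embedded systems
CN103247293B (zh) * 2013-05-14 2015-04-08 中国科学院自动化研究所 Method for encoding and decoding voice data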
WO2014194273A2 (en) * 2013-05-30 2014-12-04 Eisner, Mark Systems and methods for enhancing targeted audibility
US9502028B2 (en) * 2013-10-18 2016-11-22 Knowles Electronics, Llc Acoustic activity detection apparatus and method
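CN103646649B (zh) * 2013-12-30 2016-04-13 中国科学院自动化研究所 Efficient voice detection method
CN104916288B (zh) 2014-03-14 2019-01-18 深圳Tcl新技术有限公司 Method and apparatus for emphasizing the human voice in audio
CN104934032B (zh) * 2014-03-17 2019-04-05 华为技术有限公司 Method and apparatus for processing a voice signal according to frequency-domain energy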
US9406313B2 (en) * 2014-03-21 2016-08-02 Intel Corporation Adaptive microphone sampling rate techniques
CN106328168B (zh) * 2016-08-30 2019-10-18 成都普创通信技术股份有限公司 Voice signal similarity detection method
CN106887241A (zh) * 2016-10-12 2017-06-23 阿里巴巴集团控股有限公司 Voice signal detection method and apparatus

Also Published As

Publication number Publication date
JP2019535039A (ja) 2019-12-05
PH12019500784A1 (en) 2019-11-11
WO2018068636A1 (zh) 2018-04-19
US10706874B2 (en) 2020-07-07
JP6999012B2 (ja) 2022-01-18
US20190237097A1 (en) 2019-08-01
KR20190061076A (ko) 2019-06-04
TWI654601B (zh) 2019-03-21
EP3528251A1 (en) 2019-08-21
MY201634A (en) 2024-03-06
CN106887241A (zh) 2017-06-23
JP2021071729A (ja) 2021-05-06
PH12019500784B1 (en) 2024-02-28
EP3528251A4 (en) 2019-08-21
EP3528251B1 (en) 2022-02-23
TW201814692A (zh) 2018-04-16
KR102214888B1 (ko) 2021-02-15
JP6859499B2 (ja) 2021-04-14

Similar Documents

Publication Publication Date Title
SG11201903320XA (en) Voice signal detection method and apparatus
GB2562664A (en) Methods for detecting a sleep disorder and sleep disorder detection devices
SG11201907257SA (en) Model training method, apparatus, and device, and data similarity determining method, apparatus, and device
HUE068485T2 (hu) Energy-saving signal detection method, resource determination method, and devices therefor
MX360586B (es) Alarm method and device.
MX361526B (es) Alarm method and device.
MX359182B (es) Method and device for video recording.
SG11201909141TA (en) Face image processing methods and apparatuses, and electronic devices
IN2015MN01766A
MY193521A (en) Method for detecting audio signal and apparatus
MX2016008877A (es) Apparatus and method for generating a plurality of audio channels.
MY185159A (en) Apparatus and method for generating a frequency enhanced signal using temporal smoothing of subbands
MY202725A (en) Sound quality identification method and device for sound file
PH12019502472A1 (en) Method and device for discontinuous reception
EP2664062A4 (en) METHOD AND APPARATUS FOR IMPROVING VOICE QUALITY
NZ714039A (en) Decoding method and decoding apparatus
GB201216254D0 (en) Method, apparatus and manufacture for smiling face detection
MX373497B (es) Method and device for extracting a feature.
MY179546A (en) Method for processing speech/audio signal and apparatus
EP3901420A3 (en) Flutter detection sensor
MY183933A (en) Apparatus and methods of switching coding technologies at a device
PH12019502393A1 (en) Signal processing method and apparatus
EP4657898A3 (en) Processing audio signals
MY178408A (en) Method and apparatus for processing lost frame
MY198074A (en) Information processing device, and signal transmission control method