TWI654601B - 語音信號檢測方法與裝置 - Google Patents

語音信號檢測方法與裝置

Info

Publication number
TWI654601B
TWI654601B TW106131148A TW106131148A TWI654601B TW I654601 B TWI654601 B TW I654601B TW 106131148 A TW106131148 A TW 106131148A TW 106131148 A TW106131148 A TW 106131148A TW I654601 B TWI654601 B TW I654601B
Authority
TW
Taiwan
Prior art keywords
audio signal
short
signal
term energy
voice signal
Prior art date
Application number
TW106131148A
Other languages
English (en)
Chinese (zh)
Other versions
TW201814692A (zh
Inventor
焦雷
官硯楚
曾曉東
林鋒
Original Assignee
Alibaba Group Services Limited
香港商阿里巴巴集團服務有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=59176496&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=TWI654601(B) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Alibaba Group Services Limited, 香港商阿里巴巴集團服務有限公司 filed Critical Alibaba Group Services Limited
Publication of TW201814692A publication Critical patent/TW201814692A/zh
Application granted granted Critical
Publication of TWI654601B publication Critical patent/TWI654601B/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/87Detection of discrete points within a voice signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephone Function (AREA)
  • Circuits Of Receivers In General (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Time-Division Multiplex Systems (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
  • Electric Clocks (AREA)
TW106131148A 2016-10-12 2017-09-12 語音信號檢測方法與裝置 TWI654601B (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
??201610890946.9 2016-10-12
CN201610890946.9A CN106887241A (zh) 2016-10-12 2016-10-12 一种语音信号检测方法与装置

Publications (2)

Publication Number Publication Date
TW201814692A TW201814692A (zh) 2018-04-16
TWI654601B true TWI654601B (zh) 2019-03-21

Family

ID=59176496

Family Applications (1)

Application Number Title Priority Date Filing Date
TW106131148A TWI654601B (zh) 2016-10-12 2017-09-12 語音信號檢測方法與裝置

Country Status (10)

Country Link
US (1) US10706874B2 (https=)
EP (1) EP3528251B1 (https=)
JP (2) JP6859499B2 (https=)
KR (1) KR102214888B1 (https=)
CN (1) CN106887241A (https=)
MY (1) MY201634A (https=)
PH (1) PH12019500784B1 (https=)
SG (1) SG11201903320XA (https=)
TW (1) TWI654601B (https=)
WO (1) WO2018068636A1 (https=)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106887241A (zh) * 2016-10-12 2017-06-23 阿里巴巴集团控股有限公司 一种语音信号检测方法与装置
CN107957918B (zh) * 2016-10-14 2019-05-10 腾讯科技(深圳)有限公司 数据恢复方法和装置
CN108257616A (zh) * 2017-12-05 2018-07-06 苏州车萝卜汽车电子科技有限公司 人机对话的检测方法以及装置
CN108305639B (zh) * 2018-05-11 2021-03-09 南京邮电大学 语音情感识别方法、计算机可读存储介质、终端
CN108682432B (zh) * 2018-05-11 2021-03-16 南京邮电大学 语音情感识别装置
CN108847217A (zh) * 2018-05-31 2018-11-20 平安科技(深圳)有限公司 一种语音切分方法、装置、计算机设备及存储介质
CN109545193B (zh) * 2018-12-18 2023-03-14 百度在线网络技术(北京)有限公司 用于生成模型的方法和装置
CN110225444A (zh) * 2019-06-14 2019-09-10 四川长虹电器股份有限公司 一种麦克风阵列系统的故障检测方法及其检测系统
CN111724783B (zh) * 2020-06-24 2023-10-17 北京小米移动软件有限公司 智能设备的唤醒方法、装置、智能设备及介质
CN113270118B (zh) * 2021-05-14 2024-02-13 杭州网易智企科技有限公司 语音活动侦测方法及装置、存储介质和电子设备
CN116612775A (zh) * 2022-02-09 2023-08-18 宸芯科技股份有限公司 一种杂音消除方法、装置、电子设备及介质
CN114792530B (zh) * 2022-04-26 2025-07-04 美的集团(上海)有限公司 语音数据处理方法、装置、电子设备和存储介质
CN114898774B (zh) * 2022-05-06 2025-06-13 钉钉(中国)信息技术有限公司 一种音频掉点的检测方法及装置
CN116863947A (zh) * 2023-07-27 2023-10-10 海纳科德(湖北)科技有限公司 一种利用宠物语音信号识别情绪的方法及系统

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW333610B (en) 1997-10-16 1998-06-11 Winbond Electronics Corp The phonetic detecting apparatus and its detecting method
TW436759B (en) 1998-03-24 2001-05-28 Matsushita Electric Industrial Co Ltd Speech detection system for noisy conditions
TW201320058A (zh) 2011-09-16 2013-05-16 Qualcomm Inc 使用語音偵測之行動裝置情境資訊
TW201519222A (zh) 2013-10-18 2015-05-16 Knowles Electronics Llc 聲音活動偵測裝置和方法

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3297346B2 (ja) * 1997-04-30 2002-07-02 沖電気工業株式会社 音声検出装置
JP3266124B2 (ja) * 1999-01-07 2002-03-18 ヤマハ株式会社 アナログ信号中の類似波形検出装置及び同信号の時間軸伸長圧縮装置
KR100463657B1 (ko) * 2002-11-30 2004-12-29 삼성전자주식회사 음성구간 검출 장치 및 방법
US7715447B2 (en) 2003-12-23 2010-05-11 Intel Corporation Method and system for tone detection
CN101625860B (zh) * 2008-07-10 2012-07-04 新奥特(北京)视频技术有限公司 语音端点检测中的背景噪声自适应调整方法
WO2010061505A1 (ja) 2008-11-27 2010-06-03 日本電気株式会社 発話音声検出装置
CN101494049B (zh) * 2009-03-11 2011-07-27 北京邮电大学 一种用于音频监控系统中的音频特征参数的提取方法
ES2371619B1 (es) 2009-10-08 2012-08-08 Telefónica, S.A. Procedimiento de detección de segmentos de voz.
CN104485118A (zh) 2009-10-19 2015-04-01 瑞典爱立信有限公司 用于语音活动检测的检测器和方法
KR101666521B1 (ko) * 2010-01-08 2016-10-14 삼성전자 주식회사 입력 신호의 피치 주기 검출 방법 및 그 장치
CN102568457A (zh) * 2011-12-23 2012-07-11 深圳市万兴软件有限公司 一种基于哼唱输入的乐曲合成方法及装置
US9351089B1 (en) * 2012-03-14 2016-05-24 Amazon Technologies, Inc. Audio tap detection
JP5772739B2 (ja) * 2012-06-21 2015-09-02 ヤマハ株式会社 音声処理装置
CN103544961B (zh) * 2012-07-10 2017-12-19 中兴通讯股份有限公司 语音信号处理方法及装置
CN107195313B (zh) * 2012-08-31 2021-02-09 瑞典爱立信有限公司 用于语音活动性检测的方法和设备
CN103117067B (zh) * 2013-01-19 2015-07-15 渤海大学 一种低信噪比下语音端点检测方法
CN103177722B (zh) * 2013-03-08 2016-04-20 北京理工大学 一种基于音色相似度的歌曲检索方法
CN103198838A (zh) * 2013-03-29 2013-07-10 苏州皓泰视频技术有限公司 一种用于嵌入式系统的异常声音监控方法和监控装置
CN103247293B (zh) * 2013-05-14 2015-04-08 中国科学院自动化研究所 一种语音数据的编码及解码方法
WO2014194273A2 (en) * 2013-05-30 2014-12-04 Eisner, Mark Systems and methods for enhancing targeted audibility
CN103646649B (zh) * 2013-12-30 2016-04-13 中国科学院自动化研究所 一种高效的语音检测方法
CN104916288B (zh) 2014-03-14 2019-01-18 深圳Tcl新技术有限公司 一种音频中人声突出处理的方法及装置
CN104934032B (zh) * 2014-03-17 2019-04-05 华为技术有限公司 根据频域能量对语音信号进行处理的方法和装置
US9406313B2 (en) * 2014-03-21 2016-08-02 Intel Corporation Adaptive microphone sampling rate techniques
CN106328168B (zh) * 2016-08-30 2019-10-18 成都普创通信技术股份有限公司 一种语音信号相似度检测方法
CN106887241A (zh) * 2016-10-12 2017-06-23 阿里巴巴集团控股有限公司 一种语音信号检测方法与装置

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW333610B (en) 1997-10-16 1998-06-11 Winbond Electronics Corp The phonetic detecting apparatus and its detecting method
TW436759B (en) 1998-03-24 2001-05-28 Matsushita Electric Industrial Co Ltd Speech detection system for noisy conditions
TW201320058A (zh) 2011-09-16 2013-05-16 Qualcomm Inc 使用語音偵測之行動裝置情境資訊
TW201519222A (zh) 2013-10-18 2015-05-16 Knowles Electronics Llc 聲音活動偵測裝置和方法

Also Published As

Publication number Publication date
JP2019535039A (ja) 2019-12-05
PH12019500784A1 (en) 2019-11-11
WO2018068636A1 (zh) 2018-04-19
US10706874B2 (en) 2020-07-07
JP6999012B2 (ja) 2022-01-18
US20190237097A1 (en) 2019-08-01
KR20190061076A (ko) 2019-06-04
SG11201903320XA (en) 2019-05-30
EP3528251A1 (en) 2019-08-21
MY201634A (en) 2024-03-06
CN106887241A (zh) 2017-06-23
JP2021071729A (ja) 2021-05-06
PH12019500784B1 (en) 2024-02-28
EP3528251A4 (en) 2019-08-21
EP3528251B1 (en) 2022-02-23
TW201814692A (zh) 2018-04-16
KR102214888B1 (ko) 2021-02-15
JP6859499B2 (ja) 2021-04-14

Similar Documents

Publication Publication Date Title
TWI654601B (zh) 語音信號檢測方法與裝置
JP2019530264A (ja) 音波によるデータ送信/受信方法及びデータ伝送システム
US20180152163A1 (en) Noise control method and device
US10973458B2 (en) Daily cognitive monitoring of early signs of hearing loss
CN104991755B (zh) 一种信息处理方法及电子设备
US9961642B2 (en) Reduced power consuming mobile devices method and apparatus
US20120053937A1 (en) Generalizing text content summary from speech content
WO2013189263A1 (zh) 在移动终端中监控api函数调用的方法和装置
WO2019109420A1 (zh) 左右声道确定方法及耳机设备
WO2016201767A1 (zh) 一种语音控制方法、装置及计算机存储介质
WO2019183791A1 (zh) 同步信号块传输方法、设备及存储介质
CN106303816B (zh) 一种信息控制方法及电子设备
CN111382241A (zh) 会话场景切换方法及装置
WO2015188761A1 (en) Traffic acquiring method and apparatus based on operating system
WO2018026452A1 (en) System and method for distributing and replaying trigger packets via a variable latency bus interconnect
CN113971962A (zh) 一种信号的检测方法、计算设备及存储介质
CN111370034B (zh) 同步录音的方法及装置、无线耳机充电设备
CN108093356B (zh) 一种啸叫检测方法及装置
CN104881228A (zh) 学习时间检测方法、装置和系统
CN113936678A (zh) 目标语音的检测方法及装置、设备、存储介质
CN109040937B (zh) 麦克风堵塞提醒方法及相关装置
HK1237986A1 (en) Voice signal detection method and apparatus
HK1237986A (en) Voice signal detection method and apparatus
CN104486134A (zh) 一种监控文件发送积压的方法和装置
CN113611298A (zh) 智能设备的唤醒方法和装置、存储介质及电子装置

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees