TWI654601B - Voice signal detection method and device - Google Patents
Voice signal detection method and device
Info
- Publication number
- TWI654601B
- Authority
- TW
- Taiwan
- Prior art keywords
- audio signal
- short
- signal
- term energy
- voice signal
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/87—Detection of discrete points within a voice signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/84—Detection of presence or absence of voice signals for discriminating voice from noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephone Function (AREA)
- Circuits Of Receivers In General (AREA)
- Mobile Radio Communication Systems (AREA)
- Electric Clocks (AREA)
- Time-Division Multiplex Systems (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201610890946.9 | 2016-10-12 | | |
| CN201610890946.9A CN106887241A (zh) | 2016-10-12 | 2016-10-12 | Voice signal detection method and device |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW201814692A (zh) | 2018-04-16 |
| TWI654601B (zh) | 2019-03-21 |
Family
ID=59176496
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW106131148A TWI654601B (zh) | 2016-10-12 | 2017-09-12 | Voice signal detection method and device |
Country Status (10)
| Country | Link |
|---|---|
| US (1) | US10706874B2 |
| EP (1) | EP3528251B1 |
| JP (2) | JP6859499B2 |
| KR (1) | KR102214888B1 |
| CN (1) | CN106887241A |
| MY (1) | MY201634A |
| PH (1) | PH12019500784B1 |
| SG (1) | SG11201903320XA |
| TW (1) | TWI654601B |
| WO (1) | WO2018068636A1 |
Families Citing this family (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
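| CN106887241A (zh) * | 2016-10-12 | 2017-06-23 | 阿里巴巴集团控股有限公司 | Voice signal detection method and device |
| CN107957918B (zh) * | 2016-10-14 | 2019-05-10 | 腾讯科技(深圳)有限公司 | Data recovery method and device |
| CN108257616A (zh) * | 2017-12-05 | 2018-07-06 | 苏州车萝卜汽车电子科技有限公司 | Method and device for detecting human-machine dialogue |
| CN108305639B (zh) * | 2018-05-11 | 2021-03-09 | 南京邮电大学 | Speech emotion recognition method, computer-readable storage medium, and terminal |
| CN108682432B (zh) * | 2018-05-11 | 2021-03-16 | 南京邮电大学 | Speech emotion recognition device |
| CN108847217A (zh) * | 2018-05-31 | 2018-11-20 | 平安科技(深圳)有限公司 | Speech segmentation method and device, computer equipment, and storage medium |
| CN109545193B (zh) * | 2018-12-18 | 2023-03-14 | 百度在线网络技术(北京)有限公司 | Method and device for generating a model |
| CN110225444A (zh) * | 2019-06-14 | 2019-09-10 | 四川长虹电器股份有限公司 | Fault detection method for a microphone array system and detection system thereof |
| CN111724783B (zh) * | 2020-06-24 | 2023-10-17 | 北京小米移动软件有限公司 | Wake-up method and device for a smart device, smart device, and medium |
| CN113270118B (zh) * | 2021-05-14 | 2024-02-13 | 杭州网易智企科技有限公司 | Voice activity detection method and device, storage medium, and electronic equipment |
| CN116612775A (zh) * | 2022-02-09 | 2023-08-18 | 宸芯科技股份有限公司 | Noise elimination method and device, electronic equipment, and medium |
| CN114792530B (zh) * | 2022-04-26 | 2025-07-04 | 美的集团(上海)有限公司 | Voice data processing method and device, electronic equipment, and storage medium |
| CN114898774B (zh) * | 2022-05-06 | 2025-06-13 | 钉钉(中国)信息技术有限公司 | Method and device for detecting audio dropouts |
| CN116863947A (zh) * | 2023-07-27 | 2023-10-10 | 海纳科德(湖北)科技有限公司 | Method and system for recognizing emotion using pet voice signals |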
Family Cites Families (30)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3297346B2 (ja) * | 1997-04-30 | 2002-07-02 | 沖電気工業株式会社 | Voice detection device |
| TW333610B (en) | 1997-10-16 | 1998-06-11 | Winbond Electronics Corp | The phonetic detecting apparatus and its detecting method |
| US6480823B1 (en) | 1998-03-24 | 2002-11-12 | Matsushita Electric Industrial Co., Ltd. | Speech detection for noisy conditions |
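| JP3266124B2 (ja) * | 1999-01-07 | 2002-03-18 | ヤマハ株式会社 | Device for detecting similar waveforms in an analog signal and device for time-axis expansion and compression of the same signal |
| KR100463657B1 (ko) * | 2002-11-30 | 2004-12-29 | 삼성전자주식회사 | Apparatus and method for detecting voice sections |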
| US7715447B2 (en) | 2003-12-23 | 2010-05-11 | Intel Corporation | Method and system for tone detection |
| CN101625860B (zh) * | 2008-07-10 | 2012-07-04 | 新奥特(北京)视频技术有限公司 | Adaptive background-noise adjustment method for voice endpoint detection |
| JP5459220B2 (ja) | 2008-11-27 | 2014-04-02 | 日本電気株式会社 | Utterance voice detection device |
| CN101494049B (zh) * | 2009-03-11 | 2011-07-27 | 北京邮电大学 | Method for extracting audio feature parameters for use in an audio monitoring system |
| ES2371619B1 (es) | 2009-10-08 | 2012-08-08 | Telefónica, S.A. | Method for detecting speech segments |
| BR112012008671A2 (pt) | 2009-10-19 | 2016-04-19 | Ericsson Telefon Ab L M | Method for detecting voice activity of a received input signal, and voice activity detector |
| KR101666521B1 (ko) * | 2010-01-08 | 2016-10-14 | 삼성전자 주식회사 | Method and apparatus for detecting the pitch period of an input signal |
| US20130090926A1 (en) | 2011-09-16 | 2013-04-11 | Qualcomm Incorporated | Mobile device context information using speech detection |
| CN102568457A (zh) * | 2011-12-23 | 2012-07-11 | 深圳市万兴软件有限公司 | Music synthesis method and device based on humming input |
| US9351089B1 (en) * | 2012-03-14 | 2016-05-24 | Amazon Technologies, Inc. | Audio tap detection |
| JP5772739B2 (ja) * | 2012-06-21 | 2015-09-02 | ヤマハ株式会社 | Voice processing device |
| CN103544961B (zh) * | 2012-07-10 | 2017-12-19 | 中兴通讯股份有限公司 | Voice signal processing method and device |
| HUE038398T2 (hu) * | 2012-08-31 | 2018-10-29 | Ericsson Telefon Ab L M | Method and device for voice activity detection |
| CN103117067B (zh) * | 2013-01-19 | 2015-07-15 | 渤海大学 | Voice endpoint detection method under low signal-to-noise ratio |
| CN103177722B (zh) * | 2013-03-08 | 2016-04-20 | 北京理工大学 | Song retrieval method based on timbre similarity |
| CN103198838A (zh) * | 2013-03-29 | 2013-07-10 | 苏州皓泰视频技术有限公司 | Abnormal sound monitoring method and monitoring device for embedded systems |
| CN103247293B (zh) * | 2013-05-14 | 2015-04-08 | 中国科学院自动化研究所 | Method for encoding and decoding voice data |
| WO2014194273A2 (en) * | 2013-05-30 | 2014-12-04 | Eisner, Mark | Systems and methods for enhancing targeted audibility |
| US9502028B2 (en) | 2013-10-18 | 2016-11-22 | Knowles Electronics, Llc | Acoustic activity detection apparatus and method |
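| CN103646649B (zh) * | 2013-12-30 | 2016-04-13 | 中国科学院自动化研究所 | Efficient voice detection method |
| CN104916288B (zh) | 2014-03-14 | 2019-01-18 | 深圳Tcl新技术有限公司 | Method and device for emphasizing the human voice in audio |
| CN104934032B (zh) * | 2014-03-17 | 2019-04-05 | 华为技术有限公司 | Method and device for processing voice signals according to frequency-domain energy |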
| US9406313B2 (en) * | 2014-03-21 | 2016-08-02 | Intel Corporation | Adaptive microphone sampling rate techniques |
| CN106328168B (zh) * | 2016-08-30 | 2019-10-18 | 成都普创通信技术股份有限公司 | Method for detecting voice signal similarity |
| CN106887241A (zh) * | 2016-10-12 | 2017-06-23 | 阿里巴巴集团控股有限公司 | Voice signal detection method and device |
2016
- 2016-10-12: CN application CN201610890946.9A, published as CN106887241A (zh), status: Pending
2017
- 2017-09-12: TW application TW106131148A, published as TWI654601B (zh), status: active
- 2017-09-26: PH application PH1/2019/500784A, published as PH12019500784B1 (en), status: unknown
- 2017-09-26: JP application JP2019520035A, published as JP6859499B2 (ja), status: Active
- 2017-09-26: KR application KR1020197013519A, published as KR102214888B1 (ko), status: Active
- 2017-09-26: MY application MYPI2019001999A, published as MY201634A (en), status: unknown
- 2017-09-26: SG application SG11201903320XA, published as SG11201903320XA (en), status: unknown
- 2017-09-26: WO application PCT/CN2017/103489, published as WO2018068636A1 (zh), status: Ceased
- 2017-09-26: EP application EP17860814.7A, published as EP3528251B1 (en), status: Active
2019
- 2019-04-10: US application US16/380,609, published as US10706874B2 (en), status: Active
2020
- 2020-12-04: JP application JP2020201829A, published as JP6999012B2 (ja), status: Active
Also Published As
| Publication number | Publication date |
|---|---|
| WO2018068636A1 (zh) | 2018-04-19 |
| PH12019500784A1 (en) | 2019-11-11 |
| JP2019535039A (ja) | 2019-12-05 |
| KR102214888B1 (ko) | 2021-02-15 |
| JP2021071729A (ja) | 2021-05-06 |
| US20190237097A1 (en) | 2019-08-01 |
| PH12019500784B1 (en) | 2024-02-28 |
| SG11201903320XA (en) | 2019-05-30 |
| CN106887241A (zh) | 2017-06-23 |
| EP3528251A1 (en) | 2019-08-21 |
| US10706874B2 (en) | 2020-07-07 |
| KR20190061076A (ko) | 2019-06-04 |
| EP3528251A4 (en) | 2019-08-21 |
| JP6999012B2 (ja) | 2022-01-18 |
| EP3528251B1 (en) | 2022-02-23 |
| JP6859499B2 (ja) | 2021-04-14 |
| MY201634A (en) | 2024-03-06 |
| TW201814692A (zh) | 2018-04-16 |
Similar Documents
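| Publication | Title | Publication Date |
|---|---|---|
| TWI654601B (zh) | Voice signal detection method and device | |
| JP6489563B2 (ja) | Volume adjustment method, system, device, and program | |
| CN108345524B (zh) | Application monitoring method and application monitoring device | |
| JP2019530264A (ja) | Method for transmitting and receiving data by sound waves, and data transmission system | |
| CN111988647A (zh) | Audio-video synchronization adjustment method, device, equipment, and medium | |
| CN104991755B (zh) | Information processing method and electronic device | |
| US9961642B2 (en) | Reduced power consuming mobile devices method and apparatus | |
| US10238333B2 (en) | Daily cognitive monitoring of early signs of hearing loss | |
| US20120053937A1 (en) | Generalizing text content summary from speech content | |
| WO2013189263A1 (zh) | Method and device for monitoring API function calls in a mobile terminal | |
| CN111787513A (zh) | Method and device for playing audio | |
| WO2016201767A1 (zh) | Voice control method and device, and computer storage medium | |
| WO2019183791A1 (zh) | Synchronization signal block transmission method, device, and storage medium | |
| CN106303816B (zh) | Information control method and electronic device | |
| CN111382241A (zh) | Method and device for switching conversation scenarios | |
| WO2015188761A1 (en) | Traffic acquiring method and apparatus based on operating system | |
| US20180034749A1 (en) | System and method for distributing and replaying trigger packets via a variable latency bus interconnect | |
| CN113971962A (zh) | Signal detection method, computing device, and storage medium | |
| CN110018806A (zh) | Voice processing method and device | |
| CN115002229B (zh) | Edge cloud network system, scheduling method, device, system, and storage medium | |
| CN104881228B (zh) | Learning time detection method, device, and system | |
| CN111538249A (zh) | Control method, device, and equipment for distributed terminals, and storage medium | |
| CN109040937B (zh) | Microphone blockage reminder method and related device | |
| HK1237986A1 (en) | Voice signal detection method and apparatus | |
| HK1237986A (en) | Voice signal detection method and apparatus | |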