JP6859499B2 - 音声信号検出方法及び装置 - Google Patents
音声信号検出方法及び装置 Download PDFInfo
- Publication number
- JP6859499B2 JP6859499B2 JP2019520035A JP2019520035A JP6859499B2 JP 6859499 B2 JP6859499 B2 JP 6859499B2 JP 2019520035 A JP2019520035 A JP 2019520035A JP 2019520035 A JP2019520035 A JP 2019520035A JP 6859499 B2 JP6859499 B2 JP 6859499B2
- Authority
- JP
- Japan
- Prior art keywords
- audio signal
- energy
- short
- ratio
- frames
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000005236 sound signal Effects 0.000 title claims description 450
- 238000001514 detection method Methods 0.000 title description 44
- 238000000034 method Methods 0.000 claims description 65
- 238000005070 sampling Methods 0.000 claims description 43
- 230000015654 memory Effects 0.000 description 15
- 230000006870 function Effects 0.000 description 11
- 238000010586 diagram Methods 0.000 description 9
- 230000008569 process Effects 0.000 description 9
- 238000004364 calculation method Methods 0.000 description 8
- 238000004590 computer program Methods 0.000 description 7
- 230000005540 biological transmission Effects 0.000 description 3
- 238000010079 rubber tapping Methods 0.000 description 3
- 230000010354 integration Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/87—Detection of discrete points within a voice signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/84—Detection of presence or absence of voice signals for discriminating voice from noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephone Function (AREA)
- Circuits Of Receivers In General (AREA)
- Mobile Radio Communication Systems (AREA)
- Electric Clocks (AREA)
- Time-Division Multiplex Systems (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201610890946.9 | 2016-10-12 | ||
| CN201610890946.9A CN106887241A (zh) | 2016-10-12 | 2016-10-12 | 一种语音信号检测方法与装置 |
| PCT/CN2017/103489 WO2018068636A1 (zh) | 2016-10-12 | 2017-09-26 | 一种语音信号检测方法与装置 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2020201829A Division JP6999012B2 (ja) | 2016-10-12 | 2020-12-04 | 音声信号検出方法及び装置 |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2019535039A JP2019535039A (ja) | 2019-12-05 |
| JP2019535039A5 JP2019535039A5 (enExample) | 2020-06-25 |
| JP6859499B2 true JP6859499B2 (ja) | 2021-04-14 |
Family
ID=59176496
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2019520035A Active JP6859499B2 (ja) | 2016-10-12 | 2017-09-26 | 音声信号検出方法及び装置 |
| JP2020201829A Active JP6999012B2 (ja) | 2016-10-12 | 2020-12-04 | 音声信号検出方法及び装置 |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2020201829A Active JP6999012B2 (ja) | 2016-10-12 | 2020-12-04 | 音声信号検出方法及び装置 |
Country Status (10)
| Country | Link |
|---|---|
| US (1) | US10706874B2 (enExample) |
| EP (1) | EP3528251B1 (enExample) |
| JP (2) | JP6859499B2 (enExample) |
| KR (1) | KR102214888B1 (enExample) |
| CN (1) | CN106887241A (enExample) |
| MY (1) | MY201634A (enExample) |
| PH (1) | PH12019500784B1 (enExample) |
| SG (1) | SG11201903320XA (enExample) |
| TW (1) | TWI654601B (enExample) |
| WO (1) | WO2018068636A1 (enExample) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2021071729A (ja) * | 2016-10-12 | 2021-05-06 | アドバンスド ニュー テクノロジーズ カンパニー リミテッド | 音声信号検出方法及び装置 |
Families Citing this family (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107957918B (zh) * | 2016-10-14 | 2019-05-10 | 腾讯科技(深圳)有限公司 | 数据恢复方法和装置 |
| CN108257616A (zh) * | 2017-12-05 | 2018-07-06 | 苏州车萝卜汽车电子科技有限公司 | 人机对话的检测方法以及装置 |
| CN108305639B (zh) * | 2018-05-11 | 2021-03-09 | 南京邮电大学 | 语音情感识别方法、计算机可读存储介质、终端 |
| CN108682432B (zh) * | 2018-05-11 | 2021-03-16 | 南京邮电大学 | 语音情感识别装置 |
| CN108847217A (zh) * | 2018-05-31 | 2018-11-20 | 平安科技(深圳)有限公司 | 一种语音切分方法、装置、计算机设备及存储介质 |
| CN109545193B (zh) * | 2018-12-18 | 2023-03-14 | 百度在线网络技术(北京)有限公司 | 用于生成模型的方法和装置 |
| CN110225444A (zh) * | 2019-06-14 | 2019-09-10 | 四川长虹电器股份有限公司 | 一种麦克风阵列系统的故障检测方法及其检测系统 |
| CN111724783B (zh) * | 2020-06-24 | 2023-10-17 | 北京小米移动软件有限公司 | 智能设备的唤醒方法、装置、智能设备及介质 |
| CN113270118B (zh) * | 2021-05-14 | 2024-02-13 | 杭州网易智企科技有限公司 | 语音活动侦测方法及装置、存储介质和电子设备 |
| CN116612775A (zh) * | 2022-02-09 | 2023-08-18 | 宸芯科技股份有限公司 | 一种杂音消除方法、装置、电子设备及介质 |
| CN114792530B (zh) * | 2022-04-26 | 2025-07-04 | 美的集团(上海)有限公司 | 语音数据处理方法、装置、电子设备和存储介质 |
| CN114898774B (zh) * | 2022-05-06 | 2025-06-13 | 钉钉(中国)信息技术有限公司 | 一种音频掉点的检测方法及装置 |
| CN116863947A (zh) * | 2023-07-27 | 2023-10-10 | 海纳科德(湖北)科技有限公司 | 一种利用宠物语音信号识别情绪的方法及系统 |
Family Cites Families (30)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3297346B2 (ja) * | 1997-04-30 | 2002-07-02 | 沖電気工業株式会社 | 音声検出装置 |
| TW333610B (en) | 1997-10-16 | 1998-06-11 | Winbond Electronics Corp | The phonetic detecting apparatus and its detecting method |
| US6480823B1 (en) | 1998-03-24 | 2002-11-12 | Matsushita Electric Industrial Co., Ltd. | Speech detection for noisy conditions |
| JP3266124B2 (ja) * | 1999-01-07 | 2002-03-18 | ヤマハ株式会社 | アナログ信号中の類似波形検出装置及び同信号の時間軸伸長圧縮装置 |
| KR100463657B1 (ko) * | 2002-11-30 | 2004-12-29 | 삼성전자주식회사 | 음성구간 검출 장치 및 방법 |
| US7715447B2 (en) | 2003-12-23 | 2010-05-11 | Intel Corporation | Method and system for tone detection |
| CN101625860B (zh) * | 2008-07-10 | 2012-07-04 | 新奥特(北京)视频技术有限公司 | 语音端点检测中的背景噪声自适应调整方法 |
| JP5459220B2 (ja) | 2008-11-27 | 2014-04-02 | 日本電気株式会社 | 発話音声検出装置 |
| CN101494049B (zh) * | 2009-03-11 | 2011-07-27 | 北京邮电大学 | 一种用于音频监控系统中的音频特征参数的提取方法 |
| ES2371619B1 (es) | 2009-10-08 | 2012-08-08 | Telefónica, S.A. | Procedimiento de detección de segmentos de voz. |
| BR112012008671A2 (pt) | 2009-10-19 | 2016-04-19 | Ericsson Telefon Ab L M | método para detectar atividade de voz de um sinal de entrada recebido, e, detector de atividade de voz |
| KR101666521B1 (ko) * | 2010-01-08 | 2016-10-14 | 삼성전자 주식회사 | 입력 신호의 피치 주기 검출 방법 및 그 장치 |
| US20130090926A1 (en) | 2011-09-16 | 2013-04-11 | Qualcomm Incorporated | Mobile device context information using speech detection |
| CN102568457A (zh) * | 2011-12-23 | 2012-07-11 | 深圳市万兴软件有限公司 | 一种基于哼唱输入的乐曲合成方法及装置 |
| US9351089B1 (en) * | 2012-03-14 | 2016-05-24 | Amazon Technologies, Inc. | Audio tap detection |
| JP5772739B2 (ja) * | 2012-06-21 | 2015-09-02 | ヤマハ株式会社 | 音声処理装置 |
| CN103544961B (zh) * | 2012-07-10 | 2017-12-19 | 中兴通讯股份有限公司 | 语音信号处理方法及装置 |
| HUE038398T2 (hu) * | 2012-08-31 | 2018-10-29 | Ericsson Telefon Ab L M | Eljárás és eszköz hang aktivitás észlelésére |
| CN103117067B (zh) * | 2013-01-19 | 2015-07-15 | 渤海大学 | 一种低信噪比下语音端点检测方法 |
| CN103177722B (zh) * | 2013-03-08 | 2016-04-20 | 北京理工大学 | 一种基于音色相似度的歌曲检索方法 |
| CN103198838A (zh) * | 2013-03-29 | 2013-07-10 | 苏州皓泰视频技术有限公司 | 一种用于嵌入式系统的异常声音监控方法和监控装置 |
| CN103247293B (zh) * | 2013-05-14 | 2015-04-08 | 中国科学院自动化研究所 | 一种语音数据的编码及解码方法 |
| WO2014194273A2 (en) * | 2013-05-30 | 2014-12-04 | Eisner, Mark | Systems and methods for enhancing targeted audibility |
| US9502028B2 (en) | 2013-10-18 | 2016-11-22 | Knowles Electronics, Llc | Acoustic activity detection apparatus and method |
| CN103646649B (zh) * | 2013-12-30 | 2016-04-13 | 中国科学院自动化研究所 | 一种高效的语音检测方法 |
| CN104916288B (zh) | 2014-03-14 | 2019-01-18 | 深圳Tcl新技术有限公司 | 一种音频中人声突出处理的方法及装置 |
| CN104934032B (zh) * | 2014-03-17 | 2019-04-05 | 华为技术有限公司 | 根据频域能量对语音信号进行处理的方法和装置 |
| US9406313B2 (en) * | 2014-03-21 | 2016-08-02 | Intel Corporation | Adaptive microphone sampling rate techniques |
| CN106328168B (zh) * | 2016-08-30 | 2019-10-18 | 成都普创通信技术股份有限公司 | 一种语音信号相似度检测方法 |
| CN106887241A (zh) * | 2016-10-12 | 2017-06-23 | 阿里巴巴集团控股有限公司 | 一种语音信号检测方法与装置 |
-
2016
- 2016-10-12 CN CN201610890946.9A patent/CN106887241A/zh active Pending
-
2017
- 2017-09-12 TW TW106131148A patent/TWI654601B/zh active
- 2017-09-26 PH PH1/2019/500784A patent/PH12019500784B1/en unknown
- 2017-09-26 JP JP2019520035A patent/JP6859499B2/ja active Active
- 2017-09-26 KR KR1020197013519A patent/KR102214888B1/ko active Active
- 2017-09-26 MY MYPI2019001999A patent/MY201634A/en unknown
- 2017-09-26 SG SG11201903320XA patent/SG11201903320XA/en unknown
- 2017-09-26 WO PCT/CN2017/103489 patent/WO2018068636A1/zh not_active Ceased
- 2017-09-26 EP EP17860814.7A patent/EP3528251B1/en active Active
-
2019
- 2019-04-10 US US16/380,609 patent/US10706874B2/en active Active
-
2020
- 2020-12-04 JP JP2020201829A patent/JP6999012B2/ja active Active
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2021071729A (ja) * | 2016-10-12 | 2021-05-06 | アドバンスド ニュー テクノロジーズ カンパニー リミテッド | 音声信号検出方法及び装置 |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2018068636A1 (zh) | 2018-04-19 |
| PH12019500784A1 (en) | 2019-11-11 |
| JP2019535039A (ja) | 2019-12-05 |
| KR102214888B1 (ko) | 2021-02-15 |
| JP2021071729A (ja) | 2021-05-06 |
| US20190237097A1 (en) | 2019-08-01 |
| PH12019500784B1 (en) | 2024-02-28 |
| SG11201903320XA (en) | 2019-05-30 |
| TWI654601B (zh) | 2019-03-21 |
| CN106887241A (zh) | 2017-06-23 |
| EP3528251A1 (en) | 2019-08-21 |
| US10706874B2 (en) | 2020-07-07 |
| KR20190061076A (ko) | 2019-06-04 |
| EP3528251A4 (en) | 2019-08-21 |
| JP6999012B2 (ja) | 2022-01-18 |
| EP3528251B1 (en) | 2022-02-23 |
| MY201634A (en) | 2024-03-06 |
| TW201814692A (zh) | 2018-04-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP6999012B2 (ja) | 音声信号検出方法及び装置 | |
| US11670325B2 (en) | Voice activity detection using a soft decision mechanism | |
| CN107068161B (zh) | 基于人工智能的语音降噪方法、装置和计算机设备 | |
| JP6784758B2 (ja) | ノイズ信号判定方法及び装置並びに音声ノイズ除去方法及び装置 | |
| CN107680584B (zh) | 用于切分音频的方法和装置 | |
| US9916843B2 (en) | Voice processing apparatus, voice processing method, and non-transitory computer-readable storage medium to determine whether voice signals are in a conversation state | |
| CN112331188A (zh) | 一种语音数据处理方法、系统及终端设备 | |
| US20240407342A1 (en) | Method, system, and device for classifying feeding intensity of fish school | |
| EP2947659A1 (en) | Voice processing device and voice processing method | |
| CN114333912B (zh) | 语音激活检测方法、装置、电子设备和存储介质 | |
| JP5614261B2 (ja) | 雑音抑制装置、雑音抑制方法、及びプログラム | |
| US10522160B2 (en) | Methods and apparatus to identify a source of speech captured at a wearable electronic device | |
| CN111986657A (zh) | 音频识别方法和装置、录音终端及服务器、存储介质 | |
| CN108093356B (zh) | 一种啸叫检测方法及装置 | |
| CN113436641A (zh) | 一种音乐转场时间点检测方法、设备及介质 | |
| JP2020527433A (ja) | 人体疲労値の取得方法及び装置 | |
| CN116320372A (zh) | 音频延时检测方法、系统、装置、存储介质和处理器 | |
| CN111145770B (zh) | 音频处理方法和装置 | |
| CN109841222B (zh) | 音频通信方法、通信设备及存储介质 | |
| CN114639390A (zh) | 一种语音噪声分析方法及系统 | |
| CN111883159B (zh) | 语音的处理方法及装置 | |
| HK1237986A1 (en) | Voice signal detection method and apparatus | |
| HK1237986A (en) | Voice signal detection method and apparatus | |
| JP2018180482A (ja) | 音声検出装置及び音声検出プログラム | |
| JP2016133600A (ja) | 顕著度推定方法、顕著度推定装置、プログラム |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20190612 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20190612 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20200512 |
|
| A871 | Explanation of circumstances concerning accelerated examination |
Free format text: JAPANESE INTERMEDIATE CODE: A871 Effective date: 20200512 |
|
| RD03 | Notification of appointment of power of attorney |
Free format text: JAPANESE INTERMEDIATE CODE: A7423 Effective date: 20200605 |
|
| A975 | Report on accelerated examination |
Free format text: JAPANESE INTERMEDIATE CODE: A971005 Effective date: 20200721 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20200803 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20200911 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20201005 |
|
| A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20201104 |
|
| A711 | Notification of change in applicant |
Free format text: JAPANESE INTERMEDIATE CODE: A711 Effective date: 20201204 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20201204 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20201228 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 6859499 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| RVTR | Cancellation due to determination of trial for invalidation | ||
| R157 | Certificate of patent or utility model (correction) |
Free format text: JAPANESE INTERMEDIATE CODE: R157 |