CN103646649B - A kind of speech detection method efficiently - Google Patents
A kind of speech detection method efficiently Download PDFInfo
- Publication number
- CN103646649B CN103646649B CN201310743203.5A CN201310743203A CN103646649B CN 103646649 B CN103646649 B CN 103646649B CN 201310743203 A CN201310743203 A CN 201310743203A CN 103646649 B CN103646649 B CN 103646649B
- Authority
- CN
- China
- Prior art keywords
- audio
- speech
- frame
- subband
- sound signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 60
- 230000005236 sound signal Effects 0.000 claims abstract description 52
- 238000000034 method Methods 0.000 claims abstract description 39
- 238000001228 spectrum Methods 0.000 claims abstract description 37
- 230000008569 process Effects 0.000 claims description 22
- 238000012549 training Methods 0.000 claims description 13
- 238000001914 filtration Methods 0.000 claims description 12
- 230000007704 transition Effects 0.000 claims description 11
- 238000012216 screening Methods 0.000 claims description 6
- 230000000052 comparative effect Effects 0.000 claims description 3
- 230000006870 function Effects 0.000 claims description 3
- 230000003068 static effect Effects 0.000 claims description 3
- 238000000605 extraction Methods 0.000 claims description 2
- 230000008447 perception Effects 0.000 claims description 2
- 230000003252 repetitive effect Effects 0.000 claims description 2
- 230000000717 retained effect Effects 0.000 abstract 1
- 238000010586 diagram Methods 0.000 description 12
- 238000004891 communication Methods 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000013178 mathematical model Methods 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 1
- 206010038743 Restlessness Diseases 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Landscapes
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310743203.5A CN103646649B (en) | 2013-12-30 | 2013-12-30 | A kind of speech detection method efficiently |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310743203.5A CN103646649B (en) | 2013-12-30 | 2013-12-30 | A kind of speech detection method efficiently |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103646649A CN103646649A (en) | 2014-03-19 |
CN103646649B true CN103646649B (en) | 2016-04-13 |
Family
ID=50251851
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310743203.5A Active CN103646649B (en) | 2013-12-30 | 2013-12-30 | A kind of speech detection method efficiently |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103646649B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102214888B1 (en) * | 2016-10-12 | 2021-02-15 | 어드밴스드 뉴 테크놀로지스 씨오., 엘티디. | Method and device for detecting an audio signal |
Families Citing this family (40)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104318927A (en) * | 2014-11-04 | 2015-01-28 | 东莞市北斗时空通信科技有限公司 | Anti-noise low-bitrate speech coding method and decoding method |
CN104464722B (en) * | 2014-11-13 | 2018-05-25 | 北京云知声信息技术有限公司 | Voice activity detection method and apparatus based on time domain and frequency domain |
CN104934043A (en) * | 2015-06-17 | 2015-09-23 | 广东欧珀移动通信有限公司 | Audio processing method and device |
CN105118522B (en) * | 2015-08-27 | 2021-02-12 | 广州市百果园网络科技有限公司 | Noise detection method and device |
CN105788592A (en) * | 2016-04-28 | 2016-07-20 | 乐视控股(北京)有限公司 | Audio classification method and apparatus thereof |
CN105843400A (en) * | 2016-05-05 | 2016-08-10 | 广东小天才科技有限公司 | Somatosensory interaction method and device and wearable device |
CN106020445A (en) * | 2016-05-05 | 2016-10-12 | 广东小天才科技有限公司 | Method for automatically identifying wearing by left hand and right hand and wearing equipment |
CN107919116B (en) * | 2016-10-11 | 2019-09-13 | 芋头科技(杭州)有限公司 | A kind of voice-activation detecting method and device |
KR102179511B1 (en) | 2016-10-14 | 2020-11-16 | 코우리츠 다이가꾸 호우진 오사카 | Swallowing diagnostic device and program |
CN107957918B (en) * | 2016-10-14 | 2019-05-10 | 腾讯科技(深圳)有限公司 | Data reconstruction method and device |
CN106548782A (en) * | 2016-10-31 | 2017-03-29 | 维沃移动通信有限公司 | The processing method and mobile terminal of acoustical signal |
CN106653047A (en) * | 2016-12-16 | 2017-05-10 | 广州视源电子科技股份有限公司 | Automatic gain control method and device for audio data |
CN106782508A (en) * | 2016-12-20 | 2017-05-31 | 美的集团股份有限公司 | The cutting method of speech audio and the cutting device of speech audio |
CN107039035A (en) * | 2017-01-10 | 2017-08-11 | 上海优同科技有限公司 | A kind of detection method of voice starting point and ending point |
CN107045870B (en) * | 2017-05-23 | 2020-06-26 | 南京理工大学 | Speech signal endpoint detection method based on characteristic value coding |
CN107910017A (en) * | 2017-12-19 | 2018-04-13 | 河海大学 | A kind of method that threshold value is set in noisy speech end-point detection |
CN108269566B (en) * | 2018-01-17 | 2020-08-25 | 南京理工大学 | Rifling wave identification method based on multi-scale sub-band energy set characteristics |
CN109036470B (en) * | 2018-06-04 | 2023-04-21 | 平安科技(深圳)有限公司 | Voice distinguishing method, device, computer equipment and storage medium |
CN108831508A (en) * | 2018-06-13 | 2018-11-16 | 百度在线网络技术(北京)有限公司 | Voice activity detection method, device and equipment |
CN109147795B (en) * | 2018-08-06 | 2021-05-14 | 珠海全志科技股份有限公司 | Voiceprint data transmission and identification method, identification device and storage medium |
CN109347580B (en) * | 2018-11-19 | 2021-01-19 | 湖南猎航电子科技有限公司 | Self-adaptive threshold signal detection method with known duty ratio |
CN111261143B (en) * | 2018-12-03 | 2024-03-22 | 嘉楠明芯(北京)科技有限公司 | Voice wakeup method and device and computer readable storage medium |
CN109448750B (en) * | 2018-12-20 | 2023-06-23 | 西京学院 | Speech enhancement method for improving speech quality of biological radar |
CN109801646B (en) * | 2019-01-31 | 2021-11-16 | 嘉楠明芯(北京)科技有限公司 | Voice endpoint detection method and device based on fusion features |
CN111916068B (en) * | 2019-05-07 | 2024-07-23 | 北京地平线机器人技术研发有限公司 | Audio detection method and device |
CN110097895B (en) * | 2019-05-14 | 2021-03-16 | 腾讯音乐娱乐科技(深圳)有限公司 | Pure music detection method, pure music detection device and storage medium |
CN110349597B (en) * | 2019-07-03 | 2021-06-25 | 山东师范大学 | Voice detection method and device |
CN110600010B (en) * | 2019-09-20 | 2022-05-17 | 度小满科技(北京)有限公司 | Corpus extraction method and apparatus |
CN110636176B (en) * | 2019-10-09 | 2022-05-17 | 科大讯飞股份有限公司 | Call fault detection method, device, equipment and storage medium |
CN111415685A (en) * | 2020-03-26 | 2020-07-14 | 腾讯科技(深圳)有限公司 | Audio signal detection method, device, equipment and computer readable storage medium |
CN111398944B (en) * | 2020-04-09 | 2022-05-17 | 浙江大学 | Radar signal processing method for identity recognition |
CN111883182B (en) * | 2020-07-24 | 2024-03-19 | 平安科技(深圳)有限公司 | Human voice detection method, device, equipment and storage medium |
CN112466331A (en) * | 2020-11-11 | 2021-03-09 | 昆明理工大学 | Voice music classification model based on beat spectrum characteristics |
CN112562735B (en) * | 2020-11-27 | 2023-03-24 | 锐迪科微电子(上海)有限公司 | Voice detection method, device, equipment and storage medium |
CN112528920A (en) * | 2020-12-21 | 2021-03-19 | 杭州格像科技有限公司 | Pet image emotion recognition method based on depth residual error network |
CN112767920A (en) * | 2020-12-31 | 2021-05-07 | 深圳市珍爱捷云信息技术有限公司 | Method, device, equipment and storage medium for recognizing call voice |
CN113160853A (en) * | 2021-03-31 | 2021-07-23 | 深圳鱼亮科技有限公司 | Voice endpoint detection method based on real-time face assistance |
CN113192488B (en) * | 2021-04-06 | 2022-05-06 | 青岛信芯微电子科技股份有限公司 | Voice processing method and device |
CN113541867A (en) * | 2021-06-30 | 2021-10-22 | 南京奥通智能科技有限公司 | Remote communication module for converged terminal |
CN113593599A (en) * | 2021-09-02 | 2021-11-02 | 北京云蝶智学科技有限公司 | Method for removing noise signal in voice signal |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101197130A (en) * | 2006-12-07 | 2008-06-11 | 华为技术有限公司 | Sound activity detecting method and detector thereof |
CN102473412A (en) * | 2009-07-21 | 2012-05-23 | 日本电信电话株式会社 | Audio signal section estimateing apparatus, audio signal section estimateing method, program therefor and recording medium |
CN103165127A (en) * | 2011-12-15 | 2013-06-19 | 佳能株式会社 | Sound segmentation equipment, sound segmentation method and sound detecting system |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100513175B1 (en) * | 2002-12-24 | 2005-09-07 | 한국전자통신연구원 | A Voice Activity Detector Employing Complex Laplacian Model |
-
2013
- 2013-12-30 CN CN201310743203.5A patent/CN103646649B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101197130A (en) * | 2006-12-07 | 2008-06-11 | 华为技术有限公司 | Sound activity detecting method and detector thereof |
CN102473412A (en) * | 2009-07-21 | 2012-05-23 | 日本电信电话株式会社 | Audio signal section estimateing apparatus, audio signal section estimateing method, program therefor and recording medium |
CN103165127A (en) * | 2011-12-15 | 2013-06-19 | 佳能株式会社 | Sound segmentation equipment, sound segmentation method and sound detecting system |
Non-Patent Citations (2)
Title |
---|
话者识别中结合模型和能量的语音激活检测算法;章钊、郭武;《小型微型计算机系统》;20100930;第31卷(第9期);1914-1917 * |
语音激活检测技术算法研究及其在语音编码器中的应用;沈红丽;《万方数据》;20120426;第2.2.3、3.3节 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102214888B1 (en) * | 2016-10-12 | 2021-02-15 | 어드밴스드 뉴 테크놀로지스 씨오., 엘티디. | Method and device for detecting an audio signal |
Also Published As
Publication number | Publication date |
---|---|
CN103646649A (en) | 2014-03-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103646649B (en) | A kind of speech detection method efficiently | |
CN103854662B (en) | Adaptive voice detection method based on multiple domain Combined estimator | |
Evangelopoulos et al. | Multiband modulation energy tracking for noisy speech detection | |
CN103489446B (en) | Based on the twitter identification method that adaptive energy detects under complex environment | |
CN101197130B (en) | Sound activity detecting method and detector thereof | |
Meyer et al. | Robustness of spectro-temporal features against intrinsic and extrinsic variations in automatic speech recognition | |
US20090076814A1 (en) | Apparatus and method for determining speech signal | |
CN104318927A (en) | Anti-noise low-bitrate speech coding method and decoding method | |
CN104157290A (en) | Speaker recognition method based on depth learning | |
CN104008751A (en) | Speaker recognition method based on BP neural network | |
CN103489454A (en) | Voice endpoint detection method based on waveform morphological characteristic clustering | |
Ghaemmaghami et al. | Noise robust voice activity detection using features extracted from the time-domain autocorrelation function | |
CN110136709A (en) | Audio recognition method and video conferencing system based on speech recognition | |
Couvreur et al. | Automatic noise recognition in urban environments based on artificial neural networks and hidden markov models | |
CN108806725A (en) | Speech differentiation method, apparatus, computer equipment and storage medium | |
Zhang et al. | Fault diagnosis method based on MFCC fusion and SVM | |
Chu et al. | A noise-robust FFT-based auditory spectrum with application in audio classification | |
Thomas et al. | Acoustic and data-driven features for robust speech activity detection | |
Singh et al. | Novel feature extraction algorithm using DWT and temporal statistical techniques for word dependent speaker’s recognition | |
Papadopoulos et al. | Global SNR Estimation of Speech Signals for Unknown Noise Conditions Using Noise Adapted Non-Linear Regression. | |
CN110265049A (en) | A kind of audio recognition method and speech recognition system | |
TWI749547B (en) | Speech enhancement system based on deep learning | |
Park et al. | Frequency of Interest-based Noise Attenuation Method to Improve Anomaly Detection Performance | |
Pasad et al. | Voice activity detection for children's read speech recognition in noisy conditions | |
CN115662464B (en) | Method and system for intelligently identifying environmental noise |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20170508 Address after: 100094, No. 4, building A, No. 1, building 2, wing Cheng North Road, No. 405-346, Beijing, Haidian District Patentee after: Beijing Rui Heng Heng Xun Technology Co., Ltd. Address before: 100190 Zhongguancun East Road, Beijing, No. 95, No. Patentee before: Institute of Automation, Chinese Academy of Sciences |
|
TR01 | Transfer of patent right |
Effective date of registration: 20181218 Address after: 100190 Zhongguancun East Road, Haidian District, Haidian District, Beijing Patentee after: Institute of Automation, Chinese Academy of Sciences Address before: 100094 No. 405-346, 4th floor, Building A, No. 1, Courtyard 2, Yongcheng North Road, Haidian District, Beijing Patentee before: Beijing Rui Heng Heng Xun Technology Co., Ltd. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20190528 Address after: 310019 1105, 11 / F, 4 building, 9 Ring Road, Jianggan District nine, Hangzhou, Zhejiang. Patentee after: Limit element (Hangzhou) intelligent Polytron Technologies Inc Address before: 100190 Zhongguancun East Road, Haidian District, Haidian District, Beijing Patentee before: Institute of Automation, Chinese Academy of Sciences |
|
TR01 | Transfer of patent right | ||
CP01 | Change in the name or title of a patent holder |
Address after: 310019 1105, 11 / F, 4 building, 9 Ring Road, Jianggan District nine, Hangzhou, Zhejiang. Patentee after: Zhongke extreme element (Hangzhou) Intelligent Technology Co., Ltd Address before: 310019 1105, 11 / F, 4 building, 9 Ring Road, Jianggan District nine, Hangzhou, Zhejiang. Patentee before: Limit element (Hangzhou) intelligent Polytron Technologies Inc. |
|
CP01 | Change in the name or title of a patent holder |