CN103839544A - 语音激活检测方法和装置 - Google Patents
语音激活检测方法和装置 Download PDFInfo
- Publication number
- CN103839544A CN103839544A CN201210488703.4A CN201210488703A CN103839544A CN 103839544 A CN103839544 A CN 103839544A CN 201210488703 A CN201210488703 A CN 201210488703A CN 103839544 A CN103839544 A CN 103839544A
- Authority
- CN
- China
- Prior art keywords
- unharmonic
- thr
- dull
- frequency
- weight
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 41
- 230000005236 sound signal Effects 0.000 claims abstract description 80
- 239000012634 fragment Substances 0.000 claims abstract description 46
- 230000004913 activation Effects 0.000 claims abstract description 30
- 238000001228 spectrum Methods 0.000 claims description 49
- 238000000034 method Methods 0.000 claims description 25
- 238000010183 spectrum analysis Methods 0.000 claims description 22
- 238000012545 processing Methods 0.000 claims description 10
- 238000013507 mapping Methods 0.000 claims description 7
- 230000003595 spectral effect Effects 0.000 claims description 6
- 238000009499 grossing Methods 0.000 claims description 3
- 238000010586 diagram Methods 0.000 description 18
- 238000004364 calculation method Methods 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 210000001624 hip Anatomy 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000012549 training Methods 0.000 description 1
Images
Landscapes
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Electrophonic Musical Instruments (AREA)
Abstract
Description
Claims (27)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210488703.4A CN103839544B (zh) | 2012-11-27 | 2012-11-27 | 语音激活检测方法和装置 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210488703.4A CN103839544B (zh) | 2012-11-27 | 2012-11-27 | 语音激活检测方法和装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103839544A true CN103839544A (zh) | 2014-06-04 |
CN103839544B CN103839544B (zh) | 2016-09-07 |
Family
ID=50802978
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210488703.4A Active CN103839544B (zh) | 2012-11-27 | 2012-11-27 | 语音激活检测方法和装置 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103839544B (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106571150A (zh) * | 2015-10-12 | 2017-04-19 | 阿里巴巴集团控股有限公司 | 定位音乐人声区的方法和系统 |
TWI659412B (zh) * | 2016-10-11 | 2019-05-11 | 中國商芋頭科技(杭州)有限公司 | 一種語音激活檢測方法及裝置 |
CN111554315A (zh) * | 2020-05-29 | 2020-08-18 | 展讯通信(天津)有限公司 | 单通道语音增强方法及装置、存储介质、终端 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5459814A (en) * | 1993-03-26 | 1995-10-17 | Hughes Aircraft Company | Voice activity detector for speech signals in variable background noise |
CN1242553A (zh) * | 1998-03-24 | 2000-01-26 | 松下电器产业株式会社 | 用于噪声环境的语音检测系统 |
US20020188445A1 (en) * | 2001-06-01 | 2002-12-12 | Dunling Li | Background noise estimation method for an improved G.729 annex B compliant voice activity detection circuit |
JP2010529494A (ja) * | 2007-06-07 | 2010-08-26 | 華為技術有限公司 | 音声活動を検出するための装置および方法 |
CN101853661A (zh) * | 2010-05-14 | 2010-10-06 | 中国科学院声学研究所 | 基于非监督学习的噪声谱估计与语音活动度检测方法 |
-
2012
- 2012-11-27 CN CN201210488703.4A patent/CN103839544B/zh active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5459814A (en) * | 1993-03-26 | 1995-10-17 | Hughes Aircraft Company | Voice activity detector for speech signals in variable background noise |
CN1242553A (zh) * | 1998-03-24 | 2000-01-26 | 松下电器产业株式会社 | 用于噪声环境的语音检测系统 |
US20020188445A1 (en) * | 2001-06-01 | 2002-12-12 | Dunling Li | Background noise estimation method for an improved G.729 annex B compliant voice activity detection circuit |
JP2010529494A (ja) * | 2007-06-07 | 2010-08-26 | 華為技術有限公司 | 音声活動を検出するための装置および方法 |
CN101853661A (zh) * | 2010-05-14 | 2010-10-06 | 中国科学院声学研究所 | 基于非监督学习的噪声谱估计与语音活动度检测方法 |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106571150A (zh) * | 2015-10-12 | 2017-04-19 | 阿里巴巴集团控股有限公司 | 定位音乐人声区的方法和系统 |
TWI659412B (zh) * | 2016-10-11 | 2019-05-11 | 中國商芋頭科技(杭州)有限公司 | 一種語音激活檢測方法及裝置 |
CN111554315A (zh) * | 2020-05-29 | 2020-08-18 | 展讯通信(天津)有限公司 | 单通道语音增强方法及装置、存储介质、终端 |
CN111554315B (zh) * | 2020-05-29 | 2022-07-15 | 展讯通信(天津)有限公司 | 单通道语音增强方法及装置、存储介质、终端 |
Also Published As
Publication number | Publication date |
---|---|
CN103839544B (zh) | 2016-09-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103646649B (zh) | 一种高效的语音检测方法 | |
Gonzalez et al. | PEFAC-A pitch estimation algorithm robust to high levels of noise | |
CN108896878B (zh) | 一种基于超声波的局部放电检测方法 | |
US7499686B2 (en) | Method and apparatus for multi-sensory speech enhancement on a mobile device | |
CN103594094B (zh) | 自适应谱减法实时语音增强 | |
Evangelopoulos et al. | Multiband modulation energy tracking for noisy speech detection | |
CN107316653B (zh) | 一种基于改进的经验小波变换的基频检测方法 | |
Krishnamoorthy et al. | Enhancement of noisy speech by temporal and spectral processing | |
US9454976B2 (en) | Efficient discrimination of voiced and unvoiced sounds | |
CN109378013B (zh) | 一种语音降噪方法 | |
CN101968957A (zh) | 一种噪声条件下的语音检测方法 | |
CN101320566A (zh) | 基于多带谱减法的非空气传导语音增强方法 | |
Khoa | Noise robust voice activity detection | |
CN105575405A (zh) | 一种双麦克风语音激活检测方法及语音采集设备 | |
CN103996399A (zh) | 语音检测方法和系统 | |
CN103839544A (zh) | 语音激活检测方法和装置 | |
Sarkar et al. | Automatic speech segmentation using average level crossing rate information | |
Meduri et al. | A survey and evaluation of voice activity detection algorithms | |
CN102789780B (zh) | 基于谱时幅度分级向量辨识环境声音事件的方法 | |
Jin et al. | An improved speech endpoint detection based on spectral subtraction and adaptive sub-band spectral entropy | |
Jamaludin et al. | An improved time domain pitch detection algorithm for pathological voice | |
Dov et al. | Voice activity detection in presence of transients using the scattering transform | |
Patil et al. | Classification of normal and pathological voices using TEO phase and Mel cepstral features | |
Sanam et al. | Teager energy operation on wavelet packet coefficients for enhancing noisy speech using a hard thresholding function | |
Képesi et al. | High-resolution noise-robust spectral-based pitch estimation. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20170204 Address after: Room 32, building 3205F, No. 707, Zhang Yang Road, free trade zone,, China (Shanghai) Patentee after: Xin Xin Finance Leasing Co.,Ltd. Address before: 201203 Shanghai city Zuchongzhi road Pudong New Area Zhangjiang hi tech park, Spreadtrum Center Building 1, Lane 2288 Patentee before: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20170707 Address after: Room 2062, Wenstin administration apartment, No. 9 Financial Street B, Beijing, Xicheng District Patentee after: Xin Xin finance leasing (Beijing) Co.,Ltd. Address before: Room 32, building 707, Zhang Yang Road, China (Shanghai) free trade zone, 3205F Patentee before: Xin Xin Finance Leasing Co.,Ltd. |
|
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20140604 Assignee: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd. Assignor: Xin Xin finance leasing (Beijing) Co.,Ltd. Contract record no.: 2018990000163 Denomination of invention: Voice activity detection method and apparatus Granted publication date: 20160907 License type: Exclusive License Record date: 20180626 |
|
EE01 | Entry into force of recordation of patent licensing contract | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20200306 Address after: 201203 Zuchongzhi Road, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai 2288 Patentee after: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd. Address before: 100033 room 2062, Wenstin administrative apartments, 9 Financial Street B, Xicheng District, Beijing. Patentee before: Xin Xin finance leasing (Beijing) Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20200529 Address after: 361012 unit 05, 8 / F, building D, Xiamen international shipping center, No.97 Xiangyu Road, Xiamen area, China (Fujian) free trade zone, Xiamen City, Fujian Province Patentee after: Xinxin Finance Leasing (Xiamen) Co.,Ltd. Address before: 201203 Zuchongzhi Road, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai 2288 Patentee before: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd. |
|
EC01 | Cancellation of recordation of patent licensing contract | ||
EC01 | Cancellation of recordation of patent licensing contract |
Assignee: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd. Assignor: Xin Xin finance leasing (Beijing) Co.,Ltd. Contract record no.: 2018990000163 Date of cancellation: 20210301 |
|
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20140604 Assignee: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd. Assignor: Xinxin Finance Leasing (Xiamen) Co.,Ltd. Contract record no.: X2021110000010 Denomination of invention: Voice activation detection method and device Granted publication date: 20160907 License type: Exclusive License Record date: 20210317 |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20230724 Address after: 201203 Shanghai city Zuchongzhi road Pudong New Area Zhangjiang hi tech park, Spreadtrum Center Building 1, Lane 2288 Patentee after: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd. Address before: 361012 unit 05, 8 / F, building D, Xiamen international shipping center, 97 Xiangyu Road, Xiamen area, China (Fujian) pilot Free Trade Zone, Xiamen City, Fujian Province Patentee before: Xinxin Finance Leasing (Xiamen) Co.,Ltd. |