CN108711437A - 语音处理方法和装置 - Google Patents
语音处理方法和装置 Download PDFInfo
- Publication number
- CN108711437A CN108711437A CN201810184535.7A CN201810184535A CN108711437A CN 108711437 A CN108711437 A CN 108711437A CN 201810184535 A CN201810184535 A CN 201810184535A CN 108711437 A CN108711437 A CN 108711437A
- Authority
- CN
- China
- Prior art keywords
- zero
- voice signal
- crossing rate
- voice
- threshold value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012545 processing Methods 0.000 title claims abstract description 38
- 238000000034 method Methods 0.000 title claims abstract description 35
- 238000001514 detection method Methods 0.000 claims abstract description 80
- 230000000694 effects Effects 0.000 claims abstract description 38
- 239000013598 vector Substances 0.000 claims description 35
- 239000000284 extract Substances 0.000 claims description 14
- 238000011156 evaluation Methods 0.000 claims description 8
- 108010001267 Protein Subunits Proteins 0.000 claims description 2
- 238000004364 calculation method Methods 0.000 abstract description 12
- 238000010586 diagram Methods 0.000 description 15
- 238000005516 engineering process Methods 0.000 description 12
- 230000006854 communication Effects 0.000 description 8
- 238000004891 communication Methods 0.000 description 8
- 238000000605 extraction Methods 0.000 description 6
- 238000012549 training Methods 0.000 description 6
- 238000004590 computer program Methods 0.000 description 5
- 230000005236 sound signal Effects 0.000 description 5
- 230000006870 function Effects 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- NGVDGCNFYWLIFO-UHFFFAOYSA-N pyridoxal 5'-phosphate Chemical compound CC1=NC=C(COP(O)(O)=O)C(C=O)=C1O NGVDGCNFYWLIFO-UHFFFAOYSA-N 0.000 description 4
- 238000011946 reduction process Methods 0.000 description 4
- 238000003672 processing method Methods 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 238000009432 framing Methods 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 230000002411 adverse Effects 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000007175 bidirectional communication Effects 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000005265 energy consumption Methods 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000000465 moulding Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Probability & Statistics with Applications (AREA)
- Quality & Reliability (AREA)
- Telephone Function (AREA)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810184535.7A CN108711437A (zh) | 2018-03-06 | 2018-03-06 | 语音处理方法和装置 |
PCT/CN2018/082036 WO2019169685A1 (fr) | 2018-03-06 | 2018-04-04 | Procédé et dispositif de traitement de la parole et dispositif électronique |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810184535.7A CN108711437A (zh) | 2018-03-06 | 2018-03-06 | 语音处理方法和装置 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108711437A true CN108711437A (zh) | 2018-10-26 |
Family
ID=63866292
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810184535.7A Pending CN108711437A (zh) | 2018-03-06 | 2018-03-06 | 语音处理方法和装置 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN108711437A (fr) |
WO (1) | WO2019169685A1 (fr) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019169551A1 (fr) * | 2018-03-06 | 2019-09-12 | 深圳市沃特沃德股份有限公司 | Procédé et dispositif de traitement vocal et appareil électronique |
CN111696564A (zh) * | 2020-06-05 | 2020-09-22 | 北京搜狗科技发展有限公司 | 语音处理方法、装置和介质 |
CN112735469A (zh) * | 2020-10-28 | 2021-04-30 | 西安电子科技大学 | 低内存语音关键词检测方法、系统、介质、设备及终端 |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101308653A (zh) * | 2008-07-17 | 2008-11-19 | 安徽科大讯飞信息科技股份有限公司 | 一种应用于语音识别系统的端点检测方法 |
KR20090089552A (ko) * | 2008-02-19 | 2009-08-24 | 연세대학교 산학협력단 | 신호판별장치와 방법 및 음악신호 추출장치와 방법 |
CN103943104A (zh) * | 2014-04-15 | 2014-07-23 | 海信集团有限公司 | 一种语音信息识别的方法及终端设备 |
CN106328125A (zh) * | 2016-10-28 | 2017-01-11 | 许昌学院 | 一种河南方言语音识别系统 |
CN106328168A (zh) * | 2016-08-30 | 2017-01-11 | 成都普创通信技术股份有限公司 | 一种语音信号相似度检测方法 |
CN106601234A (zh) * | 2016-11-16 | 2017-04-26 | 华南理工大学 | 一种面向货物分拣的地名语音建模系统的实现方法 |
CN107045870A (zh) * | 2017-05-23 | 2017-08-15 | 南京理工大学 | 一种基于特征值编码的语音信号端点检测方法 |
CN107274911A (zh) * | 2017-05-03 | 2017-10-20 | 昆明理工大学 | 一种基于声音特征的相似度分析方法 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104700843A (zh) * | 2015-02-05 | 2015-06-10 | 海信集团有限公司 | 一种年龄识别的方法及装置 |
CN105721651B (zh) * | 2016-01-19 | 2018-10-26 | 海信集团有限公司 | 一种语音拨号方法和设备 |
JP6724511B2 (ja) * | 2016-04-12 | 2020-07-15 | 富士通株式会社 | 音声認識装置、音声認識方法および音声認識プログラム |
CN107610715B (zh) * | 2017-10-10 | 2021-03-02 | 昆明理工大学 | 一种基于多种声音特征的相似度计算方法 |
-
2018
- 2018-03-06 CN CN201810184535.7A patent/CN108711437A/zh active Pending
- 2018-04-04 WO PCT/CN2018/082036 patent/WO2019169685A1/fr active Application Filing
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20090089552A (ko) * | 2008-02-19 | 2009-08-24 | 연세대학교 산학협력단 | 신호판별장치와 방법 및 음악신호 추출장치와 방법 |
CN101308653A (zh) * | 2008-07-17 | 2008-11-19 | 安徽科大讯飞信息科技股份有限公司 | 一种应用于语音识别系统的端点检测方法 |
CN103943104A (zh) * | 2014-04-15 | 2014-07-23 | 海信集团有限公司 | 一种语音信息识别的方法及终端设备 |
CN106328168A (zh) * | 2016-08-30 | 2017-01-11 | 成都普创通信技术股份有限公司 | 一种语音信号相似度检测方法 |
CN106328125A (zh) * | 2016-10-28 | 2017-01-11 | 许昌学院 | 一种河南方言语音识别系统 |
CN106601234A (zh) * | 2016-11-16 | 2017-04-26 | 华南理工大学 | 一种面向货物分拣的地名语音建模系统的实现方法 |
CN107274911A (zh) * | 2017-05-03 | 2017-10-20 | 昆明理工大学 | 一种基于声音特征的相似度分析方法 |
CN107045870A (zh) * | 2017-05-23 | 2017-08-15 | 南京理工大学 | 一种基于特征值编码的语音信号端点检测方法 |
Non-Patent Citations (4)
Title |
---|
宋知用: "《MATLAB语音信号分析与合成》", 31 October 2017, 北京航空航天大学 * |
杨东慧 等: "《多媒体技术与应用项目教程》", 28 February 2018, 航空工业出版社 * |
杨娜: "《传感器与测试技术》", 31 July 2012, 航空工业出版社 * |
陆虎敏: "《飞机座舱显示与控制技术》", 31 December 2015, 航空工业出版社 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019169551A1 (fr) * | 2018-03-06 | 2019-09-12 | 深圳市沃特沃德股份有限公司 | Procédé et dispositif de traitement vocal et appareil électronique |
CN111696564A (zh) * | 2020-06-05 | 2020-09-22 | 北京搜狗科技发展有限公司 | 语音处理方法、装置和介质 |
CN111696564B (zh) * | 2020-06-05 | 2023-08-18 | 北京搜狗科技发展有限公司 | 语音处理方法、装置和介质 |
CN112735469A (zh) * | 2020-10-28 | 2021-04-30 | 西安电子科技大学 | 低内存语音关键词检测方法、系统、介质、设备及终端 |
CN112735469B (zh) * | 2020-10-28 | 2024-05-17 | 西安电子科技大学 | 低内存语音关键词检测方法、系统、介质、设备及终端 |
Also Published As
Publication number | Publication date |
---|---|
WO2019169685A1 (fr) | 2019-09-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108447506A (zh) | 语音处理方法和语音处理装置 | |
CN101930735B (zh) | 语音情感识别设备和进行语音情感识别的方法 | |
CN110797016B (zh) | 一种语音识别方法、装置、电子设备及存储介质 | |
CN103065629A (zh) | 一种仿人机器人的语音识别系统 | |
US8301578B2 (en) | System and method for tagging signals of interest in time variant data | |
CN103377651B (zh) | 语音自动合成装置及方法 | |
CN107492382A (zh) | 基于神经网络的声纹信息提取方法及装置 | |
CN108711437A (zh) | 语音处理方法和装置 | |
CN108597505A (zh) | 语音识别方法、装置及终端设备 | |
CN110222841A (zh) | 基于间距损失函数的神经网络训练方法和装置 | |
CN112634876A (zh) | 语音识别方法、装置、存储介质及电子设备 | |
CN107562760A (zh) | 一种语音数据处理方法及装置 | |
CN108986798B (zh) | 语音数据的处理方法、装置及设备 | |
CN108648769A (zh) | 语音活性检测方法、装置及设备 | |
CN113129867B (zh) | 语音识别模型的训练方法、语音识别方法、装置和设备 | |
CN107274892A (zh) | 说话人识别方法及装置 | |
CN113823293B (zh) | 一种基于语音增强的说话人识别方法及系统 | |
CN104732972A (zh) | 一种基于分组统计的hmm声纹识别签到方法及系统 | |
CN106033669A (zh) | 语音识别方法及装置 | |
Ying et al. | Characteristics of human auditory model based on compensation of glottal features in speech emotion recognition | |
CN105679323A (zh) | 一种号码发现方法及系统 | |
CN106340310B (zh) | 语音检测方法及装置 | |
Salian et al. | Speech Emotion Recognition using Time Distributed CNN and LSTM | |
CN107919136A (zh) | 一种基于高斯混合模型的数字语音采样频率估计方法 | |
Bansod et al. | Speaker Recognition using Marathi (Varhadi) Language |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20181026 |
|
RJ01 | Rejection of invention patent application after publication |