CN102129860A - 基于无限状态隐马尔可夫模型的与文本相关的说话人识别方法 - Google Patents
基于无限状态隐马尔可夫模型的与文本相关的说话人识别方法 Download PDFInfo
- Publication number
- CN102129860A CN102129860A CN2011100858447A CN201110085844A CN102129860A CN 102129860 A CN102129860 A CN 102129860A CN 2011100858447 A CN2011100858447 A CN 2011100858447A CN 201110085844 A CN201110085844 A CN 201110085844A CN 102129860 A CN102129860 A CN 102129860A
- Authority
- CN
- China
- Prior art keywords
- mrow
- msub
- msubsup
- math
- sigma
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 56
- 238000012549 training Methods 0.000 claims abstract description 50
- 238000009826 distribution Methods 0.000 claims abstract description 40
- 238000007781 pre-processing Methods 0.000 claims abstract description 9
- 238000004364 calculation method Methods 0.000 claims abstract description 8
- 238000000605 extraction Methods 0.000 claims abstract description 8
- 238000005315 distribution function Methods 0.000 claims abstract description 7
- 239000013598 vector Substances 0.000 claims description 11
- 239000011159 matrix material Substances 0.000 claims description 7
- 238000004422 calculation algorithm Methods 0.000 claims description 6
- 238000005070 sampling Methods 0.000 claims description 6
- 238000013139 quantization Methods 0.000 claims description 5
- 238000009432 framing Methods 0.000 claims description 3
- 238000010606 normalization Methods 0.000 claims description 3
- 230000007704 transition Effects 0.000 claims description 3
- 238000007476 Maximum Likelihood Methods 0.000 abstract 1
- 239000000203 mixture Substances 0.000 description 15
- 230000001419 dependent effect Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- NAWXUBYGYWOOIX-SFHVURJKSA-N (2s)-2-[[4-[2-(2,4-diaminoquinazolin-6-yl)ethyl]benzoyl]amino]-4-methylidenepentanedioic acid Chemical compound C1=CC2=NC(N)=NC(N)=C2C=C1CCC1=CC=C(C(=O)N[C@@H](CC(=C)C(O)=O)C(O)=O)C=C1 NAWXUBYGYWOOIX-SFHVURJKSA-N 0.000 description 1
- 238000012935 Averaging Methods 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Landscapes
- Complex Calculations (AREA)
Abstract
Description
Claims (2)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011100858447A CN102129860B (zh) | 2011-04-07 | 2011-04-07 | 基于无限状态隐马尔可夫模型的与文本相关的说话人识别方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011100858447A CN102129860B (zh) | 2011-04-07 | 2011-04-07 | 基于无限状态隐马尔可夫模型的与文本相关的说话人识别方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102129860A true CN102129860A (zh) | 2011-07-20 |
CN102129860B CN102129860B (zh) | 2012-07-04 |
Family
ID=44267916
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2011100858447A Expired - Fee Related CN102129860B (zh) | 2011-04-07 | 2011-04-07 | 基于无限状态隐马尔可夫模型的与文本相关的说话人识别方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102129860B (zh) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102355439A (zh) * | 2011-08-11 | 2012-02-15 | 魏昕 | 通信系统中基于无限成分数的t混合模型的调制信号的盲检测方法 |
CN103514878A (zh) * | 2012-06-27 | 2014-01-15 | 北京百度网讯科技有限公司 | 声学建模方法及装置和语音识别方法及装置 |
CN105556546A (zh) * | 2013-09-20 | 2016-05-04 | 日本电气株式会社 | 分层隐变量模型估计设备、分层隐变量模型估计方法、供应量预测设备、供应量预测方法、以及记录介质 |
CN106683661A (zh) * | 2015-11-05 | 2017-05-17 | 阿里巴巴集团控股有限公司 | 基于语音的角色分离方法及装置 |
CN107342076A (zh) * | 2017-07-11 | 2017-11-10 | 华南理工大学 | 一种兼容非常态语音的智能家居控制系统及方法 |
CN107610708A (zh) * | 2017-06-09 | 2018-01-19 | 平安科技(深圳)有限公司 | 识别声纹的方法及设备 |
CN107690651A (zh) * | 2015-04-16 | 2018-02-13 | 罗伯特·博世有限公司 | 用于自动化手语识别的系统和方法 |
CN108766419A (zh) * | 2018-05-04 | 2018-11-06 | 华南理工大学 | 一种基于深度学习的非常态语音区别方法 |
CN109119064A (zh) * | 2018-09-05 | 2019-01-01 | 东南大学 | 一种适用于翻转课堂的英语口语教学系统的实现方法 |
CN110188338A (zh) * | 2018-02-23 | 2019-08-30 | 富士通株式会社 | 文本相关的说话人确认方法和设备 |
US10460245B2 (en) * | 2015-09-04 | 2019-10-29 | Civitas Learning, Inc. | Flexible, personalized student success modeling for institutions with complex term structures and competency-based education |
CN112002343A (zh) * | 2020-08-18 | 2020-11-27 | 海尔优家智能科技(北京)有限公司 | 语音纯度的识别方法、装置、存储介质及电子装置 |
WO2021127975A1 (zh) * | 2019-12-24 | 2021-07-01 | 广州国音智能科技有限公司 | 一种声音采集对象声纹检测方法、装置和设备 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1171592A (zh) * | 1996-05-01 | 1998-01-28 | 微软公司 | 采用连续密度隐藏式马尔克夫模型的语音识别方法和系统 |
CN1787076A (zh) * | 2005-12-13 | 2006-06-14 | 浙江大学 | 基于混合支持向量机的说话人识别方法 |
WO2006109515A1 (ja) * | 2005-03-31 | 2006-10-19 | Pioneer Corporation | 操作者認識装置、操作者認識方法、および、操作者認識プログラム |
KR100673834B1 (ko) * | 2004-12-03 | 2007-01-24 | 고한석 | 문맥 요구형 화자 독립 인증 시스템 및 방법 |
-
2011
- 2011-04-07 CN CN2011100858447A patent/CN102129860B/zh not_active Expired - Fee Related
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1171592A (zh) * | 1996-05-01 | 1998-01-28 | 微软公司 | 采用连续密度隐藏式马尔克夫模型的语音识别方法和系统 |
KR100673834B1 (ko) * | 2004-12-03 | 2007-01-24 | 고한석 | 문맥 요구형 화자 독립 인증 시스템 및 방법 |
WO2006109515A1 (ja) * | 2005-03-31 | 2006-10-19 | Pioneer Corporation | 操作者認識装置、操作者認識方法、および、操作者認識プログラム |
US20090254757A1 (en) * | 2005-03-31 | 2009-10-08 | Pioneer Corporation | Operator recognition device, operator recognition method and operator recognition program |
CN1787076A (zh) * | 2005-12-13 | 2006-06-14 | 浙江大学 | 基于混合支持向量机的说话人识别方法 |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102355439A (zh) * | 2011-08-11 | 2012-02-15 | 魏昕 | 通信系统中基于无限成分数的t混合模型的调制信号的盲检测方法 |
CN103514878A (zh) * | 2012-06-27 | 2014-01-15 | 北京百度网讯科技有限公司 | 声学建模方法及装置和语音识别方法及装置 |
CN105556546B (zh) * | 2013-09-20 | 2019-01-08 | 日本电气株式会社 | 分层隐变量模型估计设备、分层隐变量模型估计方法、供应量预测设备、供应量预测方法、以及记录介质 |
CN105556546A (zh) * | 2013-09-20 | 2016-05-04 | 日本电气株式会社 | 分层隐变量模型估计设备、分层隐变量模型估计方法、供应量预测设备、供应量预测方法、以及记录介质 |
CN107690651A (zh) * | 2015-04-16 | 2018-02-13 | 罗伯特·博世有限公司 | 用于自动化手语识别的系统和方法 |
US10460245B2 (en) * | 2015-09-04 | 2019-10-29 | Civitas Learning, Inc. | Flexible, personalized student success modeling for institutions with complex term structures and competency-based education |
CN106683661A (zh) * | 2015-11-05 | 2017-05-17 | 阿里巴巴集团控股有限公司 | 基于语音的角色分离方法及装置 |
CN107610708A (zh) * | 2017-06-09 | 2018-01-19 | 平安科技(深圳)有限公司 | 识别声纹的方法及设备 |
CN107342076B (zh) * | 2017-07-11 | 2020-09-22 | 华南理工大学 | 一种兼容非常态语音的智能家居控制系统及方法 |
CN107342076A (zh) * | 2017-07-11 | 2017-11-10 | 华南理工大学 | 一种兼容非常态语音的智能家居控制系统及方法 |
CN110188338A (zh) * | 2018-02-23 | 2019-08-30 | 富士通株式会社 | 文本相关的说话人确认方法和设备 |
CN110188338B (zh) * | 2018-02-23 | 2023-02-21 | 富士通株式会社 | 文本相关的说话人确认方法和设备 |
CN108766419A (zh) * | 2018-05-04 | 2018-11-06 | 华南理工大学 | 一种基于深度学习的非常态语音区别方法 |
CN109119064A (zh) * | 2018-09-05 | 2019-01-01 | 东南大学 | 一种适用于翻转课堂的英语口语教学系统的实现方法 |
WO2021127975A1 (zh) * | 2019-12-24 | 2021-07-01 | 广州国音智能科技有限公司 | 一种声音采集对象声纹检测方法、装置和设备 |
CN112002343A (zh) * | 2020-08-18 | 2020-11-27 | 海尔优家智能科技(北京)有限公司 | 语音纯度的识别方法、装置、存储介质及电子装置 |
CN112002343B (zh) * | 2020-08-18 | 2024-01-23 | 海尔优家智能科技(北京)有限公司 | 语音纯度的识别方法、装置、存储介质及电子装置 |
Also Published As
Publication number | Publication date |
---|---|
CN102129860B (zh) | 2012-07-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102129860B (zh) | 基于无限状态隐马尔可夫模型的与文本相关的说话人识别方法 | |
US7617103B2 (en) | Incrementally regulated discriminative margins in MCE training for speech recognition | |
US5684925A (en) | Speech representation by feature-based word prototypes comprising phoneme targets having reliable high similarity | |
US6226612B1 (en) | Method of evaluating an utterance in a speech recognition system | |
US9595257B2 (en) | Downsampling schemes in a hierarchical neural network structure for phoneme recognition | |
US7672847B2 (en) | Discriminative training of hidden Markov models for continuous speech recognition | |
US6223159B1 (en) | Speaker adaptation device and speech recognition device | |
KR100307623B1 (ko) | 엠.에이.피 화자 적응 조건에서 파라미터의 분별적 추정 방법 및 장치 및 이를 각각 포함한 음성 인식 방법 및 장치 | |
EP2189976A1 (en) | Method for adapting a codebook for speech recognition | |
CN114387997B (zh) | 一种基于深度学习的语音情感识别方法 | |
CN102034472A (zh) | 一种基于嵌入时延神经网络的高斯混合模型的说话人识别方法 | |
EP0453649A2 (en) | Method and apparatus for modeling words with composite Markov models | |
CN101452701B (zh) | 基于反模型的置信度估计方法及装置 | |
EP1514258B1 (en) | Frequency distribution of minimum vector distance for dynamic time warping | |
US6526379B1 (en) | Discriminative clustering methods for automatic speech recognition | |
US7617101B2 (en) | Method and system for utterance verification | |
US20100076759A1 (en) | Apparatus and method for recognizing a speech | |
US20050015251A1 (en) | High-order entropy error functions for neural classifiers | |
US20040122672A1 (en) | Gaussian model-based dynamic time warping system and method for speech processing | |
Almpanidis et al. | Phonemic segmentation using the generalised Gamma distribution and small sample Bayesian information criterion | |
US20040181409A1 (en) | Speech recognition using model parameters dependent on acoustic environment | |
JP4960845B2 (ja) | 音声パラメータ学習装置とその方法、それらを用いた音声認識装置と音声認識方法、それらのプログラムと記録媒体 | |
CN118711611A (zh) | 基于音素标识扰动的听觉数据安全评测方法 | |
CN104240699B (zh) | 一种简单有效的短语语音识别方法 | |
US6275799B1 (en) | Reference pattern learning system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
ASS | Succession or assignment of patent right |
Owner name: NANJING POST + TELECOMMUNICATION UNIV. Free format text: FORMER OWNER: WEI XIN Effective date: 20120203 |
|
C41 | Transfer of patent application or patent right or utility model | ||
C53 | Correction of patent for invention or patent application | ||
CB03 | Change of inventor or designer information |
Inventor after: Wei Cuan Inventor after: Yang Zhen Inventor after: Li Chunguang Inventor before: Wei Cuan |
|
COR | Change of bibliographic data |
Free format text: CORRECT: ADDRESS; FROM: 210096 NANJING, JIANGSU PROVINCE TO: 210003 NANJING, JIANGSU PROVINCE Free format text: CORRECT: INVENTOR; FROM: WEI XIN TO: WEI XIN YANG ZHEN LI CHUNGUANG |
|
TA01 | Transfer of patent application right |
Effective date of registration: 20120203 Address after: 210003 Nanjing City, Jiangsu Province, the new model road No. 66 Applicant after: Nanjing Post & Telecommunication Univ. Address before: 210096 School of information science and engineering, Southeast University, No. four, 2 arch, Jiangsu, Nanjing Applicant before: Wei Cuan |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20120704 Termination date: 20140407 |