CN102129860A - Text-dependent speaker recognition method based on an infinite-state hidden Markov model - Google Patents
- Publication number
- CN102129860A (application number CN2011100858447A, also written CN201110085844A)
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Landscapes
- Complex Calculations (AREA)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011100858447A CN102129860B (zh) | 2011-04-07 | 2011-04-07 | Text-dependent speaker recognition method based on an infinite-state hidden Markov model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011100858447A CN102129860B (zh) | 2011-04-07 | 2011-04-07 | Text-dependent speaker recognition method based on an infinite-state hidden Markov model |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102129860A (zh) | 2011-07-20 |
CN102129860B (zh) | 2012-07-04 |
Family
ID=44267916
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011100858447A (Expired - Fee Related) CN102129860B (zh) | 2011-04-07 | 2011-04-07 | Text-dependent speaker recognition method based on an infinite-state hidden Markov model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102129860B (zh) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
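CN102355439A (zh) * | 2011-08-11 | 2012-02-15 | Wei Xin | Blind detection method for modulated signals in communication systems based on a t mixture model with an infinite number of components |
CN103514878A (zh) * | 2012-06-27 | 2014-01-15 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Acoustic modeling method and device, and speech recognition method and device |
CN105556546A (zh) * | 2013-09-20 | 2016-05-04 | NEC Corporation | Hierarchical latent variable model estimation device, hierarchical latent variable model estimation method, supply amount prediction device, supply amount prediction method, and recording medium |
CN106683661A (zh) * | 2015-11-05 | 2017-05-17 | Alibaba Group Holding Limited | Voice-based role separation method and device |
CN107342076A (zh) * | 2017-07-11 | 2017-11-10 | South China University of Technology | Smart home control system and method compatible with abnormal speech |
CN107610708A (zh) * | 2017-06-09 | 2018-01-19 | Ping An Technology (Shenzhen) Co., Ltd. | Method and device for voiceprint recognition |
CN107690651A (zh) * | 2015-04-16 | 2018-02-13 | Robert Bosch GmbH | System and method for automated sign language recognition |
CN108766419A (zh) * | 2018-05-04 | 2018-11-06 | South China University of Technology | Abnormal speech discrimination method based on deep learning |
CN109119064A (zh) * | 2018-09-05 | 2019-01-01 | Southeast University | Implementation method of a spoken English teaching system suitable for flipped classrooms |
CN110188338A (zh) * | 2018-02-23 | 2019-08-30 | Fujitsu Limited | Text-dependent speaker verification method and device |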
US10460245B2 (en) * | 2015-09-04 | 2019-10-29 | Civitas Learning, Inc. | Flexible, personalized student success modeling for institutions with complex term structures and competency-based education |
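CN112002343A (zh) * | 2020-08-18 | 2020-11-27 | Haier Uplus Intelligent Technology (Beijing) Co., Ltd. | Speech purity recognition method and device, storage medium, and electronic device |
WO2021127975A1 (zh) * | 2019-12-24 | 2021-07-01 | Guangzhou Guoyin Intelligent Technology Co., Ltd. | Voiceprint detection method, apparatus, and device for a sound acquisition object |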
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
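CN1171592A (zh) * | 1996-05-01 | 1998-01-28 | Microsoft Corporation | Method and system for speech recognition using continuous density hidden Markov models |
CN1787076A (zh) * | 2005-12-13 | 2006-06-14 | Zhejiang University | Speaker recognition method based on a hybrid support vector machine |
WO2006109515A1 (ja) * | 2005-03-31 | 2006-10-19 | Pioneer Corporation | Operator recognition device, operator recognition method, and operator recognition program |
KR100673834B1 (ko) * | 2004-12-03 | 2007-01-24 | Ko Han-seok | Context-prompted speaker-independent authentication system and method |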
- 2011-04-07: CN application CN2011100858447A (patent CN102129860B), not active, Expired - Fee Related
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
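CN1171592A (zh) * | 1996-05-01 | 1998-01-28 | Microsoft Corporation | Method and system for speech recognition using continuous density hidden Markov models |
KR100673834B1 (ko) * | 2004-12-03 | 2007-01-24 | Ko Han-seok | Context-prompted speaker-independent authentication system and method |
WO2006109515A1 (ja) * | 2005-03-31 | 2006-10-19 | Pioneer Corporation | Operator recognition device, operator recognition method, and operator recognition program |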
US20090254757A1 (en) * | 2005-03-31 | 2009-10-08 | Pioneer Corporation | Operator recognition device, operator recognition method and operator recognition program |
CN1787076A (zh) * | 2005-12-13 | 2006-06-14 | Zhejiang University | Speaker recognition method based on a hybrid support vector machine |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
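CN102355439A (zh) * | 2011-08-11 | 2012-02-15 | Wei Xin | Blind detection method for modulated signals in communication systems based on a t mixture model with an infinite number of components |
CN103514878A (zh) * | 2012-06-27 | 2014-01-15 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Acoustic modeling method and device, and speech recognition method and device |
CN105556546B (zh) * | 2013-09-20 | 2019-01-08 | NEC Corporation | Hierarchical latent variable model estimation device, hierarchical latent variable model estimation method, supply amount prediction device, supply amount prediction method, and recording medium |
CN105556546A (zh) * | 2013-09-20 | 2016-05-04 | NEC Corporation | Hierarchical latent variable model estimation device, hierarchical latent variable model estimation method, supply amount prediction device, supply amount prediction method, and recording medium |
CN107690651A (zh) * | 2015-04-16 | 2018-02-13 | Robert Bosch GmbH | System and method for automated sign language recognition |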
US10460245B2 (en) * | 2015-09-04 | 2019-10-29 | Civitas Learning, Inc. | Flexible, personalized student success modeling for institutions with complex term structures and competency-based education |
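CN106683661A (zh) * | 2015-11-05 | 2017-05-17 | Alibaba Group Holding Limited | Voice-based role separation method and device |
CN107610708A (zh) * | 2017-06-09 | 2018-01-19 | Ping An Technology (Shenzhen) Co., Ltd. | Method and device for voiceprint recognition |
CN107342076B (zh) * | 2017-07-11 | 2020-09-22 | South China University of Technology | Smart home control system and method compatible with abnormal speech |
CN107342076A (zh) * | 2017-07-11 | 2017-11-10 | South China University of Technology | Smart home control system and method compatible with abnormal speech |
CN110188338A (zh) * | 2018-02-23 | 2019-08-30 | Fujitsu Limited | Text-dependent speaker verification method and device |
CN110188338B (zh) * | 2018-02-23 | 2023-02-21 | Fujitsu Limited | Text-dependent speaker verification method and device |
CN108766419A (zh) * | 2018-05-04 | 2018-11-06 | South China University of Technology | Abnormal speech discrimination method based on deep learning |
CN109119064A (zh) * | 2018-09-05 | 2019-01-01 | Southeast University | Implementation method of a spoken English teaching system suitable for flipped classrooms |
WO2021127975A1 (zh) * | 2019-12-24 | 2021-07-01 | Guangzhou Guoyin Intelligent Technology Co., Ltd. | Voiceprint detection method, apparatus, and device for a sound acquisition object |
CN112002343A (zh) * | 2020-08-18 | 2020-11-27 | Haier Uplus Intelligent Technology (Beijing) Co., Ltd. | Speech purity recognition method and device, storage medium, and electronic device |
CN112002343B (zh) * | 2020-08-18 | 2024-01-23 | Haier Uplus Intelligent Technology (Beijing) Co., Ltd. | Speech purity recognition method and device, storage medium, and electronic device |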
Also Published As
Publication number | Publication date |
---|---|
CN102129860B (zh) | 2012-07-04 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN102129860B (zh) | Text-dependent speaker recognition method based on an infinite-state hidden Markov model | |
US9536525B2 (en) | Speaker indexing device and speaker indexing method | |
US9595257B2 (en) | Downsampling schemes in a hierarchical neural network structure for phoneme recognition | |
US5684925A (en) | Speech representation by feature-based word prototypes comprising phoneme targets having reliable high similarity | |
US6226612B1 (en) | Method of evaluating an utterance in a speech recognition system | |
Prasad et al. | Improved cepstral mean and variance normalization using Bayesian framework | |
EP2189976B1 (en) | Method for adapting a codebook for speech recognition | |
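KR100307623B1 (ko) | Method and apparatus for discriminative estimation of parameters under MAP speaker adaptation conditions, and speech recognition method and apparatus each including the same | |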
EP0453649B1 (en) | Method and apparatus for modeling words with composite Markov models | |
CN101645269A (zh) | Language identification system and method | |
CN110189746B (zh) | Speech recognition method applied to ground-air communication | |
US7617101B2 (en) | Method and system for utterance verification | |
CN102945670A (zh) | Multi-environment feature compensation method for speech recognition systems | |
Singh et al. | Model compensation and matched condition methods for robust speech recognition | |
EP2903003A1 (en) | Online maximum-likelihood mean and variance normalization for speech recognition | |
CN104485108A (zh) | Joint noise and speaker compensation method based on a multi-speaker model | |
JP3298858B2 (ja) | Segment-based similarity method for low-complexity speech recognizers | |
WO2010035892A1 (en) | Speech recognition method | |
US20040122672A1 (en) | Gaussian model-based dynamic time warping system and method for speech processing | |
Seneviratne et al. | Noise Robust Acoustic to Articulatory Speech Inversion. | |
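JP4960845B2 (ja) | Speech parameter learning device and method, speech recognition device and speech recognition method using them, and program and recording medium therefor | |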
US20050027530A1 (en) | Audio-visual speaker identification using coupled hidden markov models | |
CN102237082B (zh) | Adaptive method for speech recognition systems | |
US20040083102A1 (en) | Method of automatic processing of a speech signal | |
Shahin | Improving speaker identification performance under the shouted talking condition using the second-order hidden Markov models |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
ASS | Succession or assignment of patent right | Owner name: NANJING POST + TELECOMMUNICATION UNIV. Free format text: FORMER OWNER: WEI XIN. Effective date: 20120203 |
C41 | Transfer of patent application or patent right or utility model | |
C53 | Correction of patent for invention or patent application | |
CB03 | Change of inventor or designer information | Inventor after: Wei Xin, Yang Zhen, Li Chunguang. Inventor before: Wei Xin |
COR | Change of bibliographic data | Free format text: CORRECT: ADDRESS; FROM: 210096 NANJING, JIANGSU PROVINCE TO: 210003 NANJING, JIANGSU PROVINCE. Free format text: CORRECT: INVENTOR; FROM: WEI XIN TO: WEI XIN YANG ZHEN LI CHUNGUANG |
TA01 | Transfer of patent application right | Effective date of registration: 20120203. Address after: No. 66 Xinmofan Road, Nanjing, Jiangsu Province, 210003. Applicant after: Nanjing Post & Telecommunication Univ. Address before: School of Information Science and Engineering, Southeast University, No. 2 Sipailou, Nanjing, Jiangsu, 210096. Applicant before: Wei Xin |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
C17 | Cessation of patent right | |
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 20120704. Termination date: 20140407 |