CN104765996B - Voiceprint password authentication method and system - Google Patents
Voiceprint password authentication method and system Download PDFInfo
- Publication number
- CN104765996B CN104765996B CN201410005651.XA CN201410005651A CN104765996B CN 104765996 B CN104765996 B CN 104765996B CN 201410005651 A CN201410005651 A CN 201410005651A CN 104765996 B CN104765996 B CN 104765996B
- Authority
- CN
- China
- Prior art keywords
- model
- cryptogram
- vocal print
- user
- unit
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 49
- 230000001755 vocal effect Effects 0.000 claims abstract description 139
- 238000012549 training Methods 0.000 claims abstract description 69
- 238000012546 transfer Methods 0.000 claims description 6
- 238000000605 extraction Methods 0.000 claims description 5
- 230000008859 change Effects 0.000 abstract description 4
- 230000003044 adaptive effect Effects 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 230000000694 effects Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 230000003542 behavioural effect Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000008092 positive effect Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 238000013179 statistical model Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Landscapes
- Collating Specific Patterns (AREA)
Abstract
Description
Claims (18)
- A kind of 1. voiceprint password authentication method, it is characterised in that including:Receive voice signal input by user;Speech recognition is carried out to the voice signal, obtains cryptogram;Determine whether there is the corresponding back of the body of the cryptogram trained beforehand through the password voice data gathered offline Scape model;If it is, obtain the background model;If it is not, then according to the cryptogram to training obtained pronunciation list in advance Meta-model is spliced, and obtains the corresponding background model of the cryptogram;Using the vocal print feature sequence in the voice signal, the background model and the vocal print cryptogram-modle of the user to institute User is stated to be authenticated.
- 2. according to the method described in claim 1, it is characterized in that, training obtains pronunciation unit mould in advance in the following way Type:Obtain training voice data;Pronunciation unit is determined according to the trained voice data;Determine the topological structure of the acoustic model of the pronunciation unit;Parameter training is carried out to the acoustic model, obtains pronunciation unit model.
- 3. according to the method described in claim 2, it is characterized in that, the acoustic model is GMM model;It is described that the pronunciation unit model that training obtains in advance is spliced according to the cryptogram, obtain the cryptogram Corresponding background model includes:The corresponding GMM model of each pronunciation unit in the cryptogram is obtained, obtains GMM model set;To the model unit in the GMM model set, spliced using equal weight, obtain new combination GMM model;The Gauss weight of the combination GMM model is updated so that the sum of Gauss weight of the combination GMM model is 1, Obtain the corresponding background model of the cryptogram.
- 4. according to the method described in claim 2, it is characterized in that, the acoustic model is GMM model;It is described that the pronunciation unit model that training obtains in advance is spliced according to the cryptogram, obtain the cryptogram Corresponding background model includes:The corresponding GMM model of each pronunciation unit in the cryptogram is obtained, obtains GMM model sequence;After splicing successively at least two model units in the GMM model sequence, using default from redirecting probability and outer Redirect probability to carry out redirecting transfer, obtain the corresponding background model of the cryptogram, wherein, it is described to redirect probability and outer jump certainly Turn the sum of probability as 1.
- 5. according to the method described in claim 2, it is characterized in that, the acoustic model is HMM model;It is described that the pronunciation unit model that training obtains in advance is spliced according to the cryptogram, obtain the cryptogram Corresponding background model includes:Obtain the corresponding HMM model sequence of each pronunciation unit in the cryptogram;After splicing successively to the model unit in the HMM model sequence, the corresponding background model of the cryptogram is obtained.
- 6. the according to the method described in claim 1, it is characterized in that, vocal print feature sequence using in the voice signal Row, the background model and the vocal print cryptogram-modle of the user are authenticated including to the user:First likelihood score of the vocal print feature sequence relative to the vocal print cryptogram-modle, and vocal print spy are calculated respectively Levy second likelihood score of the sequence relative to the background model;According to first likelihood score and the ratio and predetermined threshold value of second likelihood score, determine whether user is legal use Family.
- 7. method according to any one of claims 1 to 6, it is characterised in that the method further includes:Before speech recognition is carried out to the voice signal or after the corresponding background model of the cryptogram is obtained, Extract the vocal print feature sequence in the voice signal.
- 8. method according to any one of claims 1 to 6, it is characterised in that the method further includes:Before speech recognition is carried out to the voice signal or after the corresponding background model of the cryptogram is obtained, Obtain the vocal print cryptogram-modle of the user.
- 9. according to the method described in claim 8, it is characterized in that, the method further includes:Before the vocal print cryptogram-modle of the user is obtained, judge currently to whether there is the vocal print cryptogram-modle of user;If it does not exist, then according to the registration voice signal of user and the vocal print cryptogram-modle of background model structure user.
- A kind of 10. vocal print cipher authentication system, it is characterised in that including:Receiving module, for receiving voice signal input by user;Identification module, for carrying out speech recognition to the voice signal, obtains cryptogram;Determining module, is used to determine whether there is the password trained beforehand through the password voice data gathered offline The corresponding background model of text;Background model acquisition module, for after the determining module determines there are the corresponding background model of the cryptogram, The background model is obtained, after the determining module determines that the corresponding background model of the cryptogram is not present, according to institute State cryptogram to splice the pronunciation unit model that training obtains in advance, obtain the corresponding background mould of the cryptogram Type, pronunciation unit model to training voice data by being trained to obtain;Authentication module, for utilizing vocal print feature sequence, the background model and the sound of the user in the voice signal Line cryptogram-modle is authenticated the user.
- 11. system according to claim 10, it is characterised in that the system also includes:Training module, for instructing in advance Get pronunciation unit model;The training module includes:Voice data acquiring unit, for obtaining trained voice data;First determination unit, for determining pronunciation unit according to the trained voice data;Second determination unit, the topological structure of the acoustic model for determining the pronunciation unit;Parameter training unit, for carrying out parameter training to the acoustic model, obtains pronunciation unit model.
- 12. system according to claim 11, it is characterised in that the acoustic model is GMM model, the background model Acquisition module includes:GMM model acquiring unit, for obtaining the corresponding GMM model of each pronunciation unit in the cryptogram, obtains GMM model Set;First concatenation unit, for the model unit in the GMM model set, being spliced using equal weight, obtaining new group Close GMM model;Weight updating block, for being updated to the Gauss weight of the combination GMM model so that the combination GMM model The sum of Gauss weight be 1, obtain the corresponding background model of the cryptogram.
- 13. system according to claim 11, it is characterised in that the acoustic model is GMM model, the background model Acquisition module includes:GMM model acquiring unit, for obtaining the corresponding GMM model of each pronunciation unit in the cryptogram, obtains GMM model Sequence;Second concatenation unit, after splicing successively at least two model units in the GMM model sequence, using default Redirect certainly probability and it is outer redirect probability and carry out redirecting transfer, obtain the corresponding background model of the cryptogram, wherein, it is described From probability and outer the sum of the probability that redirects is redirected for 1.
- 14. system according to claim 11, it is characterised in that the acoustic model is HMM model, the background model Acquisition module includes:HMM model retrieval unit, for obtaining the corresponding HMM model sequence of each pronunciation unit in the cryptogram;3rd concatenation unit, after splicing successively to the model unit in the HMM model sequence, obtains the cryptogram Corresponding background model.
- 15. system according to claim 10, it is characterised in that the authentication module includes:Computing unit, for calculating first likelihood score of the vocal print feature sequence relative to the vocal print cryptogram-modle respectively, And the vocal print feature sequence is relative to the second likelihood score of the background model;Determination unit, for the ratio and predetermined threshold value according to first likelihood score and second likelihood score, determines to use Whether family is validated user.
- 16. according to claim 10 to 15 any one of them system, it is characterised in that the system also includes:Vocal print feature sequential extraction procedures module, for before speech recognition is carried out to the voice signal or described close obtaining After the corresponding background model of code text, the vocal print feature sequence in the voice signal is extracted.
- 17. according to claim 10 to 15 any one of them system, it is characterised in that the system also includes:Vocal print cryptogram-modle acquisition module, for before speech recognition is carried out to the voice signal or described close obtaining After the corresponding background model of code text, the vocal print cryptogram-modle of the user is obtained.
- 18. system according to claim 17, it is characterised in that the system also includes:Judgment module, for before the vocal print cryptogram-modle of the user is obtained, judging currently with the presence or absence of the vocal print of user Cryptogram-modle;Vocal print cryptogram-modle build module, for the judgment module judge there is currently no vocal print cryptogram-modle after, according to The registration voice signal of user and the vocal print cryptogram-modle of background model structure user.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410005651.XA CN104765996B (en) | 2014-01-06 | 2014-01-06 | Voiceprint password authentication method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410005651.XA CN104765996B (en) | 2014-01-06 | 2014-01-06 | Voiceprint password authentication method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104765996A CN104765996A (en) | 2015-07-08 |
CN104765996B true CN104765996B (en) | 2018-04-27 |
Family
ID=53647821
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410005651.XA Active CN104765996B (en) | 2014-01-06 | 2014-01-06 | Voiceprint password authentication method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104765996B (en) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106373575B (en) * | 2015-07-23 | 2020-07-21 | 阿里巴巴集团控股有限公司 | User voiceprint model construction method, device and system |
CN106782572B (en) * | 2017-01-22 | 2020-04-07 | 清华大学 | Voice password authentication method and system |
CN107068154A (en) * | 2017-03-13 | 2017-08-18 | 平安科技(深圳)有限公司 | The method and system of authentication based on Application on Voiceprint Recognition |
CN107274906A (en) * | 2017-06-28 | 2017-10-20 | 百度在线网络技术(北京)有限公司 | Voice information processing method, device, terminal and storage medium |
GB201801527D0 (en) * | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Method, apparatus and systems for biometric processes |
CN110310647B (en) | 2017-09-29 | 2022-02-25 | 腾讯科技(深圳)有限公司 | Voice identity feature extractor, classifier training method and related equipment |
CN109872721A (en) * | 2017-12-05 | 2019-06-11 | 富士通株式会社 | Voice authentication method, information processing device, and storage medium |
CN109727342A (en) * | 2018-07-06 | 2019-05-07 | 平安科技(深圳)有限公司 | Recognition methods, device, access control system and the storage medium of access control system |
CN111292733A (en) * | 2018-12-06 | 2020-06-16 | 阿里巴巴集团控股有限公司 | Voice interaction method and device |
CN110880327B (en) * | 2019-10-29 | 2024-07-09 | 平安科技(深圳)有限公司 | Audio signal processing method and device |
CN114003878A (en) * | 2021-10-25 | 2022-02-01 | 国电南京自动化股份有限公司 | Double-factor authentication method, device and system based on voice recognition |
TWI876358B (en) * | 2023-05-30 | 2025-03-11 | 兆豐國際商業銀行股份有限公司 | Identity recognition system and method |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1412741A (en) * | 2002-12-13 | 2003-04-23 | 郑方 | Chinese speech identification method with dialect background |
CN1447278A (en) * | 2002-11-15 | 2003-10-08 | 郑方 | Method for recognizing voice print |
CN1652206A (en) * | 2005-04-01 | 2005-08-10 | 郑方 | Sound veins identifying method |
CN102238189A (en) * | 2011-08-01 | 2011-11-09 | 安徽科大讯飞信息科技股份有限公司 | Voiceprint password authentication method and system |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101154380B (en) * | 2006-09-29 | 2011-01-26 | 株式会社东芝 | Method and device for registration and validation of speaker's authentication |
-
2014
- 2014-01-06 CN CN201410005651.XA patent/CN104765996B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1447278A (en) * | 2002-11-15 | 2003-10-08 | 郑方 | Method for recognizing voice print |
CN1412741A (en) * | 2002-12-13 | 2003-04-23 | 郑方 | Chinese speech identification method with dialect background |
CN1652206A (en) * | 2005-04-01 | 2005-08-10 | 郑方 | Sound veins identifying method |
CN102238189A (en) * | 2011-08-01 | 2011-11-09 | 安徽科大讯飞信息科技股份有限公司 | Voiceprint password authentication method and system |
Also Published As
Publication number | Publication date |
---|---|
CN104765996A (en) | 2015-07-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104765996B (en) | Voiceprint password authentication method and system | |
CN108510976B (en) | Multi-language mixed voice recognition method | |
KR102198835B1 (en) | End-to-end speaker recognition using deep neural network | |
CN105336322B (en) | Polyphone model training method, and speech synthesis method and device | |
US10176811B2 (en) | Neural network-based voiceprint information extraction method and apparatus | |
Diez et al. | Speaker Diarization based on Bayesian HMM with Eigenvoice Priors. | |
Larcher et al. | Text-dependent speaker verification: Classifiers, databases and RSR2015 | |
CN102354495B (en) | Test method and system for semi-open oral test questions | |
CN108766415B (en) | A method of voice assessment | |
CN110050302A (en) | Speech synthesis | |
Chen et al. | Pronunciation and silence probability modeling for ASR. | |
CN109313892A (en) | Steady language identification method and system | |
WO2006034200A2 (en) | Method and system for the automatic generation of speech features for scoring high entropy speech | |
CN110415725B (en) | Method and system for evaluating pronunciation quality of second language using first language data | |
Yin et al. | Automatic cognitive load detection from speech features | |
Zhong et al. | DNN i-Vector Speaker Verification with Short, Text-Constrained Test Utterances. | |
Serizel et al. | Deep neural network adaptation for children's and adults' speech recognition | |
CN103594087A (en) | Method and system for improving oral evaluation performance | |
CN109300339A (en) | A kind of exercising method and system of Oral English Practice | |
US20180012602A1 (en) | System and methods for pronunciation analysis-based speaker verification | |
Jyothi et al. | Lexical access experiments with context-dependent articulatory feature-based models | |
KR20220071960A (en) | A method and a TTS system for calculating an encoder score of an attention alignment corresponded to a spectrogram | |
Behravan et al. | Introducing attribute features to foreign accent recognition | |
TW201937479A (en) | Multilingual mixed speech recognition method | |
CN112863476B (en) | Personalized speech synthesis model construction, speech synthesis and test methods and devices |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
EXSB | Decision made by sipo to initiate substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information |
Inventor after: Yin Bing Inventor after: Jin Xin Inventor after: Wei Si Inventor after: Liu Lin Inventor after: Liu Junhua Inventor after: Zhang Chengfeng Inventor after: Wang Jianshe Inventor after: Liu Haibo Inventor after: Hu Guoping Inventor after: Hu Yu Inventor before: Yin Bing Inventor before: Wei Si Inventor before: Hu Guoping Inventor before: Hu Yu |
|
CB03 | Change of inventor or designer information | ||
GR01 | Patent grant | ||
GR01 | Patent grant |