CN105869630B - Speaker's voice spoofing attack detection method and system based on deep learning - Google Patents
- Publication number
- CN105869630B (application CN201610478041.0A)
- Authority
- CN
- China
- Prior art keywords
- neural network
- depth
- voice
- speaker
- audio
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610478041.0A CN105869630B (en) | 2016-06-27 | 2016-06-27 | Speaker's voice spoofing attack detection method and system based on deep learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610478041.0A CN105869630B (en) | 2016-06-27 | 2016-06-27 | Speaker's voice spoofing attack detection method and system based on deep learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105869630A (en) | 2016-08-17 |
CN105869630B (en) | 2019-08-02 |
Family
ID=56655288
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610478041.0A Active CN105869630B (en) | 2016-06-27 | 2016-06-27 | Speaker's voice spoofing attack detection method and system based on deep learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105869630B (en) |
Families Citing this family (55)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108320732A (en) * | 2017-01-13 | 2018-07-24 | 阿里巴巴集团控股有限公司 | Method and apparatus for generating a target speaker's speech recognition computation model |
US20180211403A1 (en) * | 2017-01-20 | 2018-07-26 | Ford Global Technologies, Llc | Recurrent Deep Convolutional Neural Network For Object Detection |
CN106875007A (en) * | 2017-01-25 | 2017-06-20 | 上海交通大学 | End-to-end deep neural network based on convolutional long short-term memory for voice spoofing detection |
CN106991999B (en) * | 2017-03-29 | 2020-06-02 | 北京小米移动软件有限公司 | Voice recognition method and device |
CN107221320A (en) * | 2017-05-19 | 2017-09-29 | 百度在线网络技术(北京)有限公司 | Method, device, equipment, and computer-readable storage medium for training an acoustic feature extraction model |
GB2578386B (en) | 2017-06-27 | 2021-12-01 | Cirrus Logic Int Semiconductor Ltd | Detection of replay attack |
GB201713697D0 (en) | 2017-06-28 | 2017-10-11 | Cirrus Logic Int Semiconductor Ltd | Magnetic detection of replay attack |
GB2563953A (en) | 2017-06-28 | 2019-01-02 | Cirrus Logic Int Semiconductor Ltd | Detection of replay attack |
GB201801528D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Method, apparatus and systems for biometric processes |
GB201801527D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Method, apparatus and systems for biometric processes |
GB201801532D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Methods, apparatus and systems for audio playback |
GB201801530D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Methods, apparatus and systems for authentication |
GB201801526D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Methods, apparatus and systems for authentication |
CN107527616A (en) * | 2017-09-29 | 2017-12-29 | 上海与德通讯技术有限公司 | Intelligent identification method and robot |
GB2567503A (en) | 2017-10-13 | 2019-04-17 | Cirrus Logic Int Semiconductor Ltd | Analysing speech signals |
GB201801664D0 (en) | 2017-10-13 | 2018-03-21 | Cirrus Logic Int Semiconductor Ltd | Detection of liveness |
GB201801663D0 (en) | 2017-10-13 | 2018-03-21 | Cirrus Logic Int Semiconductor Ltd | Detection of liveness |
GB201801661D0 (en) | 2017-10-13 | 2018-03-21 | Cirrus Logic International Uk Ltd | Detection of liveness |
GB201804843D0 (en) | 2017-11-14 | 2018-05-09 | Cirrus Logic Int Semiconductor Ltd | Detection of replay attack |
US10657259B2 (en) * | 2017-11-01 | 2020-05-19 | International Business Machines Corporation | Protecting cognitive systems from gradient based attacks through the use of deceiving gradients |
GB201801659D0 (en) | 2017-11-14 | 2018-03-21 | Cirrus Logic Int Semiconductor Ltd | Detection of loudspeaker playback |
CN108172224B (en) * | 2017-12-19 | 2019-08-27 | 浙江大学 | Machine-learning-based method for defending voice assistants against inaudible voice command control |
CN108417217B (en) * | 2018-01-11 | 2021-07-13 | 思必驰科技股份有限公司 | Speaker recognition network model training method, speaker recognition method and system |
CN108281158A (en) * | 2018-01-12 | 2018-07-13 | 平安科技(深圳)有限公司 | Voice liveness detection method, server, and storage medium based on deep learning |
US11475899B2 (en) | 2018-01-23 | 2022-10-18 | Cirrus Logic, Inc. | Speaker identification |
US11264037B2 (en) | 2018-01-23 | 2022-03-01 | Cirrus Logic, Inc. | Speaker identification |
US11735189B2 (en) | 2018-01-23 | 2023-08-22 | Cirrus Logic, Inc. | Speaker identification |
US11538455B2 (en) | 2018-02-16 | 2022-12-27 | Dolby Laboratories Licensing Corporation | Speech style transfer |
EP3752964B1 (en) * | 2018-02-16 | 2023-06-28 | Dolby Laboratories Licensing Corporation | Speech style transfer |
CN108711436B (en) * | 2018-05-17 | 2020-06-09 | 哈尔滨工业大学 | Speaker verification system replay attack detection method based on high frequency and bottleneck characteristics |
US10692490B2 (en) | 2018-07-31 | 2020-06-23 | Cirrus Logic, Inc. | Detection of replay attack |
CN109165726A (en) * | 2018-08-17 | 2019-01-08 | 联智科技(天津)有限责任公司 | Neural network embedded system for text-independent speaker verification |
US10915614B2 (en) | 2018-08-31 | 2021-02-09 | Cirrus Logic, Inc. | Biometric authentication |
US11037574B2 (en) | 2018-09-05 | 2021-06-15 | Cirrus Logic, Inc. | Speaker recognition and speaker change detection |
CN109065069B (en) | 2018-10-10 | 2020-09-04 | 广州市百果园信息技术有限公司 | Audio detection method, device, equipment and storage medium |
CN109147799A (en) * | 2018-10-18 | 2019-01-04 | 广州势必可赢网络科技有限公司 | Speech recognition method, apparatus, device, and computer storage medium |
CN109394476B (en) * | 2018-12-06 | 2021-01-19 | 上海神添实业有限公司 | Method and system for automatic intention recognition of brain muscle information and intelligent control of upper limbs |
CN109448759A (en) * | 2018-12-28 | 2019-03-08 | 武汉大学 | Detection method against voice authentication spoofing attacks based on pop noise |
CN109767776B (en) * | 2019-01-14 | 2023-12-15 | 广东技术师范大学 | Deception voice detection method based on dense neural network |
CN109920447B (en) * | 2019-01-29 | 2021-07-13 | 天津大学 | Recording fraud detection method based on adaptive filter amplitude phase characteristic extraction |
CN110110732B (en) * | 2019-05-08 | 2020-04-28 | 杭州视在科技有限公司 | Intelligent inspection method for catering kitchen |
CN110348189A (en) * | 2019-06-17 | 2019-10-18 | 五邑大学 | Identity spoofing detection method, system, device, and storage medium |
CN110491391B (en) * | 2019-07-02 | 2021-09-17 | 厦门大学 | Deception voice detection method based on deep neural network |
CN110335591A (en) * | 2019-07-04 | 2019-10-15 | 广州云从信息科技有限公司 | Parameter management method, device, machine-readable medium, and equipment |
CN110414536B (en) * | 2019-07-17 | 2022-03-25 | 北京得意音通技术有限责任公司 | Playback detection method, storage medium, and electronic device |
CN110349586B (en) * | 2019-07-23 | 2022-05-13 | 北京邮电大学 | Telecommunication fraud detection method and device |
CN110827837B (en) * | 2019-10-18 | 2022-02-22 | 中山大学 | Whale activity audio classification method based on deep learning |
SG11202010803VA (en) | 2019-10-31 | 2020-11-27 | Alipay Hangzhou Inf Tech Co Ltd | System and method for determining voice characteristics |
CN111028852A (en) * | 2019-11-06 | 2020-04-17 | 杭州哲信信息技术有限公司 | Noise removing method in intelligent calling system based on CNN |
CN111243621A (en) * | 2020-01-14 | 2020-06-05 | 四川大学 | Construction method of GRU-SVM deep learning model for synthetic speech detection |
CN111327608B (en) * | 2020-02-14 | 2021-02-02 | 中南大学 | Application layer malicious request detection method and system based on cascade deep neural network |
CN111755014B (en) * | 2020-07-02 | 2022-06-03 | 四川长虹电器股份有限公司 | Domain-adaptive replay attack detection method and system |
CN113362822B (en) * | 2021-06-08 | 2022-09-30 | 北京计算机技术及应用研究所 | Black-box adversarial voice sample generation method with auditory masking |
CN113641980A (en) * | 2021-08-23 | 2021-11-12 | 北京百度网讯科技有限公司 | Authentication method and apparatus, electronic device, and medium |
CN113555023B (en) * | 2021-09-18 | 2022-01-11 | 中国科学院自动化研究所 | Method for joint modeling of voice authentication and speaker recognition |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102436810A (en) * | 2011-10-26 | 2012-05-02 | 华南理工大学 | Record replay attack detection method and system based on channel mode noise |
CN104954532A (en) * | 2015-06-19 | 2015-09-30 | 深圳天珑无线科技有限公司 | Voice recognition method, voice recognition device and mobile terminal |
CN105139857A (en) * | 2015-09-02 | 2015-12-09 | 广东顺德中山大学卡内基梅隆大学国际联合研究院 | Countermeasure method against voice spoofing for automatic speaker verification |
Non-Patent Citations (1)
Title |
---|
Using Deep Learning for Detecting Spoofing Attacks on Speech Signals; Alan Godoy et al.; arXiv; 2016-01-19; pp. 1-5 |
Also Published As
Publication number | Publication date |
---|---|
CN105869630A (en) | 2016-08-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105869630B (en) | Speaker's voice spoofing attack detection method and system based on deep learning | |
CN104732978B (en) | Text-dependent speaker recognition method based on joint deep learning | |
CN110491391B (en) | Deception voice detection method based on deep neural network | |
CN108231067A (en) | Acoustic scene recognition method based on convolutional neural networks and random forest classification | |
CN102968990B (en) | Speaker identifying method and system | |
CN107886943A (en) | Voiceprint recognition method and device | |
CN109584884A (en) | Voice identity feature extractor, classifier training method, and related device | |
Tapkir et al. | Novel spectral root cepstral features for replay spoof detection | |
CN110428843A (en) | Deep learning method for voice gender identification | |
CN110211604A (en) | Deep residual network structure for voice transformation detection | |
CN106531174A (en) | Animal sound recognition method based on wavelet packet decomposition and spectrogram features | |
CN104978507A (en) | Intelligent well logging evaluation expert system identity authentication method based on voiceprint recognition | |
CN107784215B (en) | User authentication method and system using lip reading based on the audio unit of an intelligent terminal | |
CN111816185A (en) | Method and device for identifying speaker in mixed voice | |
CN111611566B (en) | Speaker verification system and replay attack detection method thereof | |
Gomez-Alanis et al. | Performance evaluation of front-and back-end techniques for ASV spoofing detection systems based on deep features | |
CN111613240A (en) | Camouflage voice detection method based on attention mechanism and Bi-LSTM | |
Gautam et al. | Biometric system from heart sound using wavelet based feature set | |
WO2022268183A1 (en) | Video-based random gesture authentication method and system | |
CN107274912A (en) | Device source identification method for mobile phone recordings | |
Sailor et al. | Unsupervised Representation Learning Using Convolutional Restricted Boltzmann Machine for Spoof Speech Detection. | |
CN111785262B (en) | Speaker age and gender classification method based on residual network and fused features | |
Islam et al. | Neural-Response-Based Text-Dependent speaker identification under noisy conditions | |
Purnapatra et al. | Longitudinal study of voice recognition in children | |
Neelima et al. | Spoofing detection and countermeasure is automatic speaker verification system using dynamic features |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
Effective date of registration: 2020-06-17; Address after: Room 105G, 199 GuoShoujing Road, Pudong New Area, Shanghai, 200120; Patentee after: Shanghai Jiaotong University Intellectual Property Management Co.,Ltd.; Address before: 200240 Dongchuan Road, Shanghai, No. 800, No.; Patentee before: Shanghai Jiao Tong University |
TR01 | Transfer of patent right | ||
Effective date of registration: 2020-10-28; Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu; Patentee after: AI SPEECH Ltd.; Address before: Room 105G, 199 GuoShoujing Road, Pudong New Area, Shanghai, 200120; Patentee before: Shanghai Jiaotong University Intellectual Property Management Co.,Ltd. |
CP01 | Change in the name or title of a patent holder | ||
Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu; Patentee after: Sipic Technology Co.,Ltd.; Address before: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu; Patentee before: AI SPEECH Ltd. |