CN110223676A - The optimization method and system of deception recording detection neural network model - Google Patents
The optimization method and system of deception recording detection neural network model Download PDFInfo
- Publication number
- CN110223676A CN110223676A CN201910516188.8A CN201910516188A CN110223676A CN 110223676 A CN110223676 A CN 110223676A CN 201910516188 A CN201910516188 A CN 201910516188A CN 110223676 A CN110223676 A CN 110223676A
- Authority
- CN
- China
- Prior art keywords
- data
- domain
- feature extractor
- deception
- loss function
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 116
- 238000003062 neural network model Methods 0.000 title claims abstract description 46
- 238000000034 method Methods 0.000 title claims abstract description 40
- 238000005457 optimization Methods 0.000 title claims abstract description 32
- 238000012549 training Methods 0.000 claims abstract description 59
- 230000009977 dual effect Effects 0.000 claims abstract description 19
- 230000009467 reduction Effects 0.000 claims abstract description 19
- 230000006870 function Effects 0.000 claims description 47
- 238000003860 storage Methods 0.000 claims description 14
- 230000015654 memory Effects 0.000 claims description 10
- 238000005070 sampling Methods 0.000 claims description 6
- 238000004891 communication Methods 0.000 claims description 5
- 230000008901 benefit Effects 0.000 claims description 4
- 238000000605 extraction Methods 0.000 claims description 4
- 238000010276 construction Methods 0.000 claims description 3
- 238000004590 computer program Methods 0.000 claims description 2
- 238000005303 weighing Methods 0.000 claims 2
- 238000012360 testing method Methods 0.000 abstract description 14
- 238000005520 cutting process Methods 0.000 abstract description 9
- 238000013528 artificial neural network Methods 0.000 description 8
- 230000000694 effects Effects 0.000 description 8
- 238000010586 diagram Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 241001269238 Data Species 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 238000013507 mapping Methods 0.000 description 4
- 238000013527 convolutional neural network Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 230000004907 flux Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000010295 mobile communication Methods 0.000 description 2
- 210000004218 nerve net Anatomy 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 230000003042 antagnostic effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000013136 deep learning model Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 210000005036 nerve Anatomy 0.000 description 1
- 238000013077 scoring method Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000003313 weakening effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910516188.8A CN110223676A (en) | 2019-06-14 | 2019-06-14 | The optimization method and system of deception recording detection neural network model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910516188.8A CN110223676A (en) | 2019-06-14 | 2019-06-14 | The optimization method and system of deception recording detection neural network model |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110223676A true CN110223676A (en) | 2019-09-10 |
Family
ID=67817331
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910516188.8A Pending CN110223676A (en) | 2019-06-14 | 2019-06-14 | The optimization method and system of deception recording detection neural network model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110223676A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112735381A (en) * | 2020-12-29 | 2021-04-30 | 四川虹微技术有限公司 | Model updating method and device |
CN113284508A (en) * | 2021-07-21 | 2021-08-20 | 中国科学院自动化研究所 | Hierarchical differentiation based generated audio detection system |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106875007A (en) * | 2017-01-25 | 2017-06-20 | 上海交通大学 | End-to-end deep neural network is remembered based on convolution shot and long term for voice fraud detection |
US20180082689A1 (en) * | 2016-09-19 | 2018-03-22 | Pindrop Security, Inc. | Speaker recognition in the call center |
CN107944410A (en) * | 2017-12-01 | 2018-04-20 | 中国科学院重庆绿色智能技术研究院 | A kind of cross-cutting facial characteristics analytic method based on convolutional neural networks |
CN108141363A (en) * | 2015-10-15 | 2018-06-08 | 诺基亚技术有限公司 | For the device of certification, method and computer program product |
CN108198561A (en) * | 2017-12-13 | 2018-06-22 | 宁波大学 | A kind of pirate recordings speech detection method based on convolutional neural networks |
US20180254046A1 (en) * | 2017-03-03 | 2018-09-06 | Pindrop Security, Inc. | Method and apparatus for detecting spoofing conditions |
US20180374487A1 (en) * | 2017-06-27 | 2018-12-27 | Cirrus Logic International Semiconductor Ltd. | Detection of replay attack |
CN109754812A (en) * | 2019-01-30 | 2019-05-14 | 华南理工大学 | A kind of voiceprint authentication method of the anti-recording attack detecting based on convolutional neural networks |
US20190180742A1 (en) * | 2017-12-08 | 2019-06-13 | Google Llc | Digital assistant processing of stacked data structures |
-
2019
- 2019-06-14 CN CN201910516188.8A patent/CN110223676A/en active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108141363A (en) * | 2015-10-15 | 2018-06-08 | 诺基亚技术有限公司 | For the device of certification, method and computer program product |
US20180082689A1 (en) * | 2016-09-19 | 2018-03-22 | Pindrop Security, Inc. | Speaker recognition in the call center |
CN106875007A (en) * | 2017-01-25 | 2017-06-20 | 上海交通大学 | End-to-end deep neural network is remembered based on convolution shot and long term for voice fraud detection |
US20180254046A1 (en) * | 2017-03-03 | 2018-09-06 | Pindrop Security, Inc. | Method and apparatus for detecting spoofing conditions |
US20180374487A1 (en) * | 2017-06-27 | 2018-12-27 | Cirrus Logic International Semiconductor Ltd. | Detection of replay attack |
CN107944410A (en) * | 2017-12-01 | 2018-04-20 | 中国科学院重庆绿色智能技术研究院 | A kind of cross-cutting facial characteristics analytic method based on convolutional neural networks |
US20190180742A1 (en) * | 2017-12-08 | 2019-06-13 | Google Llc | Digital assistant processing of stacked data structures |
CN108198561A (en) * | 2017-12-13 | 2018-06-22 | 宁波大学 | A kind of pirate recordings speech detection method based on convolutional neural networks |
CN109754812A (en) * | 2019-01-30 | 2019-05-14 | 华南理工大学 | A kind of voiceprint authentication method of the anti-recording attack detecting based on convolutional neural networks |
Non-Patent Citations (4)
Title |
---|
HIMAWAN I 等: "Deep domain adaptation for anti-spoofing in speaker verification systems", 《COMPUTER SPEECH & LANGUAGE》 * |
WANG H 等: "Cross-Domain Replay Spoofing Attack Detection Using Domain Adversarial Training", 《INTERSPEECH. 2019》 * |
WANG Q 等: "Unsupervised Domain Adaptation via Domain Adversarial Training for Speaker Recognition", 《2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)》 * |
徐涌钞: "基于高频和瓶颈特征的说话人验证系统重放攻击检测方法", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112735381A (en) * | 2020-12-29 | 2021-04-30 | 四川虹微技术有限公司 | Model updating method and device |
CN112735381B (en) * | 2020-12-29 | 2022-09-27 | 四川虹微技术有限公司 | Model updating method and device |
CN113284508A (en) * | 2021-07-21 | 2021-08-20 | 中国科学院自动化研究所 | Hierarchical differentiation based generated audio detection system |
CN113284508B (en) * | 2021-07-21 | 2021-11-09 | 中国科学院自动化研究所 | Hierarchical differentiation based generated audio detection system |
US11763836B2 (en) | 2021-07-21 | 2023-09-19 | Institute Of Automation, Chinese Academy Of Sciences | Hierarchical generated audio detection system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109637546B (en) | Knowledge distillation method and apparatus | |
CN110246487A (en) | Optimization method and system for single pass speech recognition modeling | |
CN107924682A (en) | Neutral net for speaker verification | |
CN110473569A (en) | Detect the optimization method and system of speaker's spoofing attack | |
CN111835784B (en) | Data generalization method and system for replay attack detection system | |
CN108766445A (en) | Method for recognizing sound-groove and system | |
CN108109613A (en) | For the audio training of Intelligent dialogue voice platform and recognition methods and electronic equipment | |
Yang et al. | Modified magnitude-phase spectrum information for spoofing detection | |
CN104902012B (en) | The method and singing contest system of singing contest are carried out by network | |
CN109584884A (en) | A kind of speech identity feature extractor, classifier training method and relevant device | |
CN108986798B (en) | Processing method, device and the equipment of voice data | |
CN108711336A (en) | A kind of piano performance points-scoring system and its method | |
CN110223676A (en) | The optimization method and system of deception recording detection neural network model | |
CN109976998A (en) | A kind of Software Defects Predict Methods, device and electronic equipment | |
CN108091326A (en) | A kind of method for recognizing sound-groove and system based on linear regression | |
CN108877783A (en) | The method and apparatus for determining the audio types of audio data | |
CN110008984A (en) | A kind of object module training method and device based on multitask sample | |
CN106991312A (en) | Internet based on Application on Voiceprint Recognition is counter to cheat authentication method | |
Cáceres et al. | The Biometric Vox system for the ASVspoof 2021 challenge | |
Shi et al. | Semi-supervised acoustic event detection based on tri-training | |
CN108417207A (en) | A kind of depth mixing generation network self-adapting method and system | |
CN110223678A (en) | Audio recognition method and system | |
CN108932646A (en) | User tag verification method, device and electronic equipment based on operator | |
CN111147871B (en) | Singing recognition method and device in live broadcast room, server and storage medium | |
Kawa et al. | Attack agnostic dataset: Towards generalization and stabilization of audio deepfake detection |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200616 Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant after: AI SPEECH Co.,Ltd. Applicant after: Shanghai Jiaotong University Intellectual Property Management Co.,Ltd. Address before: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant before: AI SPEECH Co.,Ltd. Applicant before: SHANGHAI JIAO TONG University |
|
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20201026 Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant after: AI SPEECH Co.,Ltd. Address before: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant before: AI SPEECH Co.,Ltd. Applicant before: Shanghai Jiaotong University Intellectual Property Management Co.,Ltd. |
|
TA01 | Transfer of patent application right | ||
CB02 | Change of applicant information |
Address after: 215123 building 14, Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province Applicant after: Sipic Technology Co.,Ltd. Address before: 215123 building 14, Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province Applicant before: AI SPEECH Co.,Ltd. |
|
CB02 | Change of applicant information | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190910 |
|
RJ01 | Rejection of invention patent application after publication |