CN112102808A - 用于伪造语音的深度神经网络的构建方法及系统 - Google Patents
用于伪造语音的深度神经网络的构建方法及系统 Download PDFInfo
- Publication number
- CN112102808A CN112102808A CN202010863825.1A CN202010863825A CN112102808A CN 112102808 A CN112102808 A CN 112102808A CN 202010863825 A CN202010863825 A CN 202010863825A CN 112102808 A CN112102808 A CN 112102808A
- Authority
- CN
- China
- Prior art keywords
- voice
- module
- voiceprint
- electrically connected
- output end
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 31
- 238000013528 artificial neural network Methods 0.000 title claims abstract description 15
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 38
- 238000012545 processing Methods 0.000 claims abstract description 38
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 38
- 238000012795 verification Methods 0.000 claims abstract description 32
- 239000011664 nicotinic acid Substances 0.000 claims abstract description 17
- 238000005242 forging Methods 0.000 claims abstract description 10
- 238000007781 pre-processing Methods 0.000 claims description 20
- 238000012549 training Methods 0.000 claims description 16
- 238000000605 extraction Methods 0.000 claims description 12
- 230000002194 synthesizing effect Effects 0.000 claims description 7
- 238000012216 screening Methods 0.000 claims description 2
- 238000005516 engineering process Methods 0.000 abstract description 11
- 238000004141 dimensional analysis Methods 0.000 abstract description 3
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 210000001525 retina Anatomy 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/18—Artificial neural networks; Connectionist approaches
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/22—Interactive procedures; Man-machine interfaces
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Lock And Its Accessories (AREA)
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010863825.1A CN112102808A (zh) | 2020-08-25 | 2020-08-25 | 用于伪造语音的深度神经网络的构建方法及系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010863825.1A CN112102808A (zh) | 2020-08-25 | 2020-08-25 | 用于伪造语音的深度神经网络的构建方法及系统 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112102808A true CN112102808A (zh) | 2020-12-18 |
Family
ID=73754321
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010863825.1A Pending CN112102808A (zh) | 2020-08-25 | 2020-08-25 | 用于伪造语音的深度神经网络的构建方法及系统 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112102808A (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114023333A (zh) * | 2021-11-02 | 2022-02-08 | 中国工商银行股份有限公司 | 声纹识别的测试方法、装置、存储介质及电子设备 |
CN115497481A (zh) * | 2022-11-17 | 2022-12-20 | 北京远鉴信息技术有限公司 | 一种虚假语音的识别方法、装置、电子设备及存储介质 |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102708867A (zh) * | 2012-05-30 | 2012-10-03 | 北京正鹰科技有限责任公司 | 一种基于声纹和语音的防录音假冒身份识别方法及系统 |
CN104123932A (zh) * | 2014-07-29 | 2014-10-29 | 科大讯飞股份有限公司 | 一种语音转换系统及方法 |
US20180254046A1 (en) * | 2017-03-03 | 2018-09-06 | Pindrop Security, Inc. | Method and apparatus for detecting spoofing conditions |
CN109147799A (zh) * | 2018-10-18 | 2019-01-04 | 广州势必可赢网络科技有限公司 | 一种语音识别的方法、装置、设备及计算机存储介质 |
CN110136687A (zh) * | 2019-05-20 | 2019-08-16 | 深圳市数字星河科技有限公司 | 一种基于语音训练克隆口音及声韵方法 |
CN111048064A (zh) * | 2020-03-13 | 2020-04-21 | 同盾控股有限公司 | 基于单说话人语音合成数据集的声音克隆方法及装置 |
CN111210803A (zh) * | 2020-04-21 | 2020-05-29 | 南京硅基智能科技有限公司 | 一种基于Bottleneck特征训练克隆音色及韵律的系统及方法 |
CN111223474A (zh) * | 2020-01-15 | 2020-06-02 | 武汉水象电子科技有限公司 | 一种基于多神经网络的语音克隆方法和系统 |
-
2020
- 2020-08-25 CN CN202010863825.1A patent/CN112102808A/zh active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102708867A (zh) * | 2012-05-30 | 2012-10-03 | 北京正鹰科技有限责任公司 | 一种基于声纹和语音的防录音假冒身份识别方法及系统 |
CN104123932A (zh) * | 2014-07-29 | 2014-10-29 | 科大讯飞股份有限公司 | 一种语音转换系统及方法 |
US20180254046A1 (en) * | 2017-03-03 | 2018-09-06 | Pindrop Security, Inc. | Method and apparatus for detecting spoofing conditions |
CN109147799A (zh) * | 2018-10-18 | 2019-01-04 | 广州势必可赢网络科技有限公司 | 一种语音识别的方法、装置、设备及计算机存储介质 |
CN110136687A (zh) * | 2019-05-20 | 2019-08-16 | 深圳市数字星河科技有限公司 | 一种基于语音训练克隆口音及声韵方法 |
CN111223474A (zh) * | 2020-01-15 | 2020-06-02 | 武汉水象电子科技有限公司 | 一种基于多神经网络的语音克隆方法和系统 |
CN111048064A (zh) * | 2020-03-13 | 2020-04-21 | 同盾控股有限公司 | 基于单说话人语音合成数据集的声音克隆方法及装置 |
CN111210803A (zh) * | 2020-04-21 | 2020-05-29 | 南京硅基智能科技有限公司 | 一种基于Bottleneck特征训练克隆音色及韵律的系统及方法 |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114023333A (zh) * | 2021-11-02 | 2022-02-08 | 中国工商银行股份有限公司 | 声纹识别的测试方法、装置、存储介质及电子设备 |
CN115497481A (zh) * | 2022-11-17 | 2022-12-20 | 北京远鉴信息技术有限公司 | 一种虚假语音的识别方法、装置、电子设备及存储介质 |
CN115497481B (zh) * | 2022-11-17 | 2023-03-03 | 北京远鉴信息技术有限公司 | 一种虚假语音的识别方法、装置、电子设备及存储介质 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3327720B1 (en) | User voiceprint model construction method and apparatus | |
US9813551B2 (en) | Multi-party conversation analyzer and logger | |
US8842886B2 (en) | Adaptive tuning of biometric engines | |
CA2549092C (en) | System and method for providing improved claimant authentication | |
US8571867B2 (en) | Method and system for bio-metric voice print authentication | |
CN109346086A (zh) | 声纹识别方法、装置、计算机设备和计算机可读存储介质 | |
EP0779602A2 (en) | Method and apparatus employing audio and video data from an individual for authentication purposes | |
CN109920435B (zh) | 一种声纹识别方法及声纹识别装置 | |
CN108985776A (zh) | 基于多重信息验证的信用卡安全监测方法 | |
CN112102808A (zh) | 用于伪造语音的深度神经网络的构建方法及系统 | |
CN109560941A (zh) | 会议记录方法、装置、智能终端及存储介质 | |
CN103078828A (zh) | 一种云模式的语音鉴权系统 | |
Dimaunahan et al. | MFCC and VQ voice recognition based ATM security for the visually disabled | |
CN112417412A (zh) | 一种银行账户余额查询方法、装置及系统 | |
Zewoudie et al. | The use of audio fingerprints for authentication of speakers on speech operated interfaces | |
Shirvanian et al. | Quantifying the breakability of voice assistants | |
CN110556114B (zh) | 基于注意力机制的通话人识别方法及装置 | |
Goyal et al. | MFRASTA: Voice biometric feature using integration of MFCC and RASTA-PLP | |
CN118487767A (zh) | 一种多方式联合的客户身份认证方法及装置 | |
CN114023334A (zh) | 说话人识别方法、装置、计算机设备和存储介质 | |
JPH09218697A (ja) | 話者検証システム | |
JP2011008544A (ja) | 本人認証装置および本人認証方法 | |
Feustel et al. | Voice-based security: identity verification over telephone lines | |
Kounoudes et al. | Intelligent Speaker Verification based Biometric System for Electronic Commerce Applications |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20210225 Address after: Room A501, Building No. 1588, Lianhai Road, Minhang District, Shanghai 201100 Applicant after: Shanghai Hongzhen Information Science & Technology Co.,Ltd. Applicant after: Nanjing Red array Network Security Technology Research Institute Co.,Ltd. Address before: Room A501, Building No. 1588, Lianhai Road, Minhang District, Shanghai 201100 Applicant before: Shanghai Hongzhen Information Science & Technology Co.,Ltd. Applicant before: Jiangsu pseudo extreme Computing Information Technology Co.,Ltd. |
|
TA01 | Transfer of patent application right | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20201218 |
|
WD01 | Invention patent application deemed withdrawn after publication |