CN111653289B - Playback voice detection method - Google Patents
Playback voice detection method
- Publication number
- CN111653289B (application CN202010479392.XA)
- Authority
- CN
- China
- Prior art keywords
- speech
- training
- cepstrum coefficient
- features
- layer
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Complex Calculations (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010479392.XA (CN111653289B) | 2020-05-29 | 2020-05-29 | Playback voice detection method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010479392.XA (CN111653289B) | 2020-05-29 | 2020-05-29 | Playback voice detection method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111653289A (en) | 2020-09-11 |
CN111653289B (en) | 2022-12-27 |
Family
ID=72344774
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010479392.XA (Active; CN111653289B) | Playback voice detection method | 2020-05-29 | 2020-05-29 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111653289B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114822587B (en) * | 2021-01-19 | 2023-07-14 | Sichuan University | Audio characteristic compression method based on constant Q transformation |
CN113012684B (en) * | 2021-03-04 | 2022-05-31 | University of Electronic Science and Technology of China | Synthesized voice detection method based on voice segmentation |
CN113096692B (en) * | 2021-03-19 | 2024-05-28 | China Merchants Bank Co., Ltd. | Voice detection method and device, equipment and storage medium |
CN113506583B (en) * | 2021-06-28 | 2024-01-05 | Hangzhou Dianzi University | Camouflage voice detection method using residual error network |
CN113284486B (en) * | 2021-07-26 | 2021-11-16 | Institute of Automation, Chinese Academy of Sciences | Robust voice identification method for environmental countermeasure |
CN113488074B (en) * | 2021-08-20 | 2023-06-23 | Sichuan University | Two-dimensional time-frequency characteristic generation method for detecting synthesized voice |
CN115022087B (en) * | 2022-07-20 | 2024-02-27 | Industrial and Commercial Bank of China Limited | Voice recognition verification processing method and device |
CN117153190B (en) * | 2023-10-27 | 2024-01-19 | Guangdong Polytechnic Normal University | Playback voice detection method based on attention mechanism combination characteristics |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107464568B (en) * | 2017-09-25 | 2020-06-30 | Sichuan Changhong Electric Co., Ltd. | Speaker identification method and system based on three-dimensional convolution neural network text independence |
CN109920447B (en) * | 2019-01-29 | 2021-07-13 | Tianjin University | Recording fraud detection method based on adaptive filter amplitude phase characteristic extraction |
CN109935233A (en) * | 2019-01-29 | 2019-06-25 | Tianjin University | A kind of recording attack detection method based on amplitude and phase information |
- 2020-05-29: application CN202010479392.XA filed in CN; patent CN111653289B, status Active
Also Published As
Publication number | Publication date |
---|---|
CN111653289A (en) | 2020-09-11 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN111653289B (en) | Playback voice detection method | |
CN108369813B (en) | Specific voice recognition method, apparatus and storage medium | |
CN108447495B (en) | Deep learning voice enhancement method based on comprehensive feature set | |
WO2019232829A1 (en) | Voiceprint recognition method and apparatus, computer device and storage medium | |
Kumar et al. | Design of an automatic speaker recognition system using MFCC, vector quantization and LBG algorithm | |
CN108831443B (en) | Mobile recording equipment source identification method based on stacked self-coding network | |
CN110459241B (en) | Method and system for extracting voice features | |
CN106782511A (en) | Amendment linear depth autoencoder network audio recognition method | |
CN111785285A (en) | Voiceprint recognition method for home multi-feature parameter fusion | |
CN111899757B (en) | Single-channel voice separation method and system for target speaker extraction | |
CN109378014A (en) | A kind of mobile device source discrimination and system based on convolutional neural networks | |
WO2019232833A1 (en) | Speech differentiating method and device, computer device and storage medium | |
CN112541533A (en) | Modified vehicle identification method based on neural network and feature fusion | |
CN111048097A (en) | Twin network voiceprint recognition method based on 3D convolution | |
WO2019232867A1 (en) | Voice discrimination method and apparatus, and computer device, and storage medium | |
CN111489763A (en) | Adaptive method for speaker recognition in complex environment based on GMM model | |
CN118486297B (en) | Response method based on voice emotion recognition and intelligent voice assistant system | |
CN112466276A (en) | Speech synthesis system training method and device and readable storage medium | |
CN109300470A (en) | Audio mixing separation method and audio mixing separator | |
CN116778956A (en) | Transformer acoustic feature extraction and fault identification method | |
CN113571095B (en) | Speech emotion recognition method and system based on nested deep neural network | |
CN113516987B (en) | Speaker recognition method, speaker recognition device, storage medium and equipment | |
CN114283835A (en) | Voice enhancement and detection method suitable for actual communication condition | |
CN118173092A (en) | Online customer service platform based on AI voice interaction | |
CN118098247A (en) | Voiceprint recognition method and system based on parallel feature extraction model | |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 2023-08-21. Patentee after: Kong Fanbin (Room 502, No. 3 Pulan 1st Street, Chancheng District, Foshan City, Guangdong Province, 528000). Patentee before: Huzhou Chuangguan Technology Co.,Ltd. (Room 337, Building 3, No. 266, Zhenxing Road, Yuyue Town, Deqing County, Huzhou City, Zhejiang Province, 313000). |
TR01 | Transfer of patent right | Effective date of registration: 2023-08-21. Patentee after: Huzhou Chuangguan Technology Co.,Ltd. (Room 337, Building 3, No. 266, Zhenxing Road, Yuyue Town, Deqing County, Huzhou City, Zhejiang Province, 313000). Patentee before: Ningbo University (818 Fenghua Road, Jiangbei District, Ningbo, Zhejiang, 315211). |