CN111951809A - 多人声纹辨别方法及系统 - Google Patents
多人声纹辨别方法及系统 Download PDFInfo
- Publication number
- CN111951809A CN111951809A CN201910401565.3A CN201910401565A CN111951809A CN 111951809 A CN111951809 A CN 111951809A CN 201910401565 A CN201910401565 A CN 201910401565A CN 111951809 A CN111951809 A CN 111951809A
- Authority
- CN
- China
- Prior art keywords
- frequency domain
- voice
- voice information
- test
- person
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 37
- 238000012360 testing method Methods 0.000 claims abstract description 54
- 238000013145 classification model Methods 0.000 claims abstract description 43
- 238000012549 training Methods 0.000 claims abstract description 38
- 238000006243 chemical reaction Methods 0.000 claims abstract description 22
- 238000013526 transfer learning Methods 0.000 claims description 8
- 230000010365 information processing Effects 0.000 claims description 6
- 238000005516 engineering process Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 238000001228 spectrum Methods 0.000 description 5
- 238000004364 calculation method Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
- G10L17/14—Use of phonemic categorisation or speech recognition prior to speaker recognition or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Game Theory and Decision Science (AREA)
- Business, Economics & Management (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Telephonic Communication Services (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910401565.3A CN111951809B (zh) | 2019-05-14 | 2019-05-14 | 多人声纹辨别方法及系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910401565.3A CN111951809B (zh) | 2019-05-14 | 2019-05-14 | 多人声纹辨别方法及系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111951809A true CN111951809A (zh) | 2020-11-17 |
CN111951809B CN111951809B (zh) | 2024-06-21 |
Family
ID=73336305
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910401565.3A Active CN111951809B (zh) | 2019-05-14 | 2019-05-14 | 多人声纹辨别方法及系统 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111951809B (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113436634A (zh) * | 2021-07-30 | 2021-09-24 | 中国平安人寿保险股份有限公司 | 基于声纹识别的语音分类方法、装置及相关设备 |
CN113555032A (zh) * | 2020-12-22 | 2021-10-26 | 腾讯科技(深圳)有限公司 | 多说话人场景识别及网络训练方法、装置 |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100100376A1 (en) * | 2008-10-17 | 2010-04-22 | International Business Machines Corporation | Visualization interface of continuous waveform multi-speaker identification |
US20170358306A1 (en) * | 2016-06-13 | 2017-12-14 | Alibaba Group Holding Limited | Neural network-based voiceprint information extraction method and apparatus |
CN107610709A (zh) * | 2017-08-01 | 2018-01-19 | 百度在线网络技术(北京)有限公司 | 一种训练声纹识别模型的方法及系统 |
EP3346463A1 (en) * | 2017-01-10 | 2018-07-11 | Fujitsu Limited | Identity verification method and apparatus based on voiceprint |
CN108335699A (zh) * | 2018-01-18 | 2018-07-27 | 浙江大学 | 一种基于动态时间规整和语音活动检测的声纹识别方法 |
CN108648760A (zh) * | 2018-04-17 | 2018-10-12 | 四川长虹电器股份有限公司 | 实时声纹辨识系统与方法 |
CN109524014A (zh) * | 2018-11-29 | 2019-03-26 | 辽宁工业大学 | 一种基于深度卷积神经网络的声纹识别分析方法 |
CN109582822A (zh) * | 2018-10-19 | 2019-04-05 | 百度在线网络技术(北京)有限公司 | 一种基于用户语音的音乐推荐方法及装置 |
WO2019080639A1 (zh) * | 2017-10-23 | 2019-05-02 | 腾讯科技(深圳)有限公司 | 一种对象识别方法、计算机设备及计算机可读存储介质 |
-
2019
- 2019-05-14 CN CN201910401565.3A patent/CN111951809B/zh active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100100376A1 (en) * | 2008-10-17 | 2010-04-22 | International Business Machines Corporation | Visualization interface of continuous waveform multi-speaker identification |
US20170358306A1 (en) * | 2016-06-13 | 2017-12-14 | Alibaba Group Holding Limited | Neural network-based voiceprint information extraction method and apparatus |
EP3346463A1 (en) * | 2017-01-10 | 2018-07-11 | Fujitsu Limited | Identity verification method and apparatus based on voiceprint |
CN107610709A (zh) * | 2017-08-01 | 2018-01-19 | 百度在线网络技术(北京)有限公司 | 一种训练声纹识别模型的方法及系统 |
WO2019080639A1 (zh) * | 2017-10-23 | 2019-05-02 | 腾讯科技(深圳)有限公司 | 一种对象识别方法、计算机设备及计算机可读存储介质 |
CN108335699A (zh) * | 2018-01-18 | 2018-07-27 | 浙江大学 | 一种基于动态时间规整和语音活动检测的声纹识别方法 |
CN108648760A (zh) * | 2018-04-17 | 2018-10-12 | 四川长虹电器股份有限公司 | 实时声纹辨识系统与方法 |
CN109582822A (zh) * | 2018-10-19 | 2019-04-05 | 百度在线网络技术(北京)有限公司 | 一种基于用户语音的音乐推荐方法及装置 |
CN109524014A (zh) * | 2018-11-29 | 2019-03-26 | 辽宁工业大学 | 一种基于深度卷积神经网络的声纹识别分析方法 |
Non-Patent Citations (1)
Title |
---|
丁冬兵;: "TL-CNN-GAP模型下的小样本声纹识别方法研究", 电脑知识与技术, no. 24, pages 177 - 178 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113555032A (zh) * | 2020-12-22 | 2021-10-26 | 腾讯科技(深圳)有限公司 | 多说话人场景识别及网络训练方法、装置 |
CN113555032B (zh) * | 2020-12-22 | 2024-03-12 | 腾讯科技(深圳)有限公司 | 多说话人场景识别及网络训练方法、装置 |
CN113436634A (zh) * | 2021-07-30 | 2021-09-24 | 中国平安人寿保险股份有限公司 | 基于声纹识别的语音分类方法、装置及相关设备 |
CN113436634B (zh) * | 2021-07-30 | 2023-06-20 | 中国平安人寿保险股份有限公司 | 基于声纹识别的语音分类方法、装置及相关设备 |
Also Published As
Publication number | Publication date |
---|---|
CN111951809B (zh) | 2024-06-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107154257B (zh) | 基于客户语音情感的客服服务质量评价方法及系统 | |
CN109036382B (zh) | 一种基于kl散度的音频特征提取方法 | |
CN111429935B (zh) | 一种语音话者分离方法和装置 | |
CN109560941A (zh) | 会议记录方法、装置、智能终端及存储介质 | |
CN113113022A (zh) | 一种基于说话人声纹信息的自动识别身份的方法 | |
CN113823293A (zh) | 一种基于语音增强的说话人识别方法及系统 | |
CN111951809B (zh) | 多人声纹辨别方法及系统 | |
Yudin et al. | Speaker’s voice recognition methods in high-level interference conditions | |
Charisma et al. | Speaker recognition using mel-frequency cepstrum coefficients and sum square error | |
CN113516987B (zh) | 一种说话人识别方法、装置、存储介质及设备 | |
AU2018102038A4 (en) | A Speaker Identification Method Based on DTW Algorithm | |
CN110556114B (zh) | 基于注意力机制的通话人识别方法及装置 | |
CN117612567A (zh) | 基于语音情感识别的家宽装维满意度推理方法及系统 | |
Abushariah et al. | Voice based automatic person identification system using vector quantization | |
Ahmad et al. | The impact of low-pass filter in speaker identification | |
CN114822557A (zh) | 课堂中不同声音的区分方法、装置、设备以及存储介质 | |
CN113838469A (zh) | 一种身份识别方法、系统及存储介质 | |
CN106887229A (zh) | 一种提升声纹识别准确度的方法和系统 | |
CN112634942B (zh) | 一种手机录音原始性的鉴定方法、存储介质及设备 | |
Lee et al. | Robust feature extraction for mobile-based speech emotion recognition system | |
US20230005479A1 (en) | Method for processing an audio stream and corresponding system | |
CN112151070B (zh) | 一种语音检测的方法、装置及电子设备 | |
NISSY et al. | Telephone Voice Speaker Recognition Using Mel Frequency Cepstral Coefficients with Cascaded Feed Forward Neural Network | |
Alamri | Text-independent, automatic speaker recognition system evaluation with males speaking both Arabic and English | |
Ayoub et al. | Investigation of the relation between amount of VoIP speech data and performance in speaker identification task over VoIP networks |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information | ||
CB02 | Change of applicant information |
Country or region after: China Address after: 518000 East Block 1401A12, Tian'an Innovation Technology Plaza (Phase II), No. 2 Tairan 10th Road, Tian'an Community, Shatou Street, Futian District, Shenzhen, Guangdong Province Applicant after: Shenzhen Dongchen Digital Intelligence Technology Co.,Ltd. Address before: 518000, Building 301, C57, Longxiang Mountain Villa, Longxiang North Road, Baishisha Community, Fuyong Street, Bao'an District, Shenzhen City, Guangdong Province Applicant before: Shenzhen Ziwan Technology Co.,Ltd. Country or region before: China |
|
TA01 | Transfer of patent application right |
Effective date of registration: 20240522 Address after: 518000, B09, 2nd Floor, Dongfang Yayuan, Baomin Second Road, Chentian Community, Xixiang Street, Bao'an District, Shenzhen City, Guangdong Province Applicant after: Shenzhen Jintong Technology Co.,Ltd. Country or region after: China Address before: 518000 East Block 1401A12, Tian'an Innovation Technology Plaza (Phase II), No. 2 Tairan 10th Road, Tian'an Community, Shatou Street, Futian District, Shenzhen, Guangdong Province Applicant before: Shenzhen Dongchen Digital Intelligence Technology Co.,Ltd. Country or region before: China |
|
GR01 | Patent grant | ||
GR01 | Patent grant |