SG11202013135XA - System and method for personalized speaker verification - Google Patents

System and method for personalized speaker verification

Info

Publication number
SG11202013135XA
Authority
SG
Singapore
Prior art keywords
speaker verification
personalized speaker
personalized
verification
speaker
Prior art date
Application number
SG11202013135XA
Inventor
Zhiming Wang
Kaisheng Yao
Xiaolong Li
Original Assignee
Alipay Hangzhou Inf Tech Co Ltd
Priority date
Filing date
Publication date
Application filed by Alipay Hangzhou Inf Tech Co Ltd
Publication of SG11202013135XA

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/04 Training, enrolment or model building
    • G10L17/02 Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • G10L17/06 Decision making techniques; Pattern matching strategies
    • G10L17/18 Artificial neural networks; Connectionist approaches
    • G10L17/20 Pattern transformations or operations aimed at increasing system robustness, e.g. against channel noise or different working conditions
SG11202013135XA 2019-10-31 2020-01-09 System and method for personalized speaker verification SG11202013135XA (en)

Applications Claiming Priority (2)

Application Number Publication Number Priority Date Filing Date Title
PCT/CN2019/114812 WO2020035085A2 (en) 2019-10-31 2019-10-31 System and method for determining voice characteristics
PCT/CN2020/071194 WO2020098828A2 (en) 2019-10-31 2020-01-09 System and method for personalized speaker verification

Publications (1)

Publication Number Publication Date
SG11202013135XA (en) 2021-01-28

Family

ID=69525955

Family Applications (2)

Application Number Publication Number Priority Date Filing Date Title
SG11202010803VA SG11202010803VA (en) 2019-10-31 2019-10-31 System and method for determining voice characteristics
SG11202013135XA SG11202013135XA (en) 2019-10-31 2020-01-09 System and method for personalized speaker verification

Family Applications Before (1)

Application Number Publication Number Priority Date Filing Date Title
SG11202010803VA SG11202010803VA (en) 2019-10-31 2019-10-31 System and method for determining voice characteristics

Country Status (5)

Country Link
US (3) US10997980B2 (en)
CN (2) CN111712874B (en)
SG (2) SG11202010803VA (en)
TW (1) TWI737462B (en)
WO (2) WO2020035085A2 (en)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108806696B (en) * 2018-05-08 2020-06-05 平安科技(深圳)有限公司 Method and device for establishing voiceprint model, computer equipment and storage medium
US11556848B2 (en) * 2019-10-21 2023-01-17 International Business Machines Corporation Resolving conflicts between experts' intuition and data-driven artificial intelligence models
WO2020035085A2 (en) * 2019-10-31 2020-02-20 Alipay (Hangzhou) Information Technology Co., Ltd. System and method for determining voice characteristics
US11651767B2 (en) 2020-03-03 2023-05-16 International Business Machines Corporation Metric learning of speaker diarization
US11443748B2 (en) * 2020-03-03 2022-09-13 International Business Machines Corporation Metric learning of speaker diarization
CN111833855B (en) * 2020-03-16 2024-02-23 南京邮电大学 Multi-to-multi speaker conversion method based on DenseNet STARGAN
CN111540367B (en) * 2020-04-17 2023-03-31 合肥讯飞数码科技有限公司 Voice feature extraction method and device, electronic equipment and storage medium
CN111524525B (en) * 2020-04-28 2023-06-16 平安科技(深圳)有限公司 Voiceprint recognition method, device, equipment and storage medium of original voice
US20220067279A1 (en) * 2020-08-31 2022-03-03 Recruit Co., Ltd. Systems and methods for multilingual sentence embeddings
CN113555032B (en) * 2020-12-22 2024-03-12 腾讯科技(深圳)有限公司 Multi-speaker scene recognition and network training method and device
US11689868B2 (en) * 2021-04-26 2023-06-27 Mun Hoong Leong Machine learning based hearing assistance system
CN113345454B (en) * 2021-06-01 2024-02-09 平安科技(深圳)有限公司 Training and application methods, devices, equipment and storage medium of voice conversion model
TWI795173B (en) * 2022-01-17 2023-03-01 中華電信股份有限公司 Multilingual speech recognition system, method and computer readable medium
CN114529191A (en) * 2022-02-16 2022-05-24 支付宝(杭州)信息技术有限公司 Method and apparatus for risk identification
US20230352029A1 (en) * 2022-05-02 2023-11-02 Tencent America LLC Progressive contrastive learning framework for self-supervised speaker verification
CN115035890B (en) * 2022-06-23 2023-12-05 北京百度网讯科技有限公司 Training method and device of voice recognition model, electronic equipment and storage medium
CN117495571B (en) * 2023-12-28 2024-04-05 北京芯盾时代科技有限公司 Data processing method and device, electronic equipment and storage medium

Family Cites Families (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69322894T2 (en) * 1992-03-02 1999-07-29 At & T Corp Learning method and device for speech recognition
US5640429A (en) * 1995-01-20 1997-06-17 The United States Of America As Represented By The Secretary Of The Air Force Multichannel non-gaussian receiver and method
US6519561B1 (en) 1997-11-03 2003-02-11 T-Netix, Inc. Model adaptation of neural tree networks and other fused models for speaker verification
US6609093B1 (en) * 2000-06-01 2003-08-19 International Business Machines Corporation Methods and apparatus for performing heteroscedastic discriminant analysis in pattern recognition systems
US20030225719A1 (en) * 2002-05-31 2003-12-04 Lucent Technologies, Inc. Methods and apparatus for fast and robust model training for object classification
US9113001B2 (en) * 2005-04-21 2015-08-18 Verint Americas Inc. Systems, methods, and media for disambiguating call data to determine fraud
TWI297487B (en) * 2005-11-18 2008-06-01 Tze Fen Li A method for speech recognition
US9247056B2 (en) * 2007-02-28 2016-01-26 International Business Machines Corporation Identifying contact center agents based upon biometric characteristics of an agent's speech
US7958068B2 (en) * 2007-12-12 2011-06-07 International Business Machines Corporation Method and apparatus for model-shared subspace boosting for multi-label classification
EP2189976B1 (en) * 2008-11-21 2012-10-24 Nuance Communications, Inc. Method for adapting a codebook for speech recognition
FR2940498B1 (en) * 2008-12-23 2011-04-15 Thales Sa METHOD AND SYSTEM FOR AUTHENTICATING A USER AND / OR CRYPTOGRAPHIC DATA
US10032254B2 (en) * 2010-09-28 2018-07-24 MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. Method and device for recovering a digital image from a sequence of observed digital images
US8442823B2 (en) * 2010-10-19 2013-05-14 Motorola Solutions, Inc. Methods for creating and searching a database of speakers
US9679561B2 (en) * 2011-03-28 2017-06-13 Nuance Communications, Inc. System and method for rapid customization of speech recognition models
US9967218B2 (en) * 2011-10-26 2018-05-08 Oath Inc. Online active learning in user-generated content streams
US9042867B2 (en) * 2012-02-24 2015-05-26 Agnitio S.L. System and method for speaker recognition on mobile devices
US8527276B1 (en) * 2012-10-25 2013-09-03 Google Inc. Speech synthesis using deep neural networks
US20140222423A1 (en) * 2013-02-07 2014-08-07 Nuance Communications, Inc. Method and Apparatus for Efficient I-Vector Extraction
US9406298B2 (en) * 2013-02-07 2016-08-02 Nuance Communications, Inc. Method and apparatus for efficient i-vector extraction
CN103310788B (en) * 2013-05-23 2016-03-16 北京云知声信息技术有限公司 Voice information identification method and system
US9514753B2 (en) * 2013-11-04 2016-12-06 Google Inc. Speaker identification using hash-based indexing
US9311932B2 (en) * 2014-01-23 2016-04-12 International Business Machines Corporation Adaptive pause detection in speech recognition
US9542948B2 (en) * 2014-04-09 2017-01-10 Google Inc. Text-dependent speaker identification
US10073985B2 (en) * 2015-02-27 2018-09-11 Samsung Electronics Co., Ltd. Apparatus and method for trusted execution environment file protection
US9687208B2 (en) * 2015-06-03 2017-06-27 iMEDI PLUS Inc. Method and system for recognizing physiological sound
US9978374B2 (en) * 2015-09-04 2018-05-22 Google Llc Neural networks for speaker verification
US10262654B2 (en) * 2015-09-24 2019-04-16 Microsoft Technology Licensing, Llc Detecting actionable items in a conversation among participants
CN107274904A (en) * 2016-04-07 2017-10-20 富士通株式会社 Speaker identification method and speaker identification equipment
CN105869630B (en) * 2016-06-27 2019-08-02 上海交通大学 Speaker's voice spoofing attack detection method and system based on deep learning
US10535000B2 (en) 2016-08-08 2020-01-14 Interactive Intelligence Group, Inc. System and method for speaker change detection
US9824692B1 (en) * 2016-09-12 2017-11-21 Pindrop Security, Inc. End-to-end speaker recognition using deep neural network
CA3036561C (en) * 2016-09-19 2021-06-29 Pindrop Security, Inc. Channel-compensated low-level features for speaker recognition
US10553218B2 (en) 2016-09-19 2020-02-04 Pindrop Security, Inc. Dimensionality reduction of baum-welch statistics for speaker recognition
WO2018106971A1 (en) * 2016-12-07 2018-06-14 Interactive Intelligence Group, Inc. System and method for neural network based speaker classification
US10140980B2 (en) * 2016-12-21 2018-11-27 Google LLC Complex linear projection for acoustic modeling
CN108288470B (en) * 2017-01-10 2021-12-21 富士通株式会社 Voiceprint-based identity verification method and device
CN106991312B (en) * 2017-04-05 2020-01-10 百融云创科技股份有限公司 Internet anti-fraud authentication method based on voiceprint recognition
US11556794B2 (en) * 2017-08-31 2023-01-17 International Business Machines Corporation Facilitating neural networks
US10679129B2 (en) * 2017-09-28 2020-06-09 D5Ai Llc Stochastic categorical autoencoder network
JP6879433B2 (en) * 2017-09-29 2021-06-02 日本電気株式会社 Regression device, regression method, and program
US20190213705A1 (en) * 2017-12-08 2019-07-11 Digimarc Corporation Artwork generated to convey digital messages, and methods/apparatuses for generating such artwork
CN108417217B (en) * 2018-01-11 2021-07-13 思必驰科技股份有限公司 Speaker recognition network model training method, speaker recognition method and system
CN111771213B (en) * 2018-02-16 2021-10-08 杜比实验室特许公司 Speech style migration
US11468316B2 (en) * 2018-03-13 2022-10-11 Recogni Inc. Cluster compression for compressing weights in neural networks
US10347241B1 (en) * 2018-03-23 2019-07-09 Microsoft Technology Licensing, Llc Speaker-invariant training via adversarial learning
CN109065022B (en) * 2018-06-06 2022-08-09 平安科技(深圳)有限公司 Method for extracting i-vector, method, device, equipment and medium for speaker recognition
CN109256139A (en) * 2018-07-26 2019-01-22 广东工业大学 A speaker identification method based on Triplet-Loss
CN110164452B (en) * 2018-10-10 2023-03-10 腾讯科技(深圳)有限公司 Voiceprint recognition method, model training method and server
CN110288978B (en) * 2018-10-25 2022-08-30 腾讯科技(深圳)有限公司 Speech recognition model training method and device
US10510002B1 (en) * 2019-02-14 2019-12-17 Capital One Services, Llc Stochastic gradient boosting for deep neural networks
CN110136729B (en) * 2019-03-27 2021-08-20 北京奇艺世纪科技有限公司 Model generation method, audio processing method, device and computer-readable storage medium
CN109903774A (en) * 2019-04-12 2019-06-18 南京大学 A voiceprint recognition method based on angle separation loss function
US10878575B2 (en) * 2019-04-15 2020-12-29 Adobe Inc. Foreground-aware image inpainting
CN110223699B (en) * 2019-05-15 2021-04-13 桂林电子科技大学 Speaker identity confirmation method, device and storage medium
WO2020035085A2 (en) * 2019-10-31 2020-02-20 Alipay (Hangzhou) Information Technology Co., Ltd. System and method for determining voice characteristics

Also Published As

Publication number Publication date
US20210043216A1 (en) 2021-02-11
CN111712874A (en) 2020-09-25
CN111418009B (en) 2023-09-05
US10997980B2 (en) 2021-05-04
TWI737462B (en) 2021-08-21
WO2020035085A3 (en) 2020-08-20
WO2020098828A2 (en) 2020-05-22
CN111418009A (en) 2020-07-14
WO2020098828A3 (en) 2020-09-03
CN111712874B (en) 2023-07-14
US20210110833A1 (en) 2021-04-15
US11244689B2 (en) 2022-02-08
US11031018B2 (en) 2021-06-08
WO2020035085A2 (en) 2020-02-20
SG11202010803VA (en) 2020-11-27
US20210210101A1 (en) 2021-07-08
TW202119393A (en) 2021-05-16

Similar Documents

Publication Title
SG11202013135XA (en) System and method for personalized speaker verification
HUE051594T2 (en) Method and system for speaker verification
EP3905078A4 (en) Identity verification method and system therefor
SG11202006574PA (en) System and method for decentralized-identifier authentication
SG11202004850PA (en) System and method for blockchain-based cross-entity authentication
EP3719678A4 (en) Identity verification method and apparatus
GB201701137D0 (en) System and method for personalized sound isolation in vehicle audio zones
EP3373202C0 (en) Verification method and system
ZA201904570B (en) System and method for validation of possession-based authentication response
EP3444998A4 (en) Network verification method and associated apparatus and system
GB2567276B (en) Audio control print system and control method
EP3726373C0 (en) Creating an app method and system
EP3811671A4 (en) Method and apparatus for validating stored system information
EP3516856A4 (en) System and method for secure interactive voice response
EP3973421A4 (en) System and method for electronic claim verification
EP3909220C0 (en) System and method for secure detokenization
GB2582952B (en) Audio contribution identification system and method
GB2584251B (en) Method and system for customized content
GB202015498D0 (en) Verification system and method
EP3819770C0 (en) System and method for software verification
GB201916840D0 (en) Voice authentication system and method
SG10201805515QA (en) Method and system for crediting account
GB202020599D0 (en) Blockchain related verification method and system
GB201914863D0 (en) Movement verification system and method
SG10201908221SA (en) System and Method For Providing Personalized Security Services