SG11202013135XA - System and method for personalized speaker verification - Google Patents
System and method for personalized speaker verification
- Publication number
- SG11202013135XA
- Authority
- SG
- Singapore
- Prior art keywords
- speaker verification
- personalized speaker
- personalized
- verification
- speaker
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
- G10L17/06—Decision making techniques; Pattern matching strategies
- G10L17/18—Artificial neural networks; Connectionist approaches
- G10L17/20—Pattern transformations or operations aimed at increasing system robustness, e.g. against channel noise or different working conditions
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2019/114812 WO2020035085A2 (en) | 2019-10-31 | 2019-10-31 | System and method for determining voice characteristics |
PCT/CN2020/071194 WO2020098828A2 (en) | 2019-10-31 | 2020-01-09 | System and method for personalized speaker verification |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11202013135XA (en) | 2021-01-28 |
Family
ID=69525955
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SG11202010803VA SG11202010803VA (en) | 2019-10-31 | 2019-10-31 | System and method for determining voice characteristics |
SG11202013135XA SG11202013135XA (en) | 2019-10-31 | 2020-01-09 | System and method for personalized speaker verification |
Family Applications Before (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SG11202010803VA SG11202010803VA (en) | 2019-10-31 | 2019-10-31 | System and method for determining voice characteristics |
Country Status (5)
Country | Link |
---|---|
US (3) | US10997980B2 (en) |
CN (2) | CN111712874B (en) |
SG (2) | SG11202010803VA (en) |
TW (1) | TWI737462B (en) |
WO (2) | WO2020035085A2 (en) |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108806696B (en) * | 2018-05-08 | 2020-06-05 | 平安科技(深圳)有限公司 | Method and device for establishing voiceprint model, computer equipment and storage medium |
US11556848B2 (en) * | 2019-10-21 | 2023-01-17 | International Business Machines Corporation | Resolving conflicts between experts' intuition and data-driven artificial intelligence models |
WO2020035085A2 (en) * | 2019-10-31 | 2020-02-20 | Alipay (Hangzhou) Information Technology Co., Ltd. | System and method for determining voice characteristics |
US11651767B2 (en) | 2020-03-03 | 2023-05-16 | International Business Machines Corporation | Metric learning of speaker diarization |
US11443748B2 (en) * | 2020-03-03 | 2022-09-13 | International Business Machines Corporation | Metric learning of speaker diarization |
CN111833855B (en) * | 2020-03-16 | 2024-02-23 | 南京邮电大学 | Multi-to-multi speaker conversion method based on DenseNet STARGAN |
CN111540367B (en) * | 2020-04-17 | 2023-03-31 | 合肥讯飞数码科技有限公司 | Voice feature extraction method and device, electronic equipment and storage medium |
CN111524525B (en) * | 2020-04-28 | 2023-06-16 | 平安科技(深圳)有限公司 | Voiceprint recognition method, device, equipment and storage medium of original voice |
US20220067279A1 (en) * | 2020-08-31 | 2022-03-03 | Recruit Co., Ltd. | Systems and methods for multilingual sentence embeddings |
CN113555032B (en) * | 2020-12-22 | 2024-03-12 | 腾讯科技(深圳)有限公司 | Multi-speaker scene recognition and network training method and device |
US11689868B2 (en) * | 2021-04-26 | 2023-06-27 | Mun Hoong Leong | Machine learning based hearing assistance system |
CN113345454B (en) * | 2021-06-01 | 2024-02-09 | 平安科技(深圳)有限公司 | Training and application methods, devices, equipment and storage medium of voice conversion model |
TWI795173B (en) * | 2022-01-17 | 2023-03-01 | 中華電信股份有限公司 | Multilingual speech recognition system, method and computer readable medium |
CN114529191A (en) * | 2022-02-16 | 2022-05-24 | 支付宝(杭州)信息技术有限公司 | Method and apparatus for risk identification |
US20230352029A1 (en) * | 2022-05-02 | 2023-11-02 | Tencent America LLC | Progressive contrastive learning framework for self-supervised speaker verification |
CN115035890B (en) * | 2022-06-23 | 2023-12-05 | 北京百度网讯科技有限公司 | Training method and device of voice recognition model, electronic equipment and storage medium |
CN117495571B (en) * | 2023-12-28 | 2024-04-05 | 北京芯盾时代科技有限公司 | Data processing method and device, electronic equipment and storage medium |
Family Cites Families (55)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE69322894T2 (en) * | 1992-03-02 | 1999-07-29 | AT&T Corp | Learning method and device for speech recognition |
US5640429A (en) * | 1995-01-20 | 1997-06-17 | The United States Of America As Represented By The Secretary Of The Air Force | Multichannel non-gaussian receiver and method |
US6519561B1 (en) | 1997-11-03 | 2003-02-11 | T-Netix, Inc. | Model adaptation of neural tree networks and other fused models for speaker verification |
US6609093B1 (en) * | 2000-06-01 | 2003-08-19 | International Business Machines Corporation | Methods and apparatus for performing heteroscedastic discriminant analysis in pattern recognition systems |
US20030225719A1 (en) * | 2002-05-31 | 2003-12-04 | Lucent Technologies, Inc. | Methods and apparatus for fast and robust model training for object classification |
US9113001B2 (en) * | 2005-04-21 | 2015-08-18 | Verint Americas Inc. | Systems, methods, and media for disambiguating call data to determine fraud |
TWI297487B (en) * | 2005-11-18 | 2008-06-01 | Tze Fen Li | A method for speech recognition |
US9247056B2 (en) * | 2007-02-28 | 2016-01-26 | International Business Machines Corporation | Identifying contact center agents based upon biometric characteristics of an agent's speech |
US7958068B2 (en) * | 2007-12-12 | 2011-06-07 | International Business Machines Corporation | Method and apparatus for model-shared subspace boosting for multi-label classification |
EP2189976B1 (en) * | 2008-11-21 | 2012-10-24 | Nuance Communications, Inc. | Method for adapting a codebook for speech recognition |
FR2940498B1 (en) * | 2008-12-23 | 2011-04-15 | Thales Sa | METHOD AND SYSTEM FOR AUTHENTICATING A USER AND / OR CRYPTOGRAPHIC DATA |
US10032254B2 (en) * | 2010-09-28 | 2018-07-24 | MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. | Method and device for recovering a digital image from a sequence of observed digital images |
US8442823B2 (en) * | 2010-10-19 | 2013-05-14 | Motorola Solutions, Inc. | Methods for creating and searching a database of speakers |
US9679561B2 (en) * | 2011-03-28 | 2017-06-13 | Nuance Communications, Inc. | System and method for rapid customization of speech recognition models |
US9967218B2 (en) * | 2011-10-26 | 2018-05-08 | Oath Inc. | Online active learning in user-generated content streams |
US9042867B2 (en) * | 2012-02-24 | 2015-05-26 | Agnitio S.L. | System and method for speaker recognition on mobile devices |
US8527276B1 (en) * | 2012-10-25 | 2013-09-03 | Google Inc. | Speech synthesis using deep neural networks |
US20140222423A1 (en) * | 2013-02-07 | 2014-08-07 | Nuance Communications, Inc. | Method and Apparatus for Efficient I-Vector Extraction |
US9406298B2 (en) * | 2013-02-07 | 2016-08-02 | Nuance Communications, Inc. | Method and apparatus for efficient i-vector extraction |
CN103310788B (en) * | 2013-05-23 | 2016-03-16 | 北京云知声信息技术有限公司 | Voice information identification method and system |
US9514753B2 (en) * | 2013-11-04 | 2016-12-06 | Google Inc. | Speaker identification using hash-based indexing |
US9311932B2 (en) * | 2014-01-23 | 2016-04-12 | International Business Machines Corporation | Adaptive pause detection in speech recognition |
US9542948B2 (en) * | 2014-04-09 | 2017-01-10 | Google Inc. | Text-dependent speaker identification |
US10073985B2 (en) * | 2015-02-27 | 2018-09-11 | Samsung Electronics Co., Ltd. | Apparatus and method for trusted execution environment file protection |
US9687208B2 (en) * | 2015-06-03 | 2017-06-27 | iMEDI PLUS Inc. | Method and system for recognizing physiological sound |
US9978374B2 (en) * | 2015-09-04 | 2018-05-22 | Google Llc | Neural networks for speaker verification |
US10262654B2 (en) * | 2015-09-24 | 2019-04-16 | Microsoft Technology Licensing, Llc | Detecting actionable items in a conversation among participants |
CN107274904A (en) * | 2016-04-07 | 2017-10-20 | 富士通株式会社 | Speaker recognition method and speaker recognition device |
CN105869630B (en) * | 2016-06-27 | 2019-08-02 | 上海交通大学 | Speaker's voice spoofing attack detection method and system based on deep learning |
US10535000B2 (en) | 2016-08-08 | 2020-01-14 | Interactive Intelligence Group, Inc. | System and method for speaker change detection |
US9824692B1 (en) * | 2016-09-12 | 2017-11-21 | Pindrop Security, Inc. | End-to-end speaker recognition using deep neural network |
CA3036561C (en) * | 2016-09-19 | 2021-06-29 | Pindrop Security, Inc. | Channel-compensated low-level features for speaker recognition |
US10553218B2 (en) | 2016-09-19 | 2020-02-04 | Pindrop Security, Inc. | Dimensionality reduction of baum-welch statistics for speaker recognition |
WO2018106971A1 (en) * | 2016-12-07 | 2018-06-14 | Interactive Intelligence Group, Inc. | System and method for neural network based speaker classification |
US10140980B2 (en) * | 2016-12-21 | 2018-11-27 | Google LLC | Complex linear projection for acoustic modeling |
CN108288470B (en) * | 2017-01-10 | 2021-12-21 | 富士通株式会社 | Voiceprint-based identity verification method and device |
CN106991312B (en) * | 2017-04-05 | 2020-01-10 | 百融云创科技股份有限公司 | Internet anti-fraud authentication method based on voiceprint recognition |
US11556794B2 (en) * | 2017-08-31 | 2023-01-17 | International Business Machines Corporation | Facilitating neural networks |
US10679129B2 (en) * | 2017-09-28 | 2020-06-09 | D5Ai Llc | Stochastic categorical autoencoder network |
JP6879433B2 (en) * | 2017-09-29 | 2021-06-02 | 日本電気株式会社 | Regression device, regression method, and program |
US20190213705A1 (en) * | 2017-12-08 | 2019-07-11 | Digimarc Corporation | Artwork generated to convey digital messages, and methods/apparatuses for generating such artwork |
CN108417217B (en) * | 2018-01-11 | 2021-07-13 | 思必驰科技股份有限公司 | Speaker recognition network model training method, speaker recognition method and system |
CN111771213B (en) * | 2018-02-16 | 2021-10-08 | 杜比实验室特许公司 | Speech style migration |
US11468316B2 (en) * | 2018-03-13 | 2022-10-11 | Recogni Inc. | Cluster compression for compressing weights in neural networks |
US10347241B1 (en) * | 2018-03-23 | 2019-07-09 | Microsoft Technology Licensing, Llc | Speaker-invariant training via adversarial learning |
CN109065022B (en) * | 2018-06-06 | 2022-08-09 | 平安科技(深圳)有限公司 | Method for extracting i-vector, method, device, equipment and medium for speaker recognition |
CN109256139A (en) * | 2018-07-26 | 2019-01-22 | 广东工业大学 | Speaker recognition method based on Triplet-Loss |
CN110164452B (en) * | 2018-10-10 | 2023-03-10 | 腾讯科技(深圳)有限公司 | Voiceprint recognition method, model training method and server |
CN110288978B (en) * | 2018-10-25 | 2022-08-30 | 腾讯科技(深圳)有限公司 | Speech recognition model training method and device |
US10510002B1 (en) * | 2019-02-14 | 2019-12-17 | Capital One Services, Llc | Stochastic gradient boosting for deep neural networks |
CN110136729B (en) * | 2019-03-27 | 2021-08-20 | 北京奇艺世纪科技有限公司 | Model generation method, audio processing method, device and computer-readable storage medium |
CN109903774A (en) * | 2019-04-12 | 2019-06-18 | 南京大学 | Voiceprint recognition method based on an angle separation loss function |
US10878575B2 (en) * | 2019-04-15 | 2020-12-29 | Adobe Inc. | Foreground-aware image inpainting |
CN110223699B (en) * | 2019-05-15 | 2021-04-13 | 桂林电子科技大学 | Speaker identity confirmation method, device and storage medium |
WO2020035085A2 (en) * | 2019-10-31 | 2020-02-20 | Alipay (Hangzhou) Information Technology Co., Ltd. | System and method for determining voice characteristics |
2019
- 2019-10-31 WO PCT/CN2019/114812 patent/WO2020035085A2/en active Application Filing
- 2019-10-31 SG SG11202010803VA patent/SG11202010803VA/en unknown
- 2019-10-31 CN CN201980011206.5A patent/CN111712874B/en active Active
2020
- 2020-01-09 CN CN202080000759.3A patent/CN111418009B/en active Active
- 2020-01-09 WO PCT/CN2020/071194 patent/WO2020098828A2/en active Application Filing
- 2020-01-09 SG SG11202013135XA patent/SG11202013135XA/en unknown
- 2020-08-25 TW TW109128922A patent/TWI737462B/en active
- 2020-10-27 US US17/081,956 patent/US10997980B2/en active Active
- 2020-12-22 US US17/131,182 patent/US11031018B2/en active Active
2021
- 2021-03-22 US US17/208,294 patent/US11244689B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
US20210043216A1 (en) | 2021-02-11 |
CN111712874A (en) | 2020-09-25 |
CN111418009B (en) | 2023-09-05 |
US10997980B2 (en) | 2021-05-04 |
TWI737462B (en) | 2021-08-21 |
WO2020035085A3 (en) | 2020-08-20 |
WO2020098828A2 (en) | 2020-05-22 |
CN111418009A (en) | 2020-07-14 |
WO2020098828A3 (en) | 2020-09-03 |
CN111712874B (en) | 2023-07-14 |
US20210110833A1 (en) | 2021-04-15 |
US11244689B2 (en) | 2022-02-08 |
US11031018B2 (en) | 2021-06-08 |
WO2020035085A2 (en) | 2020-02-20 |
SG11202010803VA (en) | 2020-11-27 |
US20210210101A1 (en) | 2021-07-08 |
TW202119393A (en) | 2021-05-16 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
SG11202013135XA (en) | System and method for personalized speaker verification | |
HUE051594T2 (en) | Method and system for speaker verification | |
EP3905078A4 (en) | Identity verification method and system therefor | |
SG11202006574PA (en) | System and method for decentralized-identifier authentication | |
SG11202004850PA (en) | System and method for blockchain-based cross-entity authentication | |
EP3719678A4 (en) | Identity verification method and apparatus | |
GB201701137D0 (en) | System and method for personalized sound isolation in vehicle audio zones | |
EP3373202C0 (en) | Verification method and system | |
ZA201904570B (en) | System and method for validation of possession-based authentication response | |
EP3444998A4 (en) | Network verification method and associated apparatus and system | |
GB2567276B (en) | Audio control print system and control method | |
EP3726373C0 (en) | Creating an app method and system | |
EP3811671A4 (en) | Method and apparatus for validating stored system information | |
EP3516856A4 (en) | System and method for secure interactive voice response | |
EP3973421A4 (en) | System and method for electronic claim verification | |
EP3909220C0 (en) | System and method for secure detokenization | |
GB2582952B (en) | Audio contribution identification system and method | |
GB2584251B (en) | Method and system for customized content | |
GB202015498D0 (en) | Verification system and method | |
EP3819770C0 (en) | System and method for software verification | |
GB201916840D0 (en) | Voice authentication system and method | |
SG10201805515QA (en) | Method and system for crediting account | |
GB202020599D0 (en) | Blockchain related verification method and system | |
GB201914863D0 (en) | Movement verification system and method | |
SG10201908221SA (en) | System and Method For Providing Personalized Security Services |