KR101323061B1 - 스피커 인증 방법 및 이 방법을 수행하기 위한 컴퓨터 실행가능 명령어를 갖는 컴퓨터 판독가능 매체 - Google Patents

스피커 인증 방법 및 이 방법을 수행하기 위한 컴퓨터 실행가능 명령어를 갖는 컴퓨터 판독가능 매체 Download PDF

Info

Publication number
KR101323061B1
KR101323061B1 KR1020087020272A KR20087020272A KR101323061B1 KR 101323061 B1 KR101323061 B1 KR 101323061B1 KR 1020087020272 A KR1020087020272 A KR 1020087020272A KR 20087020272 A KR20087020272 A KR 20087020272A KR 101323061 B1 KR101323061 B1 KR 101323061B1
Authority
KR
South Korea
Prior art keywords
speaker
pronunciation
training
mean
test
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
KR1020087020272A
Other languages
English (en)
Korean (ko)
Other versions
KR20080102373A (ko
Inventor
정유 장
밍 리우
Original Assignee
마이크로소프트 코포레이션
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 마이크로소프트 코포레이션 filed Critical 마이크로소프트 코포레이션
Publication of KR20080102373A publication Critical patent/KR20080102373A/ko
Application granted granted Critical
Publication of KR101323061B1 publication Critical patent/KR101323061B1/ko
Assigned to 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 reassignment 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 권리의 전부이전등록 Assignors: 마이크로소프트 코포레이션
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/20Pattern transformations or operations aimed at increasing system robustness, e.g. against channel noise or different working conditions
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/06Decision making techniques; Pattern matching strategies
    • G10L17/08Use of distortion metrics or a particular distance between probe pattern and reference templates

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Collating Specific Patterns (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
KR1020087020272A 2006-02-20 2007-02-13 스피커 인증 방법 및 이 방법을 수행하기 위한 컴퓨터 실행가능 명령어를 갖는 컴퓨터 판독가능 매체 Active KR101323061B1 (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US11/358,302 US7539616B2 (en) 2006-02-20 2006-02-20 Speaker authentication using adapted background models
US11/358,302 2006-02-20
PCT/US2007/004137 WO2007098039A1 (en) 2006-02-20 2007-02-13 Speaker authentication

Publications (2)

Publication Number Publication Date
KR20080102373A KR20080102373A (ko) 2008-11-25
KR101323061B1 true KR101323061B1 (ko) 2013-10-29

Family

ID=38429414

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020087020272A Active KR101323061B1 (ko) 2006-02-20 2007-02-13 스피커 인증 방법 및 이 방법을 수행하기 위한 컴퓨터 실행가능 명령어를 갖는 컴퓨터 판독가능 매체

Country Status (11)

Country Link
US (1) US7539616B2 (https=)
EP (2) EP2410514B1 (https=)
JP (1) JP4876134B2 (https=)
KR (1) KR101323061B1 (https=)
CN (2) CN101385074B (https=)
AU (1) AU2007217884A1 (https=)
CA (2) CA2861876C (https=)
MX (1) MX2008010478A (https=)
NO (1) NO20083580L (https=)
RU (1) RU2008134112A (https=)
WO (1) WO2007098039A1 (https=)

Families Citing this family (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7877255B2 (en) * 2006-03-31 2011-01-25 Voice Signal Technologies, Inc. Speech recognition using channel verification
KR20080090034A (ko) * 2007-04-03 2008-10-08 삼성전자주식회사 음성 화자 인식 방법 및 시스템
AU2012200605B2 (en) * 2008-09-05 2014-01-23 Auraya Pty Ltd Voice authentication system and methods
AU2009290150B2 (en) * 2008-09-05 2011-11-03 Auraya Pty Ltd Voice authentication system and methods
RU2422920C2 (ru) * 2009-02-24 2011-06-27 Государственное образовательное учреждение высшего профессионального образования "Казанский государственный университет им. В.И. Ульянова-Ленина" Способ аутентификации диктора по парольной фразе
RU2422921C2 (ru) * 2009-08-11 2011-06-27 Государственное образовательное учреждение высшего профессионального образования "Казанский государственный университет им. В.И. Ульянова-Ленина" Способ аутентификации диктора по парольной фразе
CN101833951B (zh) * 2010-03-04 2011-11-09 清华大学 用于说话人识别的多背景模型建立方法
US8645136B2 (en) * 2010-07-20 2014-02-04 Intellisist, Inc. System and method for efficiently reducing transcription error using hybrid voice transcription
US9224388B2 (en) * 2011-03-04 2015-12-29 Qualcomm Incorporated Sound recognition method and system
US9159324B2 (en) 2011-07-01 2015-10-13 Qualcomm Incorporated Identifying people that are proximate to a mobile device user via social graphs, speech models, and user context
US9489950B2 (en) * 2012-05-31 2016-11-08 Agency For Science, Technology And Research Method and system for dual scoring for text-dependent speaker verification
US9036890B2 (en) 2012-06-05 2015-05-19 Outerwall Inc. Optical coin discrimination systems and methods for use with consumer-operated kiosks and the like
CN102737633B (zh) * 2012-06-21 2013-12-25 北京华信恒达软件技术有限公司 一种基于张量子空间分析的说话人识别方法及其装置
ES2605779T3 (es) * 2012-09-28 2017-03-16 Agnitio S.L. Reconocimiento de orador
US20140095161A1 (en) * 2012-09-28 2014-04-03 At&T Intellectual Property I, L.P. System and method for channel equalization using characteristics of an unknown signal
US9240184B1 (en) * 2012-11-15 2016-01-19 Google Inc. Frame-level combination of deep neural network and gaussian mixture models
US8739955B1 (en) * 2013-03-11 2014-06-03 Outerwall Inc. Discriminant verification systems and methods for use in coin discrimination
US9443367B2 (en) 2014-01-17 2016-09-13 Outerwall Inc. Digital image coin discrimination for use with consumer-operated kiosks and the like
US9542948B2 (en) 2014-04-09 2017-01-10 Google Inc. Text-dependent speaker identification
US9384738B2 (en) * 2014-06-24 2016-07-05 Google Inc. Dynamic threshold for speaker verification
US9653093B1 (en) * 2014-08-19 2017-05-16 Amazon Technologies, Inc. Generative modeling of speech using neural networks
JP6239471B2 (ja) * 2014-09-19 2017-11-29 株式会社東芝 認証システム、認証装置および認証方法
CN105513588B (zh) * 2014-09-22 2019-06-25 联想(北京)有限公司 一种信息处理方法及电子设备
CN106384587B (zh) * 2015-07-24 2019-11-15 科大讯飞股份有限公司 一种语音识别方法及系统
CN105096941B (zh) * 2015-09-02 2017-10-31 百度在线网络技术(北京)有限公司 语音识别方法以及装置
US10311219B2 (en) * 2016-06-07 2019-06-04 Vocalzoom Systems Ltd. Device, system, and method of user authentication utilizing an optical microphone
US10141009B2 (en) 2016-06-28 2018-11-27 Pindrop Security, Inc. System and method for cluster-based audio event detection
US20180018973A1 (en) * 2016-07-15 2018-01-18 Google Inc. Speaker verification
US9824692B1 (en) 2016-09-12 2017-11-21 Pindrop Security, Inc. End-to-end speaker recognition using deep neural network
WO2018053537A1 (en) 2016-09-19 2018-03-22 Pindrop Security, Inc. Improvements of speaker recognition in the call center
WO2018053531A1 (en) * 2016-09-19 2018-03-22 Pindrop Security, Inc. Dimensionality reduction of baum-welch statistics for speaker recognition
CA3036561C (en) 2016-09-19 2021-06-29 Pindrop Security, Inc. Channel-compensated low-level features for speaker recognition
FR3058558B1 (fr) * 2016-11-07 2020-01-10 Pw Group Procede et systeme d'authentification par biometrie vocale d'un utilisateur
CN106782564B (zh) * 2016-11-18 2018-09-11 百度在线网络技术(北京)有限公司 用于处理语音数据的方法和装置
US10397398B2 (en) 2017-01-17 2019-08-27 Pindrop Security, Inc. Authentication using DTMF tones
US10832683B2 (en) * 2017-11-29 2020-11-10 ILLUMA Labs LLC. System and method for efficient processing of universal background models for speaker recognition
US10950243B2 (en) * 2017-11-29 2021-03-16 ILLUMA Labs Inc. Method for reduced computation of t-matrix training for speaker recognition
US10950244B2 (en) * 2017-11-29 2021-03-16 ILLUMA Labs LLC. System and method for speaker authentication and identification
WO2019129511A1 (en) * 2017-12-26 2019-07-04 Robert Bosch Gmbh Speaker identification with ultra-short speech segments for far and near field voice assistance applications
US11893999B1 (en) * 2018-05-13 2024-02-06 Amazon Technologies, Inc. Speech based user recognition
US10762905B2 (en) * 2018-07-31 2020-09-01 Cirrus Logic, Inc. Speaker verification
US11355103B2 (en) 2019-01-28 2022-06-07 Pindrop Security, Inc. Unsupervised keyword spotting and word discovery for fraud analytics
WO2020163624A1 (en) 2019-02-06 2020-08-13 Pindrop Security, Inc. Systems and methods of gateway detection in a telephone network
WO2020198354A1 (en) 2019-03-25 2020-10-01 Pindrop Security, Inc. Detection of calls from voice assistants
US12015637B2 (en) 2019-04-08 2024-06-18 Pindrop Security, Inc. Systems and methods for end-to-end architectures for voice spoofing detection
CN110379433B (zh) * 2019-08-02 2021-10-08 清华大学 身份验证的方法、装置、计算机设备及存储介质
US11158325B2 (en) * 2019-10-24 2021-10-26 Cirrus Logic, Inc. Voice biometric system
CN111564152B (zh) * 2020-07-16 2020-11-24 北京声智科技有限公司 语音转换方法、装置、电子设备及存储介质
US12482472B2 (en) * 2020-11-11 2025-11-25 Adeia Guides Inc. Systems and methods for detecting a mimicked voice input signal
CN116417001A (zh) * 2023-04-28 2023-07-11 王力安防科技股份有限公司 一种声纹识别方法、装置、终端及存储介质

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5625748A (en) * 1994-04-18 1997-04-29 Bbn Corporation Topic discriminator using posterior probability or confidence scores
US5864810A (en) * 1995-01-20 1999-01-26 Sri International Method and apparatus for speech recognition adapted to an individual speaker
US5839103A (en) * 1995-06-07 1998-11-17 Rutgers, The State University Of New Jersey Speaker verification system using decision fusion logic
US5787394A (en) * 1995-12-13 1998-07-28 International Business Machines Corporation State-dependent speaker clustering for speaker adaptation
WO1998014934A1 (en) * 1996-10-02 1998-04-09 Sri International Method and system for automatic text-independent grading of pronunciation for language instruction
US5897616A (en) * 1997-06-11 1999-04-27 International Business Machines Corporation Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases
US6807537B1 (en) * 1997-12-04 2004-10-19 Microsoft Corporation Mixtures of Bayesian networks
US6141644A (en) * 1998-09-04 2000-10-31 Matsushita Electric Industrial Co., Ltd. Speaker verification and speaker identification based on eigenvoices
DE60109240T2 (de) * 2000-07-05 2006-02-16 Matsushita Electric Industrial Co., Ltd., Kadoma Sprecherverifikation und -erkennung
MXPA03010751A (es) * 2001-05-25 2005-03-07 Dolby Lab Licensing Corp Segmentacion de senales de audio en eventos auditivos.
DE60225190T2 (de) * 2002-04-05 2009-09-10 International Business Machines Corp. Merkmal-basierte audio-inhaltsidentifikation
KR100611562B1 (ko) 2003-09-17 2006-08-11 (주)한국파워보이스 음성 암호를 이용한 컴퓨터 보안 방법

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Douglas A. Reynolds et al., ‘Speaker verification using adapted gaussian mixture models’, Digital Signal Processing, Vol.10, Nos.1-3, pp.19~41, July 2000*
Douglas A. Reynolds et al., 'Speaker verification using adapted gaussian mixture models', Digital Signal Processing, Vol.10, Nos.1-3, pp.19~41, July 2000 *

Also Published As

Publication number Publication date
CA2643481C (en) 2016-01-05
JP2009527798A (ja) 2009-07-30
NO20083580L (no) 2008-09-10
MX2008010478A (es) 2008-10-23
CA2861876A1 (en) 2007-08-30
US20070198257A1 (en) 2007-08-23
CN102646416B (zh) 2014-10-29
AU2007217884A1 (en) 2007-08-30
CA2643481A1 (en) 2007-08-30
RU2008134112A (ru) 2010-02-27
WO2007098039A1 (en) 2007-08-30
CA2861876C (en) 2016-04-26
CN101385074A (zh) 2009-03-11
CN102646416A (zh) 2012-08-22
US7539616B2 (en) 2009-05-26
JP4876134B2 (ja) 2012-02-15
EP1989701A1 (en) 2008-11-12
KR20080102373A (ko) 2008-11-25
EP1989701A4 (en) 2011-06-22
EP2410514A3 (en) 2012-02-22
EP2410514B1 (en) 2013-05-29
EP2410514A2 (en) 2012-01-25
CN101385074B (zh) 2012-08-15
EP1989701B1 (en) 2012-06-27

Similar Documents

Publication Publication Date Title
KR101323061B1 (ko) 스피커 인증 방법 및 이 방법을 수행하기 위한 컴퓨터 실행가능 명령어를 갖는 컴퓨터 판독가능 매체
JP6542386B2 (ja) 話者検証のためのニューラルネットワーク
CN102238189B (zh) 声纹密码认证方法及系统
US8532991B2 (en) Speech models generated using competitive training, asymmetric training, and data boosting
CN104143326B (zh) 一种语音命令识别方法和装置
JP6464650B2 (ja) 音声処理装置、音声処理方法、およびプログラム
US20060009965A1 (en) Method and apparatus for distribution-based language model adaptation
US20090171660A1 (en) Method and apparatus for verification of speaker authentification and system for speaker authentication
US6990446B1 (en) Method and apparatus using spectral addition for speaker recognition
CN108417201A (zh) 单信道多说话人身份识别方法及系统
CN110580897B (zh) 音频校验方法、装置、存储介质及电子设备
CN111933121B (zh) 一种声学模型训练方法及装置
US7509257B2 (en) Method and apparatus for adapting reference templates
US20030171931A1 (en) System for creating user-dependent recognition models and for making those models accessible by a user
KR100764247B1 (ko) 2단계 탐색을 이용한 음성인식 장치 및 그 방법
Kurniawati et al. Speaker dependent activation keyword detector based on GMM-UBM.

Legal Events

Date Code Title Description
PA0105 International application

Patent event date: 20080819

Patent event code: PA01051R01D

Comment text: International Patent Application

PG1501 Laying open of application
A201 Request for examination
PA0201 Request for examination

Patent event code: PA02012R01D

Patent event date: 20120118

Comment text: Request for Examination of Application

E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

Comment text: Notification of reason for refusal

Patent event date: 20130412

Patent event code: PE09021S01D

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

Patent event code: PE07011S01D

Comment text: Decision to Grant Registration

Patent event date: 20130927

GRNT Written decision to grant
PR0701 Registration of establishment

Comment text: Registration of Establishment

Patent event date: 20131022

Patent event code: PR07011E01D

PR1002 Payment of registration fee

Payment date: 20131022

End annual number: 3

Start annual number: 1

PG1601 Publication of registration
FPAY Annual fee payment

Payment date: 20160921

Year of fee payment: 4

PR1001 Payment of annual fee

Payment date: 20160921

Start annual number: 4

End annual number: 4

FPAY Annual fee payment

Payment date: 20170919

Year of fee payment: 5

PR1001 Payment of annual fee

Payment date: 20170919

Start annual number: 5

End annual number: 5

FPAY Annual fee payment

Payment date: 20180918

Year of fee payment: 6

PR1001 Payment of annual fee

Payment date: 20180918

Start annual number: 6

End annual number: 6

FPAY Annual fee payment

Payment date: 20190917

Year of fee payment: 7

PR1001 Payment of annual fee

Payment date: 20190917

Start annual number: 7

End annual number: 7

FPAY Annual fee payment

Payment date: 20200928

Year of fee payment: 8

PR1001 Payment of annual fee

Payment date: 20200928

Start annual number: 8

End annual number: 8

FPAY Annual fee payment

Payment date: 20210915

Year of fee payment: 9

PR1001 Payment of annual fee

Payment date: 20210915

Start annual number: 9

End annual number: 9

FPAY Annual fee payment

Payment date: 20220915

Year of fee payment: 10

PR1001 Payment of annual fee

Payment date: 20220915

Start annual number: 10

End annual number: 10

PR1001 Payment of annual fee

Payment date: 20240925

Start annual number: 12

End annual number: 12