ATE407420T1 - Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierung - Google Patents

Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierung

Info

Publication number
ATE407420T1
ATE407420T1 AT02702130T AT02702130T ATE407420T1 AT E407420 T1 ATE407420 T1 AT E407420T1 AT 02702130 T AT02702130 T AT 02702130T AT 02702130 T AT02702130 T AT 02702130T AT E407420 T1 ATE407420 T1 AT E407420T1
Authority
AT
Austria
Prior art keywords
acoustic feature
speaker
recognition system
feature vector
feature vectors
Prior art date
Application number
AT02702130T
Other languages
English (en)
Inventor
Chienchung Chang
Naren Malayath
Byron Yafuso
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Application granted granted Critical
Publication of ATE407420T1 publication Critical patent/ATE407420T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Telephonic Communication Services (AREA)
  • Image Analysis (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Image Processing (AREA)
  • Devices For Executing Special Programs (AREA)
AT02702130T 2001-01-31 2002-01-30 Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierung ATE407420T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US09/773,831 US7024359B2 (en) 2001-01-31 2001-01-31 Distributed voice recognition system using acoustic feature vector modification

Publications (1)

Publication Number Publication Date
ATE407420T1 true ATE407420T1 (de) 2008-09-15

Family

ID=25099445

Family Applications (1)

Application Number Title Priority Date Filing Date
AT02702130T ATE407420T1 (de) 2001-01-31 2002-01-30 Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierung

Country Status (12)

Country Link
US (1) US7024359B2 (de)
EP (1) EP1356453B1 (de)
JP (2) JP4567290B2 (de)
KR (1) KR100879410B1 (de)
CN (1) CN1284133C (de)
AT (1) ATE407420T1 (de)
AU (1) AU2002235513A1 (de)
BR (1) BR0206836A (de)
DE (1) DE60228682D1 (de)
HK (1) HK1062738A1 (de)
TW (1) TW546633B (de)
WO (1) WO2002065453A2 (de)

Families Citing this family (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6487494B2 (en) * 2001-03-29 2002-11-26 Wingcast, Llc System and method for reducing the amount of repetitive data sent by a server to a client for vehicle navigation
US20020143611A1 (en) * 2001-03-29 2002-10-03 Gilad Odinak Vehicle parking validation system and method
US7406421B2 (en) 2001-10-26 2008-07-29 Intellisist Inc. Systems and methods for reviewing informational content in a vehicle
US20050065779A1 (en) * 2001-03-29 2005-03-24 Gilad Odinak Comprehensive multiple feature telematics system
US6885735B2 (en) * 2001-03-29 2005-04-26 Intellisist, Llc System and method for transmitting voice input from a remote location over a wireless data channel
USRE46109E1 (en) 2001-03-29 2016-08-16 Lg Electronics Inc. Vehicle navigation system and method
US7236777B2 (en) 2002-05-16 2007-06-26 Intellisist, Inc. System and method for dynamically configuring wireless network geographic coverage or service levels
US7392191B2 (en) * 2001-03-29 2008-06-24 Intellisist, Inc. Method and device to distinguish between voice conversation and automated speech recognition
US8175886B2 (en) 2001-03-29 2012-05-08 Intellisist, Inc. Determination of signal-processing approach based on signal destination characteristics
CN1409527A (zh) * 2001-09-13 2003-04-09 松下电器产业株式会社 终端器、服务器及语音辨识方法
GB2391679B (en) * 2002-02-04 2004-03-24 Zentian Ltd Speech recognition circuit using parallel processors
US8249880B2 (en) * 2002-02-14 2012-08-21 Intellisist, Inc. Real-time display of system instructions
US8239197B2 (en) * 2002-03-28 2012-08-07 Intellisist, Inc. Efficient conversion of voice messages into text
WO2003084196A1 (en) 2002-03-28 2003-10-09 Martin Dunsmuir Closed-loop command and response system for automatic communications between interacting computer systems over an audio communications channel
TW567465B (en) * 2002-09-02 2003-12-21 Ind Tech Res Inst Configurable distributed speech recognition system
GB0226648D0 (en) * 2002-11-15 2002-12-24 Koninkl Philips Electronics Nv Usage data harvesting
US7533023B2 (en) * 2003-02-12 2009-05-12 Panasonic Corporation Intermediary speech processor in network environments transforming customized speech parameters
DE10353068A1 (de) * 2003-11-13 2005-06-23 Voice Trust Ag Verfahren zur Authentifizierung eines Benutzers anhand dessen Stimmprofils
US20050216266A1 (en) * 2004-03-29 2005-09-29 Yifan Gong Incremental adjustment of state-dependent bias parameters for adaptive speech recognition
US7720012B1 (en) 2004-07-09 2010-05-18 Arrowhead Center, Inc. Speaker identification in the presence of packet losses
GB2418764B (en) * 2004-09-30 2008-04-09 Fluency Voice Technology Ltd Improving pattern recognition accuracy with distortions
US20060095261A1 (en) * 2004-10-30 2006-05-04 Ibm Corporation Voice packet identification based on celp compression parameters
CN1811911B (zh) * 2005-01-28 2010-06-23 北京捷通华声语音技术有限公司 自适应的语音变换处理方法
JP4527679B2 (ja) 2006-03-24 2010-08-18 学校法人早稲田大学 音声の類似度の評価を行う方法および装置
US7725316B2 (en) * 2006-07-05 2010-05-25 General Motors Llc Applying speech recognition adaptation in an automated speech recognition system of a telematics-equipped vehicle
JP4427530B2 (ja) * 2006-09-21 2010-03-10 株式会社東芝 音声認識装置、プログラムおよび音声認識方法
WO2008137616A1 (en) * 2007-05-04 2008-11-13 Nuance Communications, Inc. Multi-class constrained maximum likelihood linear regression
US20090018826A1 (en) * 2007-07-13 2009-01-15 Berlin Andrew A Methods, Systems and Devices for Speech Transduction
US8639510B1 (en) 2007-12-24 2014-01-28 Kai Yu Acoustic scoring unit implemented on a single FPGA or ASIC
US8352265B1 (en) 2007-12-24 2013-01-08 Edward Lin Hardware implemented backend search engine for a high-rate speech recognition system
US8463610B1 (en) 2008-01-18 2013-06-11 Patrick J. Bourke Hardware-implemented scalable modular engine for low-power speech recognition
KR101217525B1 (ko) * 2008-12-22 2013-01-18 한국전자통신연구원 비터비 디코더와 이를 이용한 음성 인식 방법
US9418662B2 (en) * 2009-01-21 2016-08-16 Nokia Technologies Oy Method, apparatus and computer program product for providing compound models for speech recognition adaptation
US8189925B2 (en) * 2009-06-04 2012-05-29 Microsoft Corporation Geocoding by image matching
US8554562B2 (en) * 2009-11-15 2013-10-08 Nuance Communications, Inc. Method and system for speaker diarization
EP2643832A4 (de) * 2010-11-22 2016-10-12 Listening Methods Llc System und verfahren zur mustererkennung und -analyse
US10229701B2 (en) 2013-02-28 2019-03-12 Nuance Communications, Inc. Server-side ASR adaptation to speaker, device and noise condition via non-ASR audio transmission
WO2014133525A1 (en) * 2013-02-28 2014-09-04 Nuance Communication, Inc. Server-side asr adaptation to speaker, device and noise condition via non-asr audio transmission
US9282096B2 (en) 2013-08-31 2016-03-08 Steven Goldstein Methods and systems for voice authentication service leveraging networking
US10405163B2 (en) * 2013-10-06 2019-09-03 Staton Techiya, Llc Methods and systems for establishing and maintaining presence information of neighboring bluetooth devices
US20170092278A1 (en) * 2015-09-30 2017-03-30 Apple Inc. Speaker recognition
IL263655B2 (en) * 2016-06-14 2023-03-01 Netzer Omry Automatic speech recognition
CN106782504B (zh) * 2016-12-29 2019-01-22 百度在线网络技术(北京)有限公司 语音识别方法和装置
EP3719679B1 (de) * 2019-04-03 2021-06-09 Fondation de L'institut de Recherche Idiap Verfahren zum schutz biometrischer vorlagen sowie system und verfahren zur überprüfung der identität eines sprechers
US11545132B2 (en) 2019-08-28 2023-01-03 International Business Machines Corporation Speech characterization using a synthesized reference audio signal
CN118675505A (zh) 2019-12-04 2024-09-20 谷歌有限责任公司 使用说话者相关语音模型的说话者感知
CN113345428B (zh) * 2021-06-04 2023-08-04 北京华捷艾米科技有限公司 语音识别模型的匹配方法、装置、设备和存储介质

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4926488A (en) * 1987-07-09 1990-05-15 International Business Machines Corporation Normalization of speech by adaptive labelling
JP2980382B2 (ja) * 1990-12-19 1999-11-22 富士通株式会社 話者適応音声認識方法および装置
JPH06214596A (ja) * 1993-01-14 1994-08-05 Ricoh Co Ltd 音声認識装置および話者適応化方法
JP3413861B2 (ja) * 1993-01-18 2003-06-09 ヤマハ株式会社 電子楽器の鍵盤装置
ZA948426B (en) 1993-12-22 1995-06-30 Qualcomm Inc Distributed voice recognition system
JPH07210190A (ja) 1993-12-30 1995-08-11 Internatl Business Mach Corp <Ibm> 音声認識方法及びシステム
US5864810A (en) * 1995-01-20 1999-01-26 Sri International Method and apparatus for speech recognition adapted to an individual speaker
JP3697748B2 (ja) 1995-08-21 2005-09-21 セイコーエプソン株式会社 端末、音声認識装置
JP3001037B2 (ja) 1995-12-13 2000-01-17 日本電気株式会社 音声認識装置
WO1999021172A2 (en) * 1997-10-20 1999-04-29 Koninklijke Philips Electronics N.V. Pattern recognition enrolment in a distributed system
JP2000276188A (ja) * 1999-03-24 2000-10-06 Sony Corp 音声認識装置、音声認識方法、音声認識用制御プログラムを記録した記録媒体、通信端末装置、通信方法、音声認識通信の制御用プログラムを記録した記録媒体、サーバ装置、音声認識用データの送受信方法及び音声認識用データの送受信制御プログラムを記録した記録媒体
JP3456444B2 (ja) * 1999-05-10 2003-10-14 日本電気株式会社 音声判定装置及び方法並びに記録媒体
US6421641B1 (en) * 1999-11-12 2002-07-16 International Business Machines Corporation Methods and apparatus for fast adaptation of a band-quantized speech decoding system

Also Published As

Publication number Publication date
TW546633B (en) 2003-08-11
CN1284133C (zh) 2006-11-08
HK1062738A1 (en) 2004-11-19
AU2002235513A1 (en) 2002-08-28
EP1356453B1 (de) 2008-09-03
CN1494712A (zh) 2004-05-05
WO2002065453A2 (en) 2002-08-22
KR100879410B1 (ko) 2009-01-19
WO2002065453A3 (en) 2002-10-24
KR20040062433A (ko) 2004-07-07
DE60228682D1 (de) 2008-10-16
JP4567290B2 (ja) 2010-10-20
US7024359B2 (en) 2006-04-04
US20020103639A1 (en) 2002-08-01
JP4976432B2 (ja) 2012-07-18
JP2009151318A (ja) 2009-07-09
BR0206836A (pt) 2006-01-17
JP2004536330A (ja) 2004-12-02
EP1356453A2 (de) 2003-10-29

Similar Documents

Publication Publication Date Title
ATE407420T1 (de) Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierung
DE60125542D1 (de) System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen
DE60222249D1 (de) Spracherkennungsystem mittels impliziter sprecheradaption
GB0207343D0 (en) Signal processing system
ATE297588T1 (de) Anpassung des phonetischen kontextes zur verbesserung der spracherkennung
WO2002021513A8 (en) Combining dtw and hmm in speaker dependent and independent modes for speech recognition
WO2006033044A3 (en) Method of training a robust speaker-dependent speech recognition system with speaker-dependent expressions and robust speaker-dependent speech recognition system
AU3164800A (en) Recognition engines with complementary language models
GB2366434A (en) Selective speaker adaption for an in-vehicle speech recognition system
WO2003019528A1 (fr) Procede de production d&#39;intonation, dispositif de synthese de signaux vocaux fonctionnant selon ledit procede et serveur vocal
EP1629464A4 (de) Spracherkennungssystem und verfahren auf phonetischer basis
TW200601263A (en) Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition
WO2007117814A3 (en) Voice signal perturbation for speech recognition
DE60004331D1 (de) Sprecher-erkennung
WO2007005098A3 (en) Method and apparatus for generating and updating a voice tag
DE60008893D1 (de) Sprachgesteuertes tragbares Endgerät
DE69413912D1 (de) Sprachumsetzungsverfahren
DE50003680D1 (de) Verfahren zur sprachgesteuerten identifizierung des nutzers eines telekommunikationsanschlusses im telekommunikationsnetz beim dialog mit einem sprachgesteuerten dialogsystem
Kuhn et al. Very fast adaptation with a compact context-dependent eigenvoice model
DE50106405D1 (de) Sprachgeführtes gerätesteuerungsverfahren mit einer optimierung für einen benutzer
ES2179624T3 (es) Procedimiento y dispositivo para aumentar la probabilidad de reconocimiento de los sistemas de reconocimiento de voz.
KR20210083719A (ko) 인공지능인형
MXPA05006672A (es) Sistema y metodo de reconocimiento de voz.
Lee et al. A New Speaker Adaptation Technique using Maximum Model Distance

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties