ATE407420T1 - Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierung - Google Patents
Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierungInfo
- Publication number
- ATE407420T1 ATE407420T1 AT02702130T AT02702130T ATE407420T1 AT E407420 T1 ATE407420 T1 AT E407420T1 AT 02702130 T AT02702130 T AT 02702130T AT 02702130 T AT02702130 T AT 02702130T AT E407420 T1 ATE407420 T1 AT E407420T1
- Authority
- AT
- Austria
- Prior art keywords
- acoustic feature
- speaker
- recognition system
- feature vector
- feature vectors
- Prior art date
Links
- 239000013598 vector Substances 0.000 title abstract 6
- 230000004048 modification Effects 0.000 title abstract 3
- 238000012986 modification Methods 0.000 title abstract 3
- 230000006978 adaptation Effects 0.000 abstract 2
- 230000001419 dependent effect Effects 0.000 abstract 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Telephonic Communication Services (AREA)
- Image Analysis (AREA)
- Mobile Radio Communication Systems (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Image Processing (AREA)
- Devices For Executing Special Programs (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/773,831 US7024359B2 (en) | 2001-01-31 | 2001-01-31 | Distributed voice recognition system using acoustic feature vector modification |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE407420T1 true ATE407420T1 (de) | 2008-09-15 |
Family
ID=25099445
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT02702130T ATE407420T1 (de) | 2001-01-31 | 2002-01-30 | Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierung |
Country Status (12)
Country | Link |
---|---|
US (1) | US7024359B2 (de) |
EP (1) | EP1356453B1 (de) |
JP (2) | JP4567290B2 (de) |
KR (1) | KR100879410B1 (de) |
CN (1) | CN1284133C (de) |
AT (1) | ATE407420T1 (de) |
AU (1) | AU2002235513A1 (de) |
BR (1) | BR0206836A (de) |
DE (1) | DE60228682D1 (de) |
HK (1) | HK1062738A1 (de) |
TW (1) | TW546633B (de) |
WO (1) | WO2002065453A2 (de) |
Families Citing this family (47)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6487494B2 (en) * | 2001-03-29 | 2002-11-26 | Wingcast, Llc | System and method for reducing the amount of repetitive data sent by a server to a client for vehicle navigation |
US20020143611A1 (en) * | 2001-03-29 | 2002-10-03 | Gilad Odinak | Vehicle parking validation system and method |
US7406421B2 (en) | 2001-10-26 | 2008-07-29 | Intellisist Inc. | Systems and methods for reviewing informational content in a vehicle |
US20050065779A1 (en) * | 2001-03-29 | 2005-03-24 | Gilad Odinak | Comprehensive multiple feature telematics system |
US6885735B2 (en) * | 2001-03-29 | 2005-04-26 | Intellisist, Llc | System and method for transmitting voice input from a remote location over a wireless data channel |
USRE46109E1 (en) | 2001-03-29 | 2016-08-16 | Lg Electronics Inc. | Vehicle navigation system and method |
US7236777B2 (en) | 2002-05-16 | 2007-06-26 | Intellisist, Inc. | System and method for dynamically configuring wireless network geographic coverage or service levels |
US7392191B2 (en) * | 2001-03-29 | 2008-06-24 | Intellisist, Inc. | Method and device to distinguish between voice conversation and automated speech recognition |
US8175886B2 (en) | 2001-03-29 | 2012-05-08 | Intellisist, Inc. | Determination of signal-processing approach based on signal destination characteristics |
CN1409527A (zh) * | 2001-09-13 | 2003-04-09 | 松下电器产业株式会社 | 终端器、服务器及语音辨识方法 |
GB2391679B (en) * | 2002-02-04 | 2004-03-24 | Zentian Ltd | Speech recognition circuit using parallel processors |
US8249880B2 (en) * | 2002-02-14 | 2012-08-21 | Intellisist, Inc. | Real-time display of system instructions |
US8239197B2 (en) * | 2002-03-28 | 2012-08-07 | Intellisist, Inc. | Efficient conversion of voice messages into text |
WO2003084196A1 (en) | 2002-03-28 | 2003-10-09 | Martin Dunsmuir | Closed-loop command and response system for automatic communications between interacting computer systems over an audio communications channel |
TW567465B (en) * | 2002-09-02 | 2003-12-21 | Ind Tech Res Inst | Configurable distributed speech recognition system |
GB0226648D0 (en) * | 2002-11-15 | 2002-12-24 | Koninkl Philips Electronics Nv | Usage data harvesting |
US7533023B2 (en) * | 2003-02-12 | 2009-05-12 | Panasonic Corporation | Intermediary speech processor in network environments transforming customized speech parameters |
DE10353068A1 (de) * | 2003-11-13 | 2005-06-23 | Voice Trust Ag | Verfahren zur Authentifizierung eines Benutzers anhand dessen Stimmprofils |
US20050216266A1 (en) * | 2004-03-29 | 2005-09-29 | Yifan Gong | Incremental adjustment of state-dependent bias parameters for adaptive speech recognition |
US7720012B1 (en) | 2004-07-09 | 2010-05-18 | Arrowhead Center, Inc. | Speaker identification in the presence of packet losses |
GB2418764B (en) * | 2004-09-30 | 2008-04-09 | Fluency Voice Technology Ltd | Improving pattern recognition accuracy with distortions |
US20060095261A1 (en) * | 2004-10-30 | 2006-05-04 | Ibm Corporation | Voice packet identification based on celp compression parameters |
CN1811911B (zh) * | 2005-01-28 | 2010-06-23 | 北京捷通华声语音技术有限公司 | 自适应的语音变换处理方法 |
JP4527679B2 (ja) | 2006-03-24 | 2010-08-18 | 学校法人早稲田大学 | 音声の類似度の評価を行う方法および装置 |
US7725316B2 (en) * | 2006-07-05 | 2010-05-25 | General Motors Llc | Applying speech recognition adaptation in an automated speech recognition system of a telematics-equipped vehicle |
JP4427530B2 (ja) * | 2006-09-21 | 2010-03-10 | 株式会社東芝 | 音声認識装置、プログラムおよび音声認識方法 |
WO2008137616A1 (en) * | 2007-05-04 | 2008-11-13 | Nuance Communications, Inc. | Multi-class constrained maximum likelihood linear regression |
US20090018826A1 (en) * | 2007-07-13 | 2009-01-15 | Berlin Andrew A | Methods, Systems and Devices for Speech Transduction |
US8639510B1 (en) | 2007-12-24 | 2014-01-28 | Kai Yu | Acoustic scoring unit implemented on a single FPGA or ASIC |
US8352265B1 (en) | 2007-12-24 | 2013-01-08 | Edward Lin | Hardware implemented backend search engine for a high-rate speech recognition system |
US8463610B1 (en) | 2008-01-18 | 2013-06-11 | Patrick J. Bourke | Hardware-implemented scalable modular engine for low-power speech recognition |
KR101217525B1 (ko) * | 2008-12-22 | 2013-01-18 | 한국전자통신연구원 | 비터비 디코더와 이를 이용한 음성 인식 방법 |
US9418662B2 (en) * | 2009-01-21 | 2016-08-16 | Nokia Technologies Oy | Method, apparatus and computer program product for providing compound models for speech recognition adaptation |
US8189925B2 (en) * | 2009-06-04 | 2012-05-29 | Microsoft Corporation | Geocoding by image matching |
US8554562B2 (en) * | 2009-11-15 | 2013-10-08 | Nuance Communications, Inc. | Method and system for speaker diarization |
EP2643832A4 (de) * | 2010-11-22 | 2016-10-12 | Listening Methods Llc | System und verfahren zur mustererkennung und -analyse |
US10229701B2 (en) | 2013-02-28 | 2019-03-12 | Nuance Communications, Inc. | Server-side ASR adaptation to speaker, device and noise condition via non-ASR audio transmission |
WO2014133525A1 (en) * | 2013-02-28 | 2014-09-04 | Nuance Communication, Inc. | Server-side asr adaptation to speaker, device and noise condition via non-asr audio transmission |
US9282096B2 (en) | 2013-08-31 | 2016-03-08 | Steven Goldstein | Methods and systems for voice authentication service leveraging networking |
US10405163B2 (en) * | 2013-10-06 | 2019-09-03 | Staton Techiya, Llc | Methods and systems for establishing and maintaining presence information of neighboring bluetooth devices |
US20170092278A1 (en) * | 2015-09-30 | 2017-03-30 | Apple Inc. | Speaker recognition |
IL263655B2 (en) * | 2016-06-14 | 2023-03-01 | Netzer Omry | Automatic speech recognition |
CN106782504B (zh) * | 2016-12-29 | 2019-01-22 | 百度在线网络技术(北京)有限公司 | 语音识别方法和装置 |
EP3719679B1 (de) * | 2019-04-03 | 2021-06-09 | Fondation de L'institut de Recherche Idiap | Verfahren zum schutz biometrischer vorlagen sowie system und verfahren zur überprüfung der identität eines sprechers |
US11545132B2 (en) | 2019-08-28 | 2023-01-03 | International Business Machines Corporation | Speech characterization using a synthesized reference audio signal |
CN118675505A (zh) | 2019-12-04 | 2024-09-20 | 谷歌有限责任公司 | 使用说话者相关语音模型的说话者感知 |
CN113345428B (zh) * | 2021-06-04 | 2023-08-04 | 北京华捷艾米科技有限公司 | 语音识别模型的匹配方法、装置、设备和存储介质 |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4926488A (en) * | 1987-07-09 | 1990-05-15 | International Business Machines Corporation | Normalization of speech by adaptive labelling |
JP2980382B2 (ja) * | 1990-12-19 | 1999-11-22 | 富士通株式会社 | 話者適応音声認識方法および装置 |
JPH06214596A (ja) * | 1993-01-14 | 1994-08-05 | Ricoh Co Ltd | 音声認識装置および話者適応化方法 |
JP3413861B2 (ja) * | 1993-01-18 | 2003-06-09 | ヤマハ株式会社 | 電子楽器の鍵盤装置 |
ZA948426B (en) | 1993-12-22 | 1995-06-30 | Qualcomm Inc | Distributed voice recognition system |
JPH07210190A (ja) | 1993-12-30 | 1995-08-11 | Internatl Business Mach Corp <Ibm> | 音声認識方法及びシステム |
US5864810A (en) * | 1995-01-20 | 1999-01-26 | Sri International | Method and apparatus for speech recognition adapted to an individual speaker |
JP3697748B2 (ja) | 1995-08-21 | 2005-09-21 | セイコーエプソン株式会社 | 端末、音声認識装置 |
JP3001037B2 (ja) | 1995-12-13 | 2000-01-17 | 日本電気株式会社 | 音声認識装置 |
WO1999021172A2 (en) * | 1997-10-20 | 1999-04-29 | Koninklijke Philips Electronics N.V. | Pattern recognition enrolment in a distributed system |
JP2000276188A (ja) * | 1999-03-24 | 2000-10-06 | Sony Corp | 音声認識装置、音声認識方法、音声認識用制御プログラムを記録した記録媒体、通信端末装置、通信方法、音声認識通信の制御用プログラムを記録した記録媒体、サーバ装置、音声認識用データの送受信方法及び音声認識用データの送受信制御プログラムを記録した記録媒体 |
JP3456444B2 (ja) * | 1999-05-10 | 2003-10-14 | 日本電気株式会社 | 音声判定装置及び方法並びに記録媒体 |
US6421641B1 (en) * | 1999-11-12 | 2002-07-16 | International Business Machines Corporation | Methods and apparatus for fast adaptation of a band-quantized speech decoding system |
-
2001
- 2001-01-31 US US09/773,831 patent/US7024359B2/en not_active Expired - Lifetime
-
2002
- 2002-01-30 AU AU2002235513A patent/AU2002235513A1/en not_active Abandoned
- 2002-01-30 KR KR1020037010130A patent/KR100879410B1/ko active IP Right Grant
- 2002-01-30 EP EP02702130A patent/EP1356453B1/de not_active Expired - Lifetime
- 2002-01-30 CN CNB028060687A patent/CN1284133C/zh not_active Expired - Lifetime
- 2002-01-30 DE DE60228682T patent/DE60228682D1/de not_active Expired - Lifetime
- 2002-01-30 TW TW091101575A patent/TW546633B/zh not_active IP Right Cessation
- 2002-01-30 BR BR0206836-2A patent/BR0206836A/pt unknown
- 2002-01-30 AT AT02702130T patent/ATE407420T1/de not_active IP Right Cessation
- 2002-01-30 WO PCT/US2002/003014 patent/WO2002065453A2/en active Application Filing
- 2002-01-30 JP JP2002565298A patent/JP4567290B2/ja not_active Expired - Lifetime
-
2004
- 2004-07-28 HK HK04105572A patent/HK1062738A1/xx not_active IP Right Cessation
-
2009
- 2009-01-14 JP JP2009006033A patent/JP4976432B2/ja not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
TW546633B (en) | 2003-08-11 |
CN1284133C (zh) | 2006-11-08 |
HK1062738A1 (en) | 2004-11-19 |
AU2002235513A1 (en) | 2002-08-28 |
EP1356453B1 (de) | 2008-09-03 |
CN1494712A (zh) | 2004-05-05 |
WO2002065453A2 (en) | 2002-08-22 |
KR100879410B1 (ko) | 2009-01-19 |
WO2002065453A3 (en) | 2002-10-24 |
KR20040062433A (ko) | 2004-07-07 |
DE60228682D1 (de) | 2008-10-16 |
JP4567290B2 (ja) | 2010-10-20 |
US7024359B2 (en) | 2006-04-04 |
US20020103639A1 (en) | 2002-08-01 |
JP4976432B2 (ja) | 2012-07-18 |
JP2009151318A (ja) | 2009-07-09 |
BR0206836A (pt) | 2006-01-17 |
JP2004536330A (ja) | 2004-12-02 |
EP1356453A2 (de) | 2003-10-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE407420T1 (de) | Verteiltes spracherkennungssystem unter verwendung von akustischer merkmalsvektor- modifizierung | |
DE60125542D1 (de) | System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen | |
DE60222249D1 (de) | Spracherkennungsystem mittels impliziter sprecheradaption | |
GB0207343D0 (en) | Signal processing system | |
ATE297588T1 (de) | Anpassung des phonetischen kontextes zur verbesserung der spracherkennung | |
WO2002021513A8 (en) | Combining dtw and hmm in speaker dependent and independent modes for speech recognition | |
WO2006033044A3 (en) | Method of training a robust speaker-dependent speech recognition system with speaker-dependent expressions and robust speaker-dependent speech recognition system | |
AU3164800A (en) | Recognition engines with complementary language models | |
GB2366434A (en) | Selective speaker adaption for an in-vehicle speech recognition system | |
WO2003019528A1 (fr) | Procede de production d'intonation, dispositif de synthese de signaux vocaux fonctionnant selon ledit procede et serveur vocal | |
EP1629464A4 (de) | Spracherkennungssystem und verfahren auf phonetischer basis | |
TW200601263A (en) | Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition | |
WO2007117814A3 (en) | Voice signal perturbation for speech recognition | |
DE60004331D1 (de) | Sprecher-erkennung | |
WO2007005098A3 (en) | Method and apparatus for generating and updating a voice tag | |
DE60008893D1 (de) | Sprachgesteuertes tragbares Endgerät | |
DE69413912D1 (de) | Sprachumsetzungsverfahren | |
DE50003680D1 (de) | Verfahren zur sprachgesteuerten identifizierung des nutzers eines telekommunikationsanschlusses im telekommunikationsnetz beim dialog mit einem sprachgesteuerten dialogsystem | |
Kuhn et al. | Very fast adaptation with a compact context-dependent eigenvoice model | |
DE50106405D1 (de) | Sprachgeführtes gerätesteuerungsverfahren mit einer optimierung für einen benutzer | |
ES2179624T3 (es) | Procedimiento y dispositivo para aumentar la probabilidad de reconocimiento de los sistemas de reconocimiento de voz. | |
KR20210083719A (ko) | 인공지능인형 | |
MXPA05006672A (es) | Sistema y metodo de reconocimiento de voz. | |
Lee et al. | A New Speaker Adaptation Technique using Maximum Model Distance |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |