ES2028863T3 - Reconocedor de voz instruido por el oredor. - Google Patents
Reconocedor de voz instruido por el oredor.Info
- Publication number
- ES2028863T3 ES2028863T3 ES198787302309T ES87302309T ES2028863T3 ES 2028863 T3 ES2028863 T3 ES 2028863T3 ES 198787302309 T ES198787302309 T ES 198787302309T ES 87302309 T ES87302309 T ES 87302309T ES 2028863 T3 ES2028863 T3 ES 2028863T3
- Authority
- ES
- Spain
- Prior art keywords
- word
- vocabulary
- speaker
- instructed
- order
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Machine Translation (AREA)
- Image Analysis (AREA)
- Electrically Operated Instructional Devices (AREA)
- Character Discrimination (AREA)
Abstract
DURANTE UNA SECUENCIA DE FORMACION, UN RECONOCEDOR DE VOZ ENTRENADO POR EL ORADOR, DETECTA Y SEÑALA AL ORADOR CUANTOS PARES DE PALABRAS DEL VOCABULARIO SON POTENCIALMENTE CONFUSAS AL RECONOCEDOR. CADA PALABRA DEL VOCABULARIO SE CONVIERTE (106) EN PARAMETROS REPRESENTANDO UN MODELO DE REFERENCIA PRODETERMINADO DE CADA PALABRA. LA SEÑAL CARACTERISTICA DE UNA PALABARA POTENCIAL SE COMPARA CON EL MODELO DE REFERENCIA DE CADA PALABRA DEL VOCABULARIO PREVIAMENTE ALMACENADA EN LA MEMORIA DEL RECONOCEDOR (105). AL ORADOR SE LE SEÑALA (107) CUANDO LA PALABRA DEL VOCABULARIO POTENCIAL ES CONFUSAMENTE SIMILAR A UNA DE LAS PALABRAS DEL VOCABULARIO EXISTENTE.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US84196886A | 1986-03-25 | 1986-03-25 |
Publications (1)
Publication Number | Publication Date |
---|---|
ES2028863T3 true ES2028863T3 (es) | 1992-07-16 |
Family
ID=25286208
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
ES198787302309T Expired - Lifetime ES2028863T3 (es) | 1986-03-25 | 1987-03-18 | Reconocedor de voz instruido por el oredor. |
Country Status (7)
Country | Link |
---|---|
US (1) | US4972485A (es) |
EP (1) | EP0241163B1 (es) |
JP (1) | JPS62231997A (es) |
KR (1) | KR970001165B1 (es) |
CA (1) | CA1311059C (es) |
DE (1) | DE3775963D1 (es) |
ES (1) | ES2028863T3 (es) |
Families Citing this family (55)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5315689A (en) * | 1988-05-27 | 1994-05-24 | Kabushiki Kaisha Toshiba | Speech recognition system having word-based and phoneme-based recognition means |
US5465378A (en) * | 1990-05-15 | 1995-11-07 | Compuspeak, Inc. | Report generating system |
ES2128390T3 (es) * | 1992-03-02 | 1999-05-16 | At & T Corp | Metodo de adiestramiento y dispositivo para reconocimiento de voz. |
US6101468A (en) * | 1992-11-13 | 2000-08-08 | Dragon Systems, Inc. | Apparatuses and methods for training and operating speech recognition systems |
US5452397A (en) * | 1992-12-11 | 1995-09-19 | Texas Instruments Incorporated | Method and system for preventing entry of confusingly similar phases in a voice recognition system vocabulary list |
DE69425564D1 (de) * | 1993-03-12 | 2000-09-21 | Stanford Res Inst Int | Verfahren und vorrichtung für sprachunterricht mittels interaktiver sprachsteuerung |
US5465317A (en) * | 1993-05-18 | 1995-11-07 | International Business Machines Corporation | Speech recognition system with improved rejection of words and sounds not in the system vocabulary |
US5737723A (en) * | 1994-08-29 | 1998-04-07 | Lucent Technologies Inc. | Confusable word detection in speech recognition |
US5903864A (en) * | 1995-08-30 | 1999-05-11 | Dragon Systems | Speech recognition |
US5937383A (en) * | 1996-02-02 | 1999-08-10 | International Business Machines Corporation | Apparatus and methods for speech recognition including individual or speaker class dependent decoding history caches for fast word acceptance or rejection |
US5754977A (en) * | 1996-03-06 | 1998-05-19 | Intervoice Limited Partnership | System and method for preventing enrollment of confusable patterns in a reference database |
US5842161A (en) * | 1996-06-25 | 1998-11-24 | Lucent Technologies Inc. | Telecommunications instrument employing variable criteria speech recognition |
US6151575A (en) * | 1996-10-28 | 2000-11-21 | Dragon Systems, Inc. | Rapid adaptation of speech models |
US5829000A (en) * | 1996-10-31 | 1998-10-27 | Microsoft Corporation | Method and system for correcting misrecognized spoken words or phrases |
US5950160A (en) * | 1996-10-31 | 1999-09-07 | Microsoft Corporation | Method and system for displaying a variable number of alternative words during speech recognition |
US5899976A (en) * | 1996-10-31 | 1999-05-04 | Microsoft Corporation | Method and system for buffering recognized words during speech recognition |
US5884258A (en) * | 1996-10-31 | 1999-03-16 | Microsoft Corporation | Method and system for editing phrases during continuous speech recognition |
EP0920692B1 (en) * | 1996-12-24 | 2003-03-26 | Cellon France SAS | A method for training a speech recognition system and an apparatus for practising the method, in particular, a portable telephone apparatus |
US6212498B1 (en) | 1997-03-28 | 2001-04-03 | Dragon Systems, Inc. | Enrollment in speech recognition |
US6012027A (en) * | 1997-05-27 | 2000-01-04 | Ameritech Corporation | Criteria for usable repetitions of an utterance during speech reference enrollment |
US7630895B2 (en) * | 2000-01-21 | 2009-12-08 | At&T Intellectual Property I, L.P. | Speaker verification method |
US6490561B1 (en) * | 1997-06-25 | 2002-12-03 | Dennis L. Wilson | Continuous speech voice transcription |
FR2769118B1 (fr) * | 1997-09-29 | 1999-12-03 | Matra Communication | Procede de reconnaissance de parole |
DE19804047C2 (de) * | 1998-02-03 | 2000-03-16 | Deutsche Telekom Mobil | Verfahren und Einrichtung zur Erhöhung der Erkennungswahrscheinlichkeit von Spracherkennungssystemen |
US6163768A (en) | 1998-06-15 | 2000-12-19 | Dragon Systems, Inc. | Non-interactive enrollment in speech recognition |
US7266498B1 (en) * | 1998-12-18 | 2007-09-04 | Intel Corporation | Method and apparatus for reducing conflicts between speech-enabled applications sharing speech menu |
GB9920257D0 (en) * | 1999-08-26 | 1999-10-27 | Canon Kk | Signal processing system |
US7047196B2 (en) | 2000-06-08 | 2006-05-16 | Agiletv Corporation | System and method of voice recognition near a wireline node of a network supporting cable television and/or video delivery |
US8095370B2 (en) | 2001-02-16 | 2012-01-10 | Agiletv Corporation | Dual compression voice recordation non-repudiation system |
US7013276B2 (en) * | 2001-10-05 | 2006-03-14 | Comverse, Inc. | Method of assessing degree of acoustic confusability, and system therefor |
GB2385698B (en) * | 2002-02-26 | 2005-06-15 | Canon Kk | Speech processing apparatus and method |
AU2003283391A1 (en) * | 2002-11-13 | 2004-06-03 | Bernd Schonebeck | Voice processing system, method for allocating acoustic and/or written character strings to words or lexical entries |
US20070055520A1 (en) * | 2005-08-31 | 2007-03-08 | Microsoft Corporation | Incorporation of speech engine training into interactive user tutorial |
US7844456B2 (en) * | 2007-03-09 | 2010-11-30 | Microsoft Corporation | Grammar confusability metric for speech recognition |
US20130325447A1 (en) * | 2012-05-31 | 2013-12-05 | Elwha LLC, a limited liability corporation of the State of Delaware | Speech recognition adaptation systems based on adaptation data |
US10431235B2 (en) | 2012-05-31 | 2019-10-01 | Elwha Llc | Methods and systems for speech adaptation data |
US20130325449A1 (en) | 2012-05-31 | 2013-12-05 | Elwha Llc | Speech recognition adaptation systems based on adaptation data |
US10395672B2 (en) | 2012-05-31 | 2019-08-27 | Elwha Llc | Methods and systems for managing adaptation data |
DK2713367T3 (en) * | 2012-09-28 | 2017-02-20 | Agnitio S L | Speech Recognition |
US9684437B2 (en) * | 2013-07-12 | 2017-06-20 | II Michael L. Thornton | Memorization system and method |
US10121466B2 (en) * | 2015-02-11 | 2018-11-06 | Hand Held Products, Inc. | Methods for training a speech recognition system |
EP3067886A1 (en) * | 2015-03-09 | 2016-09-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
CN107301862A (zh) * | 2016-04-01 | 2017-10-27 | 北京搜狗科技发展有限公司 | 一种语音识别方法、识别模型建立方法、装置及电子设备 |
US10141009B2 (en) | 2016-06-28 | 2018-11-27 | Pindrop Security, Inc. | System and method for cluster-based audio event detection |
US9824692B1 (en) | 2016-09-12 | 2017-11-21 | Pindrop Security, Inc. | End-to-end speaker recognition using deep neural network |
CA3117645C (en) | 2016-09-19 | 2023-01-03 | Pindrop Security, Inc. | Channel-compensated low-level features for speaker recognition |
US10325601B2 (en) | 2016-09-19 | 2019-06-18 | Pindrop Security, Inc. | Speaker recognition in the call center |
US10553218B2 (en) * | 2016-09-19 | 2020-02-04 | Pindrop Security, Inc. | Dimensionality reduction of baum-welch statistics for speaker recognition |
US10397398B2 (en) | 2017-01-17 | 2019-08-27 | Pindrop Security, Inc. | Authentication using DTMF tones |
US10586537B2 (en) * | 2017-11-30 | 2020-03-10 | International Business Machines Corporation | Filtering directive invoking vocal utterances |
WO2020159917A1 (en) | 2019-01-28 | 2020-08-06 | Pindrop Security, Inc. | Unsupervised keyword spotting and word discovery for fraud analytics |
WO2020163624A1 (en) | 2019-02-06 | 2020-08-13 | Pindrop Security, Inc. | Systems and methods of gateway detection in a telephone network |
WO2020198354A1 (en) | 2019-03-25 | 2020-10-01 | Pindrop Security, Inc. | Detection of calls from voice assistants |
US12015637B2 (en) | 2019-04-08 | 2024-06-18 | Pindrop Security, Inc. | Systems and methods for end-to-end architectures for voice spoofing detection |
CN115206299B (zh) * | 2022-09-15 | 2022-11-11 | 成都启英泰伦科技有限公司 | 一种基于命令词语音识别的易混淆词防误识别方法 |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3333248A (en) * | 1963-12-20 | 1967-07-25 | Ibm | Self-adaptive systems |
US3548202A (en) * | 1968-11-29 | 1970-12-15 | Ibm | Adaptive logic system for unsupervised learning |
US3816722A (en) * | 1970-09-29 | 1974-06-11 | Nippon Electric Co | Computer for calculating the similarity between patterns and pattern recognition system comprising the similarity computer |
US4297528A (en) * | 1979-09-10 | 1981-10-27 | Interstate Electronics Corp. | Training circuit for audio signal recognition computer |
US4348553A (en) * | 1980-07-02 | 1982-09-07 | International Business Machines Corporation | Parallel pattern verifier with dynamic time warping |
CH644246B (fr) * | 1981-05-15 | 1900-01-01 | Asulab Sa | Dispositif d'introduction de mots a commande par la parole. |
US4499596A (en) * | 1982-06-28 | 1985-02-12 | International Business Machines Corporation | Adaptive facsimile compression using a dynamic extendable decision network |
US4587670A (en) * | 1982-10-15 | 1986-05-06 | At&T Bell Laboratories | Hidden Markov model speech recognition arrangement |
US4618984A (en) * | 1983-06-08 | 1986-10-21 | International Business Machines Corporation | Adaptive automatic discrete utterance recognition |
JPS60218698A (ja) * | 1984-04-16 | 1985-11-01 | 日本電気株式会社 | 音声認識装置 |
JPH0792673B2 (ja) * | 1984-10-02 | 1995-10-09 | 株式会社東芝 | 認識用辞書学習方法 |
US4718094A (en) * | 1984-11-19 | 1988-01-05 | International Business Machines Corp. | Speech recognition system |
-
1987
- 1987-02-26 CA CA000530682A patent/CA1311059C/en not_active Expired - Fee Related
- 1987-03-18 DE DE8787302309T patent/DE3775963D1/de not_active Expired - Fee Related
- 1987-03-18 EP EP87302309A patent/EP0241163B1/en not_active Expired - Lifetime
- 1987-03-18 ES ES198787302309T patent/ES2028863T3/es not_active Expired - Lifetime
- 1987-03-24 KR KR1019870002681A patent/KR970001165B1/ko active IP Right Grant
- 1987-03-25 JP JP62069264A patent/JPS62231997A/ja active Pending
-
1989
- 1989-05-23 US US07/356,589 patent/US4972485A/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
US4972485A (en) | 1990-11-20 |
KR970001165B1 (ko) | 1997-01-29 |
JPS62231997A (ja) | 1987-10-12 |
DE3775963D1 (de) | 1992-02-27 |
EP0241163A1 (en) | 1987-10-14 |
KR870009322A (ko) | 1987-10-26 |
EP0241163B1 (en) | 1992-01-15 |
CA1311059C (en) | 1992-12-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ES2028863T3 (es) | Reconocedor de voz instruido por el oredor. | |
DE68912397D1 (de) | Spracherkennung mit Sprecheranpassung durch Lernprozess. | |
ATE176546T1 (de) | Sprachgesteuerte dienstleistung | |
DE69330427T2 (de) | Spracherkennungssystem für sprachen mit zusammengesetzten wörtern | |
EP0313975A3 (en) | Design and construction of a binary-tree system for language modelling | |
ATE109582T1 (de) | Sprachprozessor. | |
DE3876207D1 (de) | Spracherkennungssystem unter verwendung von markov-modellen. | |
DE3773039D1 (de) | Spracherkennungssystem unter verwendung von markov-modellen. | |
ATE3800T1 (de) | Schaltungsanordnung fuer eine fernsprechteilnehmerstation. | |
ES2139112T3 (es) | Reconocimiento del habla basado en hmms. | |
DE3774605D1 (de) | Spracherkennungssystem. | |
DE3273358D1 (en) | Recognition of speech or speech-like sounds using associative memory | |
DE3853702D1 (de) | Spracherkennung. | |
FR1301743A (fr) | Système de machine à écrire phonétique | |
ATE48486T1 (de) | Schluesselworterkennungssystem unter anwendung eines sprachmusterverkettungsmodels. | |
JPS6419399A (en) | Voice recognition equipment | |
JPS6440898A (en) | Voice recognition equipment | |
JPS56160200A (en) | Hearing aid | |
JPS6481999A (en) | Phoneme string conversion system | |
DE69026795D1 (de) | Spracherkennungssystem mit Spracheintragungsfunktion, die auf zwei Äusserungen von jedem Wort basiert ist | |
JPS6428699A (en) | Continuous voice recognition system | |
JPS6442700A (en) | Voice recognition equipment | |
JPS6428697A (en) | Voice recognition equipment | |
JPS6438798A (en) | Word voice recognition equipment | |
von Raffler-Engel | Investigation of Italo-American Bilinguals |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG2A | Definitive protection |
Ref document number: 241163 Country of ref document: ES |