ES2086345T3 - Metodo para el reconocimiento del habla adaptable al usuario. - Google Patents
Metodo para el reconocimiento del habla adaptable al usuario.Info
- Publication number
- ES2086345T3 ES2086345T3 ES90117539T ES90117539T ES2086345T3 ES 2086345 T3 ES2086345 T3 ES 2086345T3 ES 90117539 T ES90117539 T ES 90117539T ES 90117539 T ES90117539 T ES 90117539T ES 2086345 T3 ES2086345 T3 ES 2086345T3
- Authority
- ES
- Spain
- Prior art keywords
- recognition
- word
- vocabulary
- speech
- diction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000000034 method Methods 0.000 title abstract 4
- 230000006978 adaptation Effects 0.000 abstract 1
- 238000003909 pattern recognition Methods 0.000 abstract 1
- 230000011218 segmentation Effects 0.000 abstract 1
- 239000013598 vector Substances 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Electric Clocks (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
Abstract
UN PROCEDIMIENTO TAL DEBE SER ADECUADO TANTO PARA EL RECONOCIMIENTO DE PALABRAS INDIVIDUALES, COMO PARA UNA LENGUA HABLADA CONTINUAMENTE. DEBE CARACTERIZARSE POR LA SOLIDEZ DEL RECONOCIMIENTO DE LA MUESTRA DE PALABRA, EN SEGMENTACION DE SILABAS INCORRECTAS Y EN PRONUNCIACIONES VARIABLES, POR EJEMPLO EN UNA ABSORCION DE SILABAS. ADEMAS, DEBE POSIBILITAR UNA RAPIDA ADAPTACION DEL SISTEMA A UN NUEVO ORADOR Y UNA GENERACION Y AMPLIACION DE LAS FRASES DE TEXTOS ESCRITOS, SIN UN ENTRENAMIENTO EXPLICITO DEL SISTEMA POR QUIEN ENSEÑA EL LENGUAJE. UN RECONOCIMIENTO DE PALABRAS Y SERIES DE PALABRAS DEBE SER POSIBLE INCLUSO EN FRASES DE MUCHA AMPLITUD. LOS PROCEDIMIENTOS CONOCIDOS PARA EL RECONOCIMIENTO DE LENGUAS NECESITAN UN PROCEDIMIENTO DE ENTRENAMIENTO MUY COSTOSO. ADEMAS, SE PRODUCE UN FLUJO DE HIPOTESIS INMENSO EN LA LENGUA HABLADA DE CONTINUO Y EN GRAN VOCABULARIO. SEGUN EL INVENTO, SE SEGMENTAN Y CLASIFICAN LOS VECTORES DE CARACTERISTICAS EXTRAIDOS, CON AYUDA DE UN INVENTARIO DE UNIDADES DE PALABRAS, EN UNA FRASE DE HIPOTESIS DE UNIDADES DE PALABRAS ORIENTADAS EN SILABAS. DE LA FRASE DE HIPOTESIS SE PRODUCE, POR UNA COMPARACION TRIDIMENSIONAL DE DINAMICA DE TIEMPO, CON VARIANTES DE PRONUNCIACION DE UN CAUDAL DE PALABRAS DE MUESTRA DE REFERENCIA, UNA FRASE HIPOTETICA DE PALABRAS QUE ARROJA UN ANALISIS SINTACTICO PARA AVERIGUACION DE LA FRASE HABLADA.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE3931638A DE3931638A1 (de) | 1989-09-22 | 1989-09-22 | Verfahren zur sprecheradaptiven erkennung von sprache |
Publications (1)
Publication Number | Publication Date |
---|---|
ES2086345T3 true ES2086345T3 (es) | 1996-07-01 |
Family
ID=6389967
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
ES90117539T Expired - Lifetime ES2086345T3 (es) | 1989-09-22 | 1990-09-12 | Metodo para el reconocimiento del habla adaptable al usuario. |
Country Status (6)
Country | Link |
---|---|
US (1) | US5170432A (es) |
EP (1) | EP0418711B1 (es) |
AT (1) | ATE134275T1 (es) |
AU (1) | AU640164B2 (es) |
DE (2) | DE3931638A1 (es) |
ES (1) | ES2086345T3 (es) |
Families Citing this family (50)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
IT1229782B (it) * | 1989-05-22 | 1991-09-11 | Face Standard Ind | Metodo ed apparato per riconoscere parole verbali sconosciute mediante estrazione dei parametri e confronto con parole di riferimento |
DE4131387A1 (de) * | 1991-09-20 | 1993-03-25 | Siemens Ag | Verfahren zur erkennung von mustern in zeitvarianten messsignalen |
US5267345A (en) * | 1992-02-10 | 1993-11-30 | International Business Machines Corporation | Speech recognition apparatus which predicts word classes from context and words from word classes |
US5333275A (en) * | 1992-06-23 | 1994-07-26 | Wheatley Barbara J | System and method for time aligning speech |
US5425129A (en) * | 1992-10-29 | 1995-06-13 | International Business Machines Corporation | Method for word spotting in continuous speech |
ES2078834B1 (es) * | 1992-10-30 | 1997-04-16 | Alcatel Standard Electrica | Metodo de segmentacion de cadenas de palabras en la fase de entrenamiento de un reconocedor de palabras conectadas. |
DE4412930A1 (de) * | 1994-04-15 | 1995-10-19 | Philips Patentverwaltung | Verfahren zum Ermitteln einer Folge von Wörtern |
US5745649A (en) * | 1994-07-07 | 1998-04-28 | Nynex Science & Technology Corporation | Automated speech recognition using a plurality of different multilayer perception structures to model a plurality of distinct phoneme categories |
EP0703569B1 (de) * | 1994-09-20 | 2000-03-01 | Philips Patentverwaltung GmbH | System zum Ermitteln von Wörtern aus einem Sprachsignal |
US5873061A (en) * | 1995-05-03 | 1999-02-16 | U.S. Philips Corporation | Method for constructing a model of a new word for addition to a word model database of a speech recognition system |
US5765132A (en) * | 1995-10-26 | 1998-06-09 | Dragon Systems, Inc. | Building speech models for new words in a multi-word utterance |
US5799276A (en) * | 1995-11-07 | 1998-08-25 | Accent Incorporated | Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals |
US6064959A (en) * | 1997-03-28 | 2000-05-16 | Dragon Systems, Inc. | Error correction in speech recognition |
US6151575A (en) * | 1996-10-28 | 2000-11-21 | Dragon Systems, Inc. | Rapid adaptation of speech models |
DE19705471C2 (de) * | 1997-02-13 | 1998-04-09 | Sican F & E Gmbh Sibet | Verfahren und Schaltungsanordnung zur Spracherkennung und zur Sprachsteuerung von Vorrichtungen |
US6212498B1 (en) | 1997-03-28 | 2001-04-03 | Dragon Systems, Inc. | Enrollment in speech recognition |
JP5025759B2 (ja) * | 1997-11-17 | 2012-09-12 | ニュアンス コミュニケーションズ,インコーポレイテッド | 発音矯正装置、発音矯正方法および記録媒体 |
JP4267101B2 (ja) | 1997-11-17 | 2009-05-27 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 音声識別装置、発音矯正装置およびこれらの方法 |
US5927988A (en) * | 1997-12-17 | 1999-07-27 | Jenkins; William M. | Method and apparatus for training of sensory and perceptual systems in LLI subjects |
US6343267B1 (en) | 1998-04-30 | 2002-01-29 | Matsushita Electric Industrial Co., Ltd. | Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques |
US6263309B1 (en) * | 1998-04-30 | 2001-07-17 | Matsushita Electric Industrial Co., Ltd. | Maximum likelihood method for finding an adapted speaker model in eigenvoice space |
ATE225976T1 (de) | 1998-05-15 | 2002-10-15 | Siemens Ag | Verfahren und vorrichtung zur erkennung mindestens eines schlüsselworts in gesprochener sprache durch einen rechner |
US6163768A (en) | 1998-06-15 | 2000-12-19 | Dragon Systems, Inc. | Non-interactive enrollment in speech recognition |
US7937260B1 (en) * | 1998-06-15 | 2011-05-03 | At&T Intellectual Property Ii, L.P. | Concise dynamic grammars using N-best selection |
JP3720595B2 (ja) | 1998-09-17 | 2005-11-30 | キヤノン株式会社 | 音声認識装置及びその方法、コンピュータ可読メモリ |
DE19857070A1 (de) * | 1998-12-10 | 2000-06-15 | Michael Mende | Verfahren und Vorrichtung zur Ermittlung einer orthographischen Wiedergabe eines Textes |
US6434521B1 (en) * | 1999-06-24 | 2002-08-13 | Speechworks International, Inc. | Automatically determining words for updating in a pronunciation dictionary in a speech recognition system |
DE19942869A1 (de) * | 1999-09-08 | 2001-03-15 | Volkswagen Ag | Verfahren und Einrichtung zum Betrieb einer sprachgesteuerten Einrichtung bei Kraftfahrzeugen |
US6571208B1 (en) | 1999-11-29 | 2003-05-27 | Matsushita Electric Industrial Co., Ltd. | Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training |
US6526379B1 (en) | 1999-11-29 | 2003-02-25 | Matsushita Electric Industrial Co., Ltd. | Discriminative clustering methods for automatic speech recognition |
US6868381B1 (en) * | 1999-12-21 | 2005-03-15 | Nortel Networks Limited | Method and apparatus providing hypothesis driven speech modelling for use in speech recognition |
DE10017717B4 (de) * | 2000-04-11 | 2006-01-05 | Leopold Kostal Gmbh & Co. Kg | Spracheingabe gesteuertes Steuergerät |
US7089184B2 (en) | 2001-03-22 | 2006-08-08 | Nurv Center Technologies, Inc. | Speech recognition for recognizing speaker-independent, continuous speech |
US7136852B1 (en) * | 2001-11-27 | 2006-11-14 | Ncr Corp. | Case-based reasoning similarity metrics implementation using user defined functions |
US6990445B2 (en) * | 2001-12-17 | 2006-01-24 | Xl8 Systems, Inc. | System and method for speech recognition and transcription |
US20030115169A1 (en) * | 2001-12-17 | 2003-06-19 | Hongzhuan Ye | System and method for management of transcribed documents |
DE10220524B4 (de) | 2002-05-08 | 2006-08-10 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache |
EP1363271A1 (de) | 2002-05-08 | 2003-11-19 | Sap Ag | Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs |
DE60337022D1 (de) * | 2002-09-27 | 2011-06-16 | Callminer Inc | Verfahren zur statistischen analyse von sprache |
DE10304460B3 (de) * | 2003-02-04 | 2004-03-11 | Siemens Ag | Generieren und Löschen von Aussprachevarianten zur Verringerung der Wortfehlerrate in der Spracherkennung |
KR100486733B1 (ko) * | 2003-02-24 | 2005-05-03 | 삼성전자주식회사 | 음소 결합정보를 이용한 연속 음성인식방법 및 장치 |
DE10337823A1 (de) * | 2003-08-18 | 2005-03-17 | Siemens Ag | Sprachsteuerung von Audio- und Videogeräten |
US20060031069A1 (en) * | 2004-08-03 | 2006-02-09 | Sony Corporation | System and method for performing a grapheme-to-phoneme conversion |
US20070094270A1 (en) * | 2005-10-21 | 2007-04-26 | Callminer, Inc. | Method and apparatus for the processing of heterogeneous units of work |
CN103631802B (zh) * | 2012-08-24 | 2015-05-20 | 腾讯科技(深圳)有限公司 | 歌曲信息检索方法、装置及相应的服务器 |
US9747897B2 (en) * | 2013-12-17 | 2017-08-29 | Google Inc. | Identifying substitute pronunciations |
WO2015105994A1 (en) | 2014-01-08 | 2015-07-16 | Callminer, Inc. | Real-time conversational analytics facility |
US9570069B2 (en) * | 2014-09-09 | 2017-02-14 | Disney Enterprises, Inc. | Sectioned memory networks for online word-spotting in continuous speech |
KR102371697B1 (ko) * | 2015-02-11 | 2022-03-08 | 삼성전자주식회사 | 음성 기능 운용 방법 및 이를 지원하는 전자 장치 |
US11691076B2 (en) | 2020-08-10 | 2023-07-04 | Jocelyn Tan | Communication with in-game characters |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4780906A (en) * | 1984-02-17 | 1988-10-25 | Texas Instruments Incorporated | Speaker-independent word recognition method and system based upon zero-crossing rate and energy measurement of analog speech signal |
US4748670A (en) * | 1985-05-29 | 1988-05-31 | International Business Machines Corporation | Apparatus and method for determining a likely word sequence from labels generated by an acoustic processor |
US4941178A (en) * | 1986-04-01 | 1990-07-10 | Gte Laboratories Incorporated | Speech recognition using preclassification and spectral normalization |
US4882757A (en) * | 1986-04-25 | 1989-11-21 | Texas Instruments Incorporated | Speech recognition system |
US4866778A (en) * | 1986-08-11 | 1989-09-12 | Dragon Systems, Inc. | Interactive speech recognition apparatus |
IT1229782B (it) * | 1989-05-22 | 1991-09-11 | Face Standard Ind | Metodo ed apparato per riconoscere parole verbali sconosciute mediante estrazione dei parametri e confronto con parole di riferimento |
-
1989
- 1989-09-22 DE DE3931638A patent/DE3931638A1/de not_active Withdrawn
-
1990
- 1990-09-12 DE DE59010131T patent/DE59010131D1/de not_active Expired - Lifetime
- 1990-09-12 ES ES90117539T patent/ES2086345T3/es not_active Expired - Lifetime
- 1990-09-12 AT AT90117539T patent/ATE134275T1/de active
- 1990-09-12 EP EP90117539A patent/EP0418711B1/de not_active Expired - Lifetime
- 1990-09-17 AU AU62559/90A patent/AU640164B2/en not_active Ceased
- 1990-09-21 US US07/586,086 patent/US5170432A/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
DE59010131D1 (de) | 1996-03-28 |
US5170432A (en) | 1992-12-08 |
EP0418711B1 (de) | 1996-02-14 |
ATE134275T1 (de) | 1996-02-15 |
EP0418711A2 (de) | 1991-03-27 |
DE3931638A1 (de) | 1991-04-04 |
AU640164B2 (en) | 1993-08-19 |
AU6255990A (en) | 1991-03-28 |
EP0418711A3 (en) | 1991-09-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ES2086345T3 (es) | Metodo para el reconocimiento del habla adaptable al usuario. | |
US8666745B2 (en) | Speech recognition system with huge vocabulary | |
EP0917129A3 (en) | Method and apparatus for adapting a speech recognizer to the pronunciation of an non native speaker | |
Ananthakrishnan et al. | An automatic prosody recognizer using a coupled multi-stream acoustic model and a syntactic-prosodic language model | |
US20020152068A1 (en) | New language context dependent data labeling | |
Müller et al. | Towards phoneme inventory discovery for documentation of unwritten languages | |
Tsubota et al. | Recognition and verification of English by Japanese students for computer-assisted language learning system. | |
US20140142925A1 (en) | Self-organizing unit recognition for speech and other data series | |
Kuo et al. | Improved HMM/SVM methods for automatic phoneme segmentation. | |
Rasipuram et al. | Grapheme and multilingual posterior features for under-resourced speech recognition: a study on scottish gaelic | |
Walker et al. | Language-reconfigurable universal phone recognition. | |
Cosi et al. | Italian children's speech recognition for advanced interactive literacy tutors. | |
Lim et al. | Using local & global phonotactic features in Chinese dialect identification | |
Zgank et al. | Crosslingual speech recognition with multilingual acoustic models based on agglomerative and tree-based triphone clustering | |
Elhadj et al. | An accurate recognizer for basic arabic sounds | |
Malhotra et al. | Automatic identification of gender & accent in spoken Hindi utterances with regional Indian accents | |
KR20130067854A (ko) | 코퍼스 기반 언어모델 변별학습 방법 및 그 장치 | |
Tolba et al. | Speech recognition by intelligent machines | |
Pan et al. | Emotion-detecting based model selection for emotional speech recognition | |
Soe et al. | Syllable-based speech recognition system for Myanmar | |
Stüker | Integrating Thai grapheme based acoustic models into the ML-MIX framework-for language independent and cross-language ASR. | |
Kertkeidkachorn et al. | Using tone information in Thai spelling speech recognition | |
Yang et al. | Automatic phrase boundary labeling for Mandarin TTS corpus using context-dependent HMM | |
Kumaran et al. | Attention shift decoding for conversational speech recognition. | |
Yang et al. | Unsupervised prosodic phrase boundary labeling of Mandarin speech synthesis database using context-dependent HMM |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG2A | Definitive protection |
Ref document number: 418711 Country of ref document: ES |