ES2086345T3 - Metodo para el reconocimiento del habla adaptable al usuario. - Google Patents

Metodo para el reconocimiento del habla adaptable al usuario.

Info

Publication number
ES2086345T3
ES2086345T3 ES90117539T ES90117539T ES2086345T3 ES 2086345 T3 ES2086345 T3 ES 2086345T3 ES 90117539 T ES90117539 T ES 90117539T ES 90117539 T ES90117539 T ES 90117539T ES 2086345 T3 ES2086345 T3 ES 2086345T3
Authority
ES
Spain
Prior art keywords
recognition
word
vocabulary
speech
diction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
ES90117539T
Other languages
English (en)
Inventor
Heidi Dr Hackbarth
Manfred Dr Immendorfer
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alcatel Lucent NV
Original Assignee
Alcatel NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alcatel NV filed Critical Alcatel NV
Application granted granted Critical
Publication of ES2086345T3 publication Critical patent/ES2086345T3/es
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Electric Clocks (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)

Abstract

UN PROCEDIMIENTO TAL DEBE SER ADECUADO TANTO PARA EL RECONOCIMIENTO DE PALABRAS INDIVIDUALES, COMO PARA UNA LENGUA HABLADA CONTINUAMENTE. DEBE CARACTERIZARSE POR LA SOLIDEZ DEL RECONOCIMIENTO DE LA MUESTRA DE PALABRA, EN SEGMENTACION DE SILABAS INCORRECTAS Y EN PRONUNCIACIONES VARIABLES, POR EJEMPLO EN UNA ABSORCION DE SILABAS. ADEMAS, DEBE POSIBILITAR UNA RAPIDA ADAPTACION DEL SISTEMA A UN NUEVO ORADOR Y UNA GENERACION Y AMPLIACION DE LAS FRASES DE TEXTOS ESCRITOS, SIN UN ENTRENAMIENTO EXPLICITO DEL SISTEMA POR QUIEN ENSEÑA EL LENGUAJE. UN RECONOCIMIENTO DE PALABRAS Y SERIES DE PALABRAS DEBE SER POSIBLE INCLUSO EN FRASES DE MUCHA AMPLITUD. LOS PROCEDIMIENTOS CONOCIDOS PARA EL RECONOCIMIENTO DE LENGUAS NECESITAN UN PROCEDIMIENTO DE ENTRENAMIENTO MUY COSTOSO. ADEMAS, SE PRODUCE UN FLUJO DE HIPOTESIS INMENSO EN LA LENGUA HABLADA DE CONTINUO Y EN GRAN VOCABULARIO. SEGUN EL INVENTO, SE SEGMENTAN Y CLASIFICAN LOS VECTORES DE CARACTERISTICAS EXTRAIDOS, CON AYUDA DE UN INVENTARIO DE UNIDADES DE PALABRAS, EN UNA FRASE DE HIPOTESIS DE UNIDADES DE PALABRAS ORIENTADAS EN SILABAS. DE LA FRASE DE HIPOTESIS SE PRODUCE, POR UNA COMPARACION TRIDIMENSIONAL DE DINAMICA DE TIEMPO, CON VARIANTES DE PRONUNCIACION DE UN CAUDAL DE PALABRAS DE MUESTRA DE REFERENCIA, UNA FRASE HIPOTETICA DE PALABRAS QUE ARROJA UN ANALISIS SINTACTICO PARA AVERIGUACION DE LA FRASE HABLADA.
ES90117539T 1989-09-22 1990-09-12 Metodo para el reconocimiento del habla adaptable al usuario. Expired - Lifetime ES2086345T3 (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
DE3931638A DE3931638A1 (de) 1989-09-22 1989-09-22 Verfahren zur sprecheradaptiven erkennung von sprache

Publications (1)

Publication Number Publication Date
ES2086345T3 true ES2086345T3 (es) 1996-07-01

Family

ID=6389967

Family Applications (1)

Application Number Title Priority Date Filing Date
ES90117539T Expired - Lifetime ES2086345T3 (es) 1989-09-22 1990-09-12 Metodo para el reconocimiento del habla adaptable al usuario.

Country Status (6)

Country Link
US (1) US5170432A (es)
EP (1) EP0418711B1 (es)
AT (1) ATE134275T1 (es)
AU (1) AU640164B2 (es)
DE (2) DE3931638A1 (es)
ES (1) ES2086345T3 (es)

Families Citing this family (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IT1229782B (it) * 1989-05-22 1991-09-11 Face Standard Ind Metodo ed apparato per riconoscere parole verbali sconosciute mediante estrazione dei parametri e confronto con parole di riferimento
DE4131387A1 (de) * 1991-09-20 1993-03-25 Siemens Ag Verfahren zur erkennung von mustern in zeitvarianten messsignalen
US5267345A (en) * 1992-02-10 1993-11-30 International Business Machines Corporation Speech recognition apparatus which predicts word classes from context and words from word classes
US5333275A (en) * 1992-06-23 1994-07-26 Wheatley Barbara J System and method for time aligning speech
US5425129A (en) * 1992-10-29 1995-06-13 International Business Machines Corporation Method for word spotting in continuous speech
ES2078834B1 (es) * 1992-10-30 1997-04-16 Alcatel Standard Electrica Metodo de segmentacion de cadenas de palabras en la fase de entrenamiento de un reconocedor de palabras conectadas.
DE4412930A1 (de) * 1994-04-15 1995-10-19 Philips Patentverwaltung Verfahren zum Ermitteln einer Folge von Wörtern
US5745649A (en) * 1994-07-07 1998-04-28 Nynex Science & Technology Corporation Automated speech recognition using a plurality of different multilayer perception structures to model a plurality of distinct phoneme categories
EP0703569B1 (de) * 1994-09-20 2000-03-01 Philips Patentverwaltung GmbH System zum Ermitteln von Wörtern aus einem Sprachsignal
US5873061A (en) * 1995-05-03 1999-02-16 U.S. Philips Corporation Method for constructing a model of a new word for addition to a word model database of a speech recognition system
US5765132A (en) * 1995-10-26 1998-06-09 Dragon Systems, Inc. Building speech models for new words in a multi-word utterance
US5799276A (en) * 1995-11-07 1998-08-25 Accent Incorporated Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
US6064959A (en) * 1997-03-28 2000-05-16 Dragon Systems, Inc. Error correction in speech recognition
US6151575A (en) * 1996-10-28 2000-11-21 Dragon Systems, Inc. Rapid adaptation of speech models
DE19705471C2 (de) * 1997-02-13 1998-04-09 Sican F & E Gmbh Sibet Verfahren und Schaltungsanordnung zur Spracherkennung und zur Sprachsteuerung von Vorrichtungen
US6212498B1 (en) 1997-03-28 2001-04-03 Dragon Systems, Inc. Enrollment in speech recognition
JP5025759B2 (ja) * 1997-11-17 2012-09-12 ニュアンス コミュニケーションズ,インコーポレイテッド 発音矯正装置、発音矯正方法および記録媒体
JP4267101B2 (ja) 1997-11-17 2009-05-27 インターナショナル・ビジネス・マシーンズ・コーポレーション 音声識別装置、発音矯正装置およびこれらの方法
US5927988A (en) * 1997-12-17 1999-07-27 Jenkins; William M. Method and apparatus for training of sensory and perceptual systems in LLI subjects
US6343267B1 (en) 1998-04-30 2002-01-29 Matsushita Electric Industrial Co., Ltd. Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques
US6263309B1 (en) * 1998-04-30 2001-07-17 Matsushita Electric Industrial Co., Ltd. Maximum likelihood method for finding an adapted speaker model in eigenvoice space
ATE225976T1 (de) 1998-05-15 2002-10-15 Siemens Ag Verfahren und vorrichtung zur erkennung mindestens eines schlüsselworts in gesprochener sprache durch einen rechner
US6163768A (en) 1998-06-15 2000-12-19 Dragon Systems, Inc. Non-interactive enrollment in speech recognition
US7937260B1 (en) * 1998-06-15 2011-05-03 At&T Intellectual Property Ii, L.P. Concise dynamic grammars using N-best selection
JP3720595B2 (ja) 1998-09-17 2005-11-30 キヤノン株式会社 音声認識装置及びその方法、コンピュータ可読メモリ
DE19857070A1 (de) * 1998-12-10 2000-06-15 Michael Mende Verfahren und Vorrichtung zur Ermittlung einer orthographischen Wiedergabe eines Textes
US6434521B1 (en) * 1999-06-24 2002-08-13 Speechworks International, Inc. Automatically determining words for updating in a pronunciation dictionary in a speech recognition system
DE19942869A1 (de) * 1999-09-08 2001-03-15 Volkswagen Ag Verfahren und Einrichtung zum Betrieb einer sprachgesteuerten Einrichtung bei Kraftfahrzeugen
US6571208B1 (en) 1999-11-29 2003-05-27 Matsushita Electric Industrial Co., Ltd. Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training
US6526379B1 (en) 1999-11-29 2003-02-25 Matsushita Electric Industrial Co., Ltd. Discriminative clustering methods for automatic speech recognition
US6868381B1 (en) * 1999-12-21 2005-03-15 Nortel Networks Limited Method and apparatus providing hypothesis driven speech modelling for use in speech recognition
DE10017717B4 (de) * 2000-04-11 2006-01-05 Leopold Kostal Gmbh & Co. Kg Spracheingabe gesteuertes Steuergerät
US7089184B2 (en) 2001-03-22 2006-08-08 Nurv Center Technologies, Inc. Speech recognition for recognizing speaker-independent, continuous speech
US7136852B1 (en) * 2001-11-27 2006-11-14 Ncr Corp. Case-based reasoning similarity metrics implementation using user defined functions
US6990445B2 (en) * 2001-12-17 2006-01-24 Xl8 Systems, Inc. System and method for speech recognition and transcription
US20030115169A1 (en) * 2001-12-17 2003-06-19 Hongzhuan Ye System and method for management of transcribed documents
DE10220524B4 (de) 2002-05-08 2006-08-10 Sap Ag Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache
EP1363271A1 (de) 2002-05-08 2003-11-19 Sap Ag Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs
DE60337022D1 (de) * 2002-09-27 2011-06-16 Callminer Inc Verfahren zur statistischen analyse von sprache
DE10304460B3 (de) * 2003-02-04 2004-03-11 Siemens Ag Generieren und Löschen von Aussprachevarianten zur Verringerung der Wortfehlerrate in der Spracherkennung
KR100486733B1 (ko) * 2003-02-24 2005-05-03 삼성전자주식회사 음소 결합정보를 이용한 연속 음성인식방법 및 장치
DE10337823A1 (de) * 2003-08-18 2005-03-17 Siemens Ag Sprachsteuerung von Audio- und Videogeräten
US20060031069A1 (en) * 2004-08-03 2006-02-09 Sony Corporation System and method for performing a grapheme-to-phoneme conversion
US20070094270A1 (en) * 2005-10-21 2007-04-26 Callminer, Inc. Method and apparatus for the processing of heterogeneous units of work
CN103631802B (zh) * 2012-08-24 2015-05-20 腾讯科技(深圳)有限公司 歌曲信息检索方法、装置及相应的服务器
US9747897B2 (en) * 2013-12-17 2017-08-29 Google Inc. Identifying substitute pronunciations
WO2015105994A1 (en) 2014-01-08 2015-07-16 Callminer, Inc. Real-time conversational analytics facility
US9570069B2 (en) * 2014-09-09 2017-02-14 Disney Enterprises, Inc. Sectioned memory networks for online word-spotting in continuous speech
KR102371697B1 (ko) * 2015-02-11 2022-03-08 삼성전자주식회사 음성 기능 운용 방법 및 이를 지원하는 전자 장치
US11691076B2 (en) 2020-08-10 2023-07-04 Jocelyn Tan Communication with in-game characters

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4780906A (en) * 1984-02-17 1988-10-25 Texas Instruments Incorporated Speaker-independent word recognition method and system based upon zero-crossing rate and energy measurement of analog speech signal
US4748670A (en) * 1985-05-29 1988-05-31 International Business Machines Corporation Apparatus and method for determining a likely word sequence from labels generated by an acoustic processor
US4941178A (en) * 1986-04-01 1990-07-10 Gte Laboratories Incorporated Speech recognition using preclassification and spectral normalization
US4882757A (en) * 1986-04-25 1989-11-21 Texas Instruments Incorporated Speech recognition system
US4866778A (en) * 1986-08-11 1989-09-12 Dragon Systems, Inc. Interactive speech recognition apparatus
IT1229782B (it) * 1989-05-22 1991-09-11 Face Standard Ind Metodo ed apparato per riconoscere parole verbali sconosciute mediante estrazione dei parametri e confronto con parole di riferimento

Also Published As

Publication number Publication date
DE59010131D1 (de) 1996-03-28
US5170432A (en) 1992-12-08
EP0418711B1 (de) 1996-02-14
ATE134275T1 (de) 1996-02-15
EP0418711A2 (de) 1991-03-27
DE3931638A1 (de) 1991-04-04
AU640164B2 (en) 1993-08-19
AU6255990A (en) 1991-03-28
EP0418711A3 (en) 1991-09-04

Similar Documents

Publication Publication Date Title
ES2086345T3 (es) Metodo para el reconocimiento del habla adaptable al usuario.
US8666745B2 (en) Speech recognition system with huge vocabulary
EP0917129A3 (en) Method and apparatus for adapting a speech recognizer to the pronunciation of an non native speaker
Ananthakrishnan et al. An automatic prosody recognizer using a coupled multi-stream acoustic model and a syntactic-prosodic language model
US20020152068A1 (en) New language context dependent data labeling
Müller et al. Towards phoneme inventory discovery for documentation of unwritten languages
Tsubota et al. Recognition and verification of English by Japanese students for computer-assisted language learning system.
US20140142925A1 (en) Self-organizing unit recognition for speech and other data series
Kuo et al. Improved HMM/SVM methods for automatic phoneme segmentation.
Rasipuram et al. Grapheme and multilingual posterior features for under-resourced speech recognition: a study on scottish gaelic
Walker et al. Language-reconfigurable universal phone recognition.
Cosi et al. Italian children's speech recognition for advanced interactive literacy tutors.
Lim et al. Using local & global phonotactic features in Chinese dialect identification
Zgank et al. Crosslingual speech recognition with multilingual acoustic models based on agglomerative and tree-based triphone clustering
Elhadj et al. An accurate recognizer for basic arabic sounds
Malhotra et al. Automatic identification of gender & accent in spoken Hindi utterances with regional Indian accents
KR20130067854A (ko) 코퍼스 기반 언어모델 변별학습 방법 및 그 장치
Tolba et al. Speech recognition by intelligent machines
Pan et al. Emotion-detecting based model selection for emotional speech recognition
Soe et al. Syllable-based speech recognition system for Myanmar
Stüker Integrating Thai grapheme based acoustic models into the ML-MIX framework-for language independent and cross-language ASR.
Kertkeidkachorn et al. Using tone information in Thai spelling speech recognition
Yang et al. Automatic phrase boundary labeling for Mandarin TTS corpus using context-dependent HMM
Kumaran et al. Attention shift decoding for conversational speech recognition.
Yang et al. Unsupervised prosodic phrase boundary labeling of Mandarin speech synthesis database using context-dependent HMM

Legal Events

Date Code Title Description
FG2A Definitive protection

Ref document number: 418711

Country of ref document: ES