ES2047029T3 - Sistema y tecnica de reconocimiento de voz. - Google Patents

Sistema y tecnica de reconocimiento de voz.

Info

Publication number
ES2047029T3
ES2047029T3 ES88302730T ES88302730T ES2047029T3 ES 2047029 T3 ES2047029 T3 ES 2047029T3 ES 88302730 T ES88302730 T ES 88302730T ES 88302730 T ES88302730 T ES 88302730T ES 2047029 T3 ES2047029 T3 ES 2047029T3
Authority
ES
Spain
Prior art keywords
technique
words
selection
alignment
corresponding part
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
ES88302730T
Other languages
English (en)
Inventor
Stephen Eliot Levinson
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
American Telephone and Telegraph Co Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by American Telephone and Telegraph Co Inc filed Critical American Telephone and Telegraph Co Inc
Application granted granted Critical
Publication of ES2047029T3 publication Critical patent/ES2047029T3/es
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)

Abstract

SE PRESENTA UN SISTEMA Y UNA TECNICA DE RECONOCIMIENTO DE LAS PALABRAS QUE ES DE TIPO ACUSTICA/FONETICA. SE HACE UN LOCUTOR-INDEPENDIENTE Y ES CAPAZ DE UN RECONOCIMIENTO DE LAS PALABRAS CONTINUO DURANTE UN DISCURSO FLUIDO MEDIANTE UNA COMBINACION DE TECNICAS QUE INCLUYEN, ENTRE, USANDO UN MODELO DE MARKOV LLAMADO DE DURACION-VARIABLE-CONTINUAMENTE LATENTE PARA LA IDENTIFICACION DE SEGMENTOS DE PALABRAS Y HACIENDO TODOS LOS PASOS DE LA TECNICA EN RESPUESTA A LA INFORMACION PERMANENTE, Y USANDO UN PASO SEPARADO PARA EL ALINEAMIENTO DE LOS MIEMBROS DE LAS ORDENACIONES DE PALABRAS CANDIDATAS CON LAS SEÑALES DE CARACTERISTICAS ACUSTICAS REPRESENTATIVAS DE LA PARTE CORRESPONDIENTE DE LA PRONUNCIACION, INCLUYENDO EL USAR PARES DE SEGMENTOS FONETICOS CANDIDATOS EN LA TECNICA DE ALINEAMIENTO. ALGUNA AMBIGUEDAD RESIDUAL EN LA SELECCION DE PALABRAS SE RESUELVE ENTONCES MAS RAPIDAMENTE QUE EN EL ARTIFICIO ANTERIOR, PRESCINDIENDO DE LA TECNICA DE SELECCION DE SENTENCIA ULTIMA EMPLEADA.
ES88302730T 1987-04-03 1988-03-28 Sistema y tecnica de reconocimiento de voz. Expired - Lifetime ES2047029T3 (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US07/034,467 US4852180A (en) 1987-04-03 1987-04-03 Speech recognition by acoustic/phonetic system and technique
SG22094A SG22094G (en) 1987-04-03 1994-02-07 Speech recognition system and technique

Publications (1)

Publication Number Publication Date
ES2047029T3 true ES2047029T3 (es) 1994-02-16

Family

ID=26663871

Family Applications (1)

Application Number Title Priority Date Filing Date
ES88302730T Expired - Lifetime ES2047029T3 (es) 1987-04-03 1988-03-28 Sistema y tecnica de reconocimiento de voz.

Country Status (9)

Country Link
US (1) US4852180A (es)
EP (1) EP0285353B1 (es)
JP (1) JPS63259697A (es)
AU (1) AU596510B2 (es)
CA (1) CA1336207C (es)
DE (1) DE3886080T2 (es)
ES (1) ES2047029T3 (es)
HK (1) HK107994A (es)
SG (1) SG22094G (es)

Families Citing this family (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0296800A (ja) * 1988-10-03 1990-04-09 Nec Corp 連続音声認識装置
US5278911A (en) * 1989-05-18 1994-01-11 Smiths Industries Public Limited Company Speech recognition using a neural net
WO1991013431A1 (en) * 1990-02-26 1991-09-05 Motorola, Inc Method and apparatus for recognizing string of word commands in a hierarchical command structure
US5222188A (en) * 1990-08-21 1993-06-22 Emerson & Stern Associates, Inc. Method and apparatus for speech recognition based on subsyllable spellings
US5208897A (en) * 1990-08-21 1993-05-04 Emerson & Stern Associates, Inc. Method and apparatus for speech recognition based on subsyllable spellings
DE69022237T2 (de) * 1990-10-16 1996-05-02 Ibm Sprachsyntheseeinrichtung nach dem phonetischen Hidden-Markov-Modell.
JP2979711B2 (ja) * 1991-04-24 1999-11-15 日本電気株式会社 パターン認識方式および標準パターン学習方式
KR100309207B1 (ko) * 1993-03-12 2001-12-17 에드워드 이. 데이비스 음성-대화식언어명령방법및장치
US5704004A (en) * 1993-12-01 1997-12-30 Industrial Technology Research Institute Apparatus and method for normalizing and categorizing linear prediction code vectors using Bayesian categorization technique
US5615299A (en) * 1994-06-20 1997-03-25 International Business Machines Corporation Speech recognition using dynamic features
GB2290684A (en) * 1994-06-22 1996-01-03 Ibm Speech synthesis using hidden Markov model to determine speech unit durations
WO1996008005A1 (en) * 1994-09-07 1996-03-14 Motorola Inc. System for recognizing spoken sounds from continuous speech and method of using same
US5594834A (en) * 1994-09-30 1997-01-14 Motorola, Inc. Method and system for recognizing a boundary between sounds in continuous speech
US5596679A (en) * 1994-10-26 1997-01-21 Motorola, Inc. Method and system for identifying spoken sounds in continuous speech by comparing classifier outputs
US5638486A (en) * 1994-10-26 1997-06-10 Motorola, Inc. Method and system for continuous speech recognition using voting techniques
US5687287A (en) * 1995-05-22 1997-11-11 Lucent Technologies Inc. Speaker verification method and apparatus using mixture decomposition discrimination
WO1998014934A1 (en) * 1996-10-02 1998-04-09 Sri International Method and system for automatic text-independent grading of pronunciation for language instruction
US6018708A (en) * 1997-08-26 2000-01-25 Nortel Networks Corporation Method and apparatus for performing speech recognition utilizing a supplementary lexicon of frequently used orthographies
US5983177A (en) * 1997-12-18 1999-11-09 Nortel Networks Corporation Method and apparatus for obtaining transcriptions from multiple training utterances
DE19857070A1 (de) * 1998-12-10 2000-06-15 Michael Mende Verfahren und Vorrichtung zur Ermittlung einer orthographischen Wiedergabe eines Textes
US6671669B1 (en) * 2000-07-18 2003-12-30 Qualcomm Incorporated combined engine system and method for voice recognition
US7089184B2 (en) 2001-03-22 2006-08-08 Nurv Center Technologies, Inc. Speech recognition for recognizing speaker-independent, continuous speech
US7769592B2 (en) * 2002-02-22 2010-08-03 Nuance Communications, Inc. Automatic selection of a disambiguation data field for a speech interface
US7697700B2 (en) * 2006-05-04 2010-04-13 Sony Computer Entertainment Inc. Noise removal for electronic device with far field microphone on console
US7062436B1 (en) 2003-02-11 2006-06-13 Microsoft Corporation Word-specific acoustic models in a speech recognition system
US7076422B2 (en) * 2003-03-13 2006-07-11 Microsoft Corporation Modelling and processing filled pauses and noises in speech recognition
US7487094B1 (en) 2003-06-20 2009-02-03 Utopy, Inc. System and method of call classification with context modeling based on composite words
US7433820B2 (en) * 2004-05-12 2008-10-07 International Business Machines Corporation Asynchronous Hidden Markov Model method and system
US20050282563A1 (en) * 2004-06-17 2005-12-22 Ixi Mobile (R&D) Ltd. Message recognition and display system and method for a mobile communication device
US8924212B1 (en) 2005-08-26 2014-12-30 At&T Intellectual Property Ii, L.P. System and method for robust access and entry to large structured data using voice form-filling
US8654963B2 (en) 2008-12-19 2014-02-18 Genesys Telecommunications Laboratories, Inc. Method and system for integrating an interaction management system with a business rules management system
US8494857B2 (en) * 2009-01-06 2013-07-23 Regents Of The University Of Minnesota Automatic measurement of speech fluency
US8463606B2 (en) 2009-07-13 2013-06-11 Genesys Telecommunications Laboratories, Inc. System for analyzing interactions and reporting analytic results to human-operated and system interfaces in real time
US9576593B2 (en) 2012-03-15 2017-02-21 Regents Of The University Of Minnesota Automated verbal fluency assessment
US9230548B2 (en) * 2012-06-06 2016-01-05 Cypress Semiconductor Corporation Hybrid hashing scheme for active HMMS
US9912816B2 (en) 2012-11-29 2018-03-06 Genesys Telecommunications Laboratories, Inc. Workload distribution with resource awareness
US9542936B2 (en) 2012-12-29 2017-01-10 Genesys Telecommunications Laboratories, Inc. Fast out-of-vocabulary search in automatic speech recognition systems
CN109478399B (zh) * 2016-07-22 2023-07-25 雅马哈株式会社 演奏分析方法、自动演奏方法及自动演奏系统
CN108022593A (zh) * 2018-01-16 2018-05-11 成都福兰特电子技术股份有限公司 一种高灵敏度语音识别系统及其控制方法

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US458670A (en) * 1891-09-01 Combined bin and sifter
US4277644A (en) * 1979-07-16 1981-07-07 Bell Telephone Laboratories, Incorporated Syntactic continuous speech recognizer
US4761815A (en) * 1981-05-01 1988-08-02 Figgie International, Inc. Speech recognition system based on word state duration and/or weight
US4587670A (en) * 1982-10-15 1986-05-06 At&T Bell Laboratories Hidden Markov model speech recognition arrangement
US4718094A (en) * 1984-11-19 1988-01-05 International Business Machines Corp. Speech recognition system
US4783804A (en) * 1985-03-21 1988-11-08 American Telephone And Telegraph Company, At&T Bell Laboratories Hidden Markov model speech recognition arrangement

Also Published As

Publication number Publication date
HK107994A (en) 1994-10-14
SG22094G (en) 1995-03-17
AU596510B2 (en) 1990-05-03
US4852180A (en) 1989-07-25
JPS63259697A (ja) 1988-10-26
DE3886080D1 (de) 1994-01-20
AU1404288A (en) 1988-10-06
EP0285353A3 (en) 1989-08-23
EP0285353A2 (en) 1988-10-05
DE3886080T2 (de) 1994-05-11
CA1336207C (en) 1995-07-04
EP0285353B1 (en) 1993-12-08

Similar Documents

Publication Publication Date Title
ES2047029T3 (es) Sistema y tecnica de reconocimiento de voz.
Ostendorf et al. The Boston University radio news corpus
ES2018761A4 (es) Sistema para el reconocimiento de una conversacion.
EP1143415A1 (en) Generation of multiple proper name pronunciations for speech recognition
CA2069675A1 (en) Flexible vocabulary recognition
IT989203B (it) Sistema perfezionato per l iden tificazione di suoni fonici
DE69922104D1 (de) Spracherkenner mit durch buchstabierte Worteingabe adaptierbarem Wortschatz
ES2086345T3 (es) Metodo para el reconocimiento del habla adaptable al usuario.
ATE317583T1 (de) Texteditierung von erkannter sprache bei gleichzeitiger wiedergabe
BR9712979A (pt) Processo para adaptação de um modelo acústico hidden markov em um sistema de identificação de fala
ES2173389T3 (es) Procedimiento y dispositivo para la sintesis de señales vocales.
BR9913524A (pt) Reconhecedor de voz, e, processo de reconhecimento de voz
Hadlich The phonological history of Vegliote
Riad et al. The origin of Danish stød
JPS5774799A (en) Word voice notifying system
JPS5361203A (en) Language information input devicw
Lamel et al. A phone-based approach to non-linguistic speech feature identification
AR241344A1 (es) Indicador de registro habilitado para una unidad almacenadora de señal.
DE60022976D1 (de) Spracherkennungseinrichtung mit transfermitteln
Benarousse et al. The NATO native and non-native (N4) speech corpus
ES2169572T3 (es) Procedimiento de reconocimiento de voz empleando una gramatica.
Price et al. The use of relative duration in syntactic disambiguation.
Raffler-Engel Investigation of Italo-American Bilinguals
JPS4949241B1 (es)
Ircing et al. Two-pass recognition of Czech speech using adaptive vocabulary

Legal Events

Date Code Title Description
FG2A Definitive protection

Ref document number: 285353

Country of ref document: ES