MXPA02005387A - Process and device for speech recognition using disjoint language models - Google Patents

Process and device for speech recognition using disjoint language models

Info

Publication number
MXPA02005387A
Authority
MX
Mexico
Prior art keywords
disjoint
speech recognition
language models
decoding step
determining
Prior art date
Application number
MXPA02005387A
Other languages
English (en)
Inventor
Soufflet Frederic
Original Assignee
Thomson Licensing Sa
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing Sa
Publication of MXPA02005387A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L2015/085 Methods for reducing search complexity, pruning

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The subject of the invention is a speech recognition process comprising a step of acquiring an acoustic signal, an acoustic-phonetic decoding step and a linguistic decoding step. According to the invention, the linguistic decoding comprises the steps of: disjoint application of a plurality of language models to the analysis of an audio sequence in order to determine a plurality of candidate word sequences; and determination by a search engine of the most probable word sequence from among the candidate sequences. The subject of the invention is also a device for implementing the process.
MXPA02005387A 1999-12-02 2000-12-01 Process and device for speech recognition using disjoint language models. MXPA02005387A (es)
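The linguistic decoding summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes that the acoustic-phonetic decoder delivers a simple lattice of word hypotheses with acoustic log scores, that each language model is a toy bigram model limited to its own vocabulary, and that the search engine simply keeps the highest-scoring candidate. All names (`BigramModel`, `Candidate`, `recognize`, the example vocabularies and probabilities) are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    model: str        # name of the language model that produced the sequence
    words: tuple      # candidate word sequence
    log_score: float  # combined acoustic + language model log score


class BigramModel:
    """Toy bigram language model restricted to its own vocabulary (assumption)."""

    def __init__(self, name, bigram_probs, floor=1e-6):
        self.name = name
        self.bigram_probs = bigram_probs
        self.floor = floor  # probability given to unseen bigrams
        self.vocab = {w for pair in bigram_probs for w in pair if w != "<s>"}

    def log_prob(self, prev, word):
        return math.log(self.bigram_probs.get((prev, word), self.floor))

    def decode(self, lattice):
        """Viterbi pass over the lattice using only this model's vocabulary."""
        # Maps the last word of a partial path to (log score, word sequence).
        paths = {"<s>": (0.0, ())}
        for slot in lattice:
            new_paths = {}
            for word, acoustic_log in slot.items():
                if word not in self.vocab:
                    continue  # this model cannot explain the hypothesis
                new_paths[word] = max(
                    (score + self.log_prob(prev, word) + acoustic_log, seq + (word,))
                    for prev, (score, seq) in paths.items()
                )
            if not new_paths:
                return None  # the model covers no hypothesis in this slot
            paths = new_paths
        score, words = max(paths.values())
        return Candidate(self.name, words, score)


def recognize(lattice, models):
    """Apply every model separately, then keep the most probable candidate."""
    candidates = [c for c in (m.decode(lattice) for m in models) if c is not None]
    return max(candidates, key=lambda c: c.log_score) if candidates else None


# Hypothetical lattice: one dict of {word: acoustic log score} per time slot.
lattice = [
    {"switch": math.log(0.6), "six": math.log(0.4)},
    {"to": math.log(0.5), "two": math.log(0.5)},
    {"three": math.log(0.7), "free": math.log(0.3)},
]
commands = BigramModel(
    "commands",
    {("<s>", "switch"): 0.5, ("switch", "to"): 0.8, ("to", "three"): 0.3},
)
numbers = BigramModel(
    "numbers",
    {("<s>", "six"): 0.1, ("six", "two"): 0.1, ("two", "three"): 0.1},
)

best = recognize(lattice, [commands, numbers])
print(best.model, " ".join(best.words))
```

Each model decodes the same input on its own, so no model's probabilities influence another model's search; the final comparison across candidates plays the role of the search engine. With the illustrative numbers above, the `commands` model's sequence scores higher than the `numbers` model's.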

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR9915189 1999-12-02
PCT/FR2000/003356 WO2001041126A1 (fr) 1999-12-02 2000-12-01 Method and device for speech recognition with disjoint language models

Publications (1)

Publication Number Publication Date
MXPA02005387A (es) 2004-04-21

Family

ID=9552792

Family Applications (1)

Application Number Title Priority Date Filing Date
MXPA02005387A MXPA02005387A (es) 1999-12-02 2000-12-01 Process and device for speech recognition using disjoint language models.

Country Status (8)

Country Link
US (1) US20030093272A1 (es)
EP (1) EP1234303B1 (es)
JP (1) JP2003515778A (es)
CN (1) CN1254787C (es)
AU (1) AU2181601A (es)
DE (1) DE60023736T2 (es)
MX (1) MXPA02005387A (es)
WO (1) WO2001041126A1 (es)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3426176B2 (ja) * 1999-12-27 2003-07-14 International Business Machines Corporation Speech recognition device, method, computer system and storage medium
DE10024895A1 (de) * 2000-05-19 2001-11-22 Thomson Brandt Gmbh System for operating a consumer electronics device
US7395205B2 (en) * 2001-02-13 2008-07-01 International Business Machines Corporation Dynamic language model mixtures with history-based buckets
DE10220522B4 (de) * 2002-05-08 2005-11-17 Sap Ag Method and system for processing speech data by means of speech recognition and frequency analysis
EP1361740A1 (de) * 2002-05-08 2003-11-12 Sap Ag Method and system for processing speech information of a dialogue
DE10220521B4 (de) * 2002-05-08 2005-11-24 Sap Ag Method and system for processing speech data and classifying conversations
DE10220524B4 (de) * 2002-05-08 2006-08-10 Sap Ag Method and system for processing speech data and for recognizing a language
EP1363271A1 (de) * 2002-05-08 2003-11-19 Sap Ag Method and system for processing and storing speech information of a dialogue
JP2004240086A (ja) * 2003-02-05 2004-08-26 Nippon Telegr & Teleph Corp <Ntt> Speech recognition reliability evaluation method and device, speech recognition reliability evaluation program, and recording medium storing the program
US7321852B2 (en) * 2003-10-28 2008-01-22 International Business Machines Corporation System and method for transcribing audio files of various languages
US8036893B2 (en) * 2004-07-22 2011-10-11 Nuance Communications, Inc. Method and system for identifying and correcting accent-induced speech recognition difficulties
US7584103B2 (en) * 2004-08-20 2009-09-01 Multimodal Technologies, Inc. Automated extraction of semantic content and generation of a structured document from speech
US7831423B2 (en) * 2006-05-25 2010-11-09 Multimodal Technologies, Inc. Replacing text representing a concept with an alternate written form of the concept
US20070299665A1 (en) 2006-06-22 2007-12-27 Detlef Koll Automatic Decision Support
US7805305B2 (en) * 2006-10-12 2010-09-28 Nuance Communications, Inc. Enhancement to Viterbi speech processing algorithm for hybrid speech models that conserves memory
US8239366B2 (en) * 2010-09-08 2012-08-07 Nuance Communications, Inc. Method and apparatus for processing spoken search queries
US8959102B2 (en) 2010-10-08 2015-02-17 Mmodal Ip Llc Structured searching of dynamic structured document corpuses
WO2018140420A1 (en) 2017-01-24 2018-08-02 Honeywell International, Inc. Voice control of an integrated room automation system
US10984329B2 (en) 2017-06-14 2021-04-20 Ademco Inc. Voice activated virtual assistant with a fused response
US20190332848A1 (en) 2018-04-27 2019-10-31 Honeywell International Inc. Facial enrollment and recognition system
US20190390866A1 (en) 2018-06-22 2019-12-26 Honeywell International Inc. Building management system with natural language interface

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0830960B2 (ja) * 1988-12-06 1996-03-27 NEC Corporation High-speed speech recognition device
JP2905674B2 (ja) * 1993-10-04 1999-06-14 ATR Interpreting Telecommunications Research Laboratories Speaker-independent continuous speech recognition method
DE4412745A1 (de) * 1994-04-14 1996-11-07 Philips Patentverwaltung Method for determining a sequence of words and arrangement for carrying out the method
JP2871557B2 (ja) * 1995-11-08 1999-03-17 ATR Interpreting Telecommunications Research Laboratories Speech recognition device
US5870706A (en) * 1996-04-10 1999-02-09 Lucent Technologies, Inc. Method and apparatus for an improved language recognition system
US5953701A (en) * 1998-01-22 1999-09-14 International Business Machines Corporation Speech recognition models combining gender-dependent and gender-independent phone states and using phonetic-context-dependence
GB9802836D0 (en) * 1998-02-10 1998-04-08 Canon Kk Pattern matching method and apparatus
US6233559B1 (en) * 1998-04-01 2001-05-15 Motorola, Inc. Speech control of multiple applications using applets
US6502072B2 (en) * 1998-11-20 2002-12-31 Microsoft Corporation Two-tier noise rejection in speech recognition
EP1055228A1 (en) * 1998-12-17 2000-11-29 ScanSoft, Inc. Speech operated automatic inquiry system
US6526380B1 (en) * 1999-03-26 2003-02-25 Koninklijke Philips Electronics N.V. Speech recognition system having parallel large vocabulary recognition engines
JP2001051690A (ja) * 1999-08-16 2001-02-23 Nec Corp Pattern recognition device

Also Published As

Publication number Publication date
DE60023736T2 (de) 2006-08-10
WO2001041126A1 (fr) 2001-06-07
JP2003515778A (ja) 2003-05-07
CN1254787C (zh) 2006-05-03
AU2181601A (en) 2001-06-12
CN1402868A (zh) 2003-03-12
EP1234303B1 (fr) 2005-11-02
DE60023736D1 (de) 2005-12-08
US20030093272A1 (en) 2003-05-15
EP1234303A1 (fr) 2002-08-28

Similar Documents

Publication Publication Date Title
MXPA02005387A (es) Process and device for speech recognition using disjoint language models.
Kanthak et al. Context-dependent acoustic modeling using graphemes for large vocabulary speech recognition
US7280968B2 (en) Synthetically generated speech responses including prosodic characteristics of speech inputs
EP1629464A4 (en) LANGUAGE RECOGNITION SYSTEM AND PHONETIC BASIC PROCEDURE
EP0977174A3 (en) Search optimization system and method for continuous speech recognition
DE69427083D1 (de) Speech recognition system for multiple languages
WO2007118100A3 (en) Automatic language model update
MX9505299A (es) Systems, methods and articles of manufacture for performing high-resolution N-best string hypothesization.
ATE183010T1 (de) Microsegment-based speech synthesis method
US20030154080A1 (en) Method and apparatus for modification of audio input to a data processing system
DE69827667D1 (de) Vocoder-based speech recognizer
JP4684409B2 (ja) Speech recognition method and speech recognition device
US20020010575A1 (en) Method and system for the automatic segmentation of an audio stream into semantic or syntactic units
WO1996000962A3 (en) Method and device for adapting a speech recognition equipment for dialectal variations in a language
US10143027B1 (en) Device selection for routing of communications
WO2004012183A3 (en) Concatenative text-to-speech conversion
ATE172317T1 (de) Speech conversion method
Price et al. Combining linguistic with statistical methods in modeling prosody
Geutner et al. Transcribing multilingual broadcast news using hypothesis driven lexical adaptation
Lööf et al. Evaluation of automatic transcription systems for the judicial domain
Rahul et al. Design of Manipuri keywords spotting system using HMM
US10674552B1 (en) Routing of communications to a device
ES2169572T3 (es) Speech recognition method using a grammar.
Álvarez et al. Improving a long audio aligner through phone-relatedness matrices for english, spanish and basque
JPH10133678A (ja) Speech reproduction device

Legal Events

Date Code Title Description
FG Grant or registration