MXPA02005387A - Proceso y dispositivo para reconocimiento de voz que utiliza modelos de lenguaje desarticulados. - Google Patents
Proceso y dispositivo para reconocimiento de voz que utiliza modelos de lenguaje desarticulados.Info
- Publication number
- MXPA02005387A MXPA02005387A MXPA02005387A MXPA02005387A MXPA02005387A MX PA02005387 A MXPA02005387 A MX PA02005387A MX PA02005387 A MXPA02005387 A MX PA02005387A MX PA02005387 A MXPA02005387 A MX PA02005387A MX PA02005387 A MXPA02005387 A MX PA02005387A
- Authority
- MX
- Mexico
- Prior art keywords
- disjoint
- speech recognition
- language models
- decoding step
- determining
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/085—Methods for reducing search complexity, pruning
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Document Processing Apparatus (AREA)
Abstract
El objetivo de la invencion es un proceso para reconocimiento de voz que comprende un paso de adquirir una senal acustica, un paso de descodificacion acustica-fonetica y un paso de descodificacion lingüistica. De acuerdo a la invencion, la descodificacion lingüistica comprende los pasos de: aplicacion desarticulada de una pluralidad de modelos de lenguaje para el analisis de una secuencia de audio para la determinacion de una pluralidad de secuencias de palabras candidatas; determinacion por un motor de busqueda de la secuencia mas probable de palabras de entre las secuencias candidatas. El objetivo de la invencion es ademas un dispositivo para la implementacion del proceso.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR9915189 | 1999-12-02 | ||
PCT/FR2000/003356 WO2001041126A1 (fr) | 1999-12-02 | 2000-12-01 | Procede et dispositif de reconnaissance vocale a modeles de langage disjoints |
Publications (1)
Publication Number | Publication Date |
---|---|
MXPA02005387A true MXPA02005387A (es) | 2004-04-21 |
Family
ID=9552792
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MXPA02005387A MXPA02005387A (es) | 1999-12-02 | 2000-12-01 | Proceso y dispositivo para reconocimiento de voz que utiliza modelos de lenguaje desarticulados. |
Country Status (8)
Country | Link |
---|---|
US (1) | US20030093272A1 (es) |
EP (1) | EP1234303B1 (es) |
JP (1) | JP2003515778A (es) |
CN (1) | CN1254787C (es) |
AU (1) | AU2181601A (es) |
DE (1) | DE60023736T2 (es) |
MX (1) | MXPA02005387A (es) |
WO (1) | WO2001041126A1 (es) |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3426176B2 (ja) * | 1999-12-27 | 2003-07-14 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 音声認識装置、方法、コンピュータ・システム及び記憶媒体 |
DE10024895A1 (de) * | 2000-05-19 | 2001-11-22 | Thomson Brandt Gmbh | System zur Bedienung eines Gerätes der Unterhaltungselektronik |
US7395205B2 (en) * | 2001-02-13 | 2008-07-01 | International Business Machines Corporation | Dynamic language model mixtures with history-based buckets |
DE10220522B4 (de) * | 2002-05-08 | 2005-11-17 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten mittels Spracherkennung und Frequenzanalyse |
EP1361740A1 (de) * | 2002-05-08 | 2003-11-12 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachinformationen eines Dialogs |
DE10220521B4 (de) * | 2002-05-08 | 2005-11-24 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten und Klassifizierung von Gesprächen |
DE10220524B4 (de) * | 2002-05-08 | 2006-08-10 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache |
EP1363271A1 (de) * | 2002-05-08 | 2003-11-19 | Sap Ag | Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs |
JP2004240086A (ja) * | 2003-02-05 | 2004-08-26 | Nippon Telegr & Teleph Corp <Ntt> | 音声認識信頼性評価方法、装置、音声認識信頼性評価プログラム及びこのプログラムを記録した記録媒体 |
US7321852B2 (en) * | 2003-10-28 | 2008-01-22 | International Business Machines Corporation | System and method for transcribing audio files of various languages |
US8036893B2 (en) * | 2004-07-22 | 2011-10-11 | Nuance Communications, Inc. | Method and system for identifying and correcting accent-induced speech recognition difficulties |
US7584103B2 (en) * | 2004-08-20 | 2009-09-01 | Multimodal Technologies, Inc. | Automated extraction of semantic content and generation of a structured document from speech |
US7831423B2 (en) * | 2006-05-25 | 2010-11-09 | Multimodal Technologies, Inc. | Replacing text representing a concept with an alternate written form of the concept |
US20070299665A1 (en) | 2006-06-22 | 2007-12-27 | Detlef Koll | Automatic Decision Support |
US7805305B2 (en) * | 2006-10-12 | 2010-09-28 | Nuance Communications, Inc. | Enhancement to Viterbi speech processing algorithm for hybrid speech models that conserves memory |
US8239366B2 (en) * | 2010-09-08 | 2012-08-07 | Nuance Communications, Inc. | Method and apparatus for processing spoken search queries |
US8959102B2 (en) | 2010-10-08 | 2015-02-17 | Mmodal Ip Llc | Structured searching of dynamic structured document corpuses |
WO2018140420A1 (en) | 2017-01-24 | 2018-08-02 | Honeywell International, Inc. | Voice control of an integrated room automation system |
US10984329B2 (en) | 2017-06-14 | 2021-04-20 | Ademco Inc. | Voice activated virtual assistant with a fused response |
US20190332848A1 (en) | 2018-04-27 | 2019-10-31 | Honeywell International Inc. | Facial enrollment and recognition system |
US20190390866A1 (en) | 2018-06-22 | 2019-12-26 | Honeywell International Inc. | Building management system with natural language interface |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0830960B2 (ja) * | 1988-12-06 | 1996-03-27 | 日本電気株式会社 | 高速音声認識装置 |
JP2905674B2 (ja) * | 1993-10-04 | 1999-06-14 | 株式会社エイ・ティ・アール音声翻訳通信研究所 | 不特定話者連続音声認識方法 |
DE4412745A1 (de) * | 1994-04-14 | 1996-11-07 | Philips Patentverwaltung | Verfahren zum Ermitteln einer Folge von Wörtern und Anordnung zur Durchführung des Verfahrens |
JP2871557B2 (ja) * | 1995-11-08 | 1999-03-17 | 株式会社エイ・ティ・アール音声翻訳通信研究所 | 音声認識装置 |
US5870706A (en) * | 1996-04-10 | 1999-02-09 | Lucent Technologies, Inc. | Method and apparatus for an improved language recognition system |
US5953701A (en) * | 1998-01-22 | 1999-09-14 | International Business Machines Corporation | Speech recognition models combining gender-dependent and gender-independent phone states and using phonetic-context-dependence |
GB9802836D0 (en) * | 1998-02-10 | 1998-04-08 | Canon Kk | Pattern matching method and apparatus |
US6233559B1 (en) * | 1998-04-01 | 2001-05-15 | Motorola, Inc. | Speech control of multiple applications using applets |
US6502072B2 (en) * | 1998-11-20 | 2002-12-31 | Microsoft Corporation | Two-tier noise rejection in speech recognition |
EP1055228A1 (en) * | 1998-12-17 | 2000-11-29 | ScanSoft, Inc. | Speech operated automatic inquiry system |
US6526380B1 (en) * | 1999-03-26 | 2003-02-25 | Koninklijke Philips Electronics N.V. | Speech recognition system having parallel large vocabulary recognition engines |
JP2001051690A (ja) * | 1999-08-16 | 2001-02-23 | Nec Corp | パターン認識装置 |
-
2000
- 2000-12-01 JP JP2001542100A patent/JP2003515778A/ja active Pending
- 2000-12-01 WO PCT/FR2000/003356 patent/WO2001041126A1/fr active IP Right Grant
- 2000-12-01 EP EP00985378A patent/EP1234303B1/fr not_active Expired - Lifetime
- 2000-12-01 US US10/148,301 patent/US20030093272A1/en not_active Abandoned
- 2000-12-01 AU AU21816/01A patent/AU2181601A/en not_active Abandoned
- 2000-12-01 DE DE60023736T patent/DE60023736T2/de not_active Expired - Lifetime
- 2000-12-01 CN CNB00816567XA patent/CN1254787C/zh not_active Expired - Fee Related
- 2000-12-01 MX MXPA02005387A patent/MXPA02005387A/es active IP Right Grant
Also Published As
Publication number | Publication date |
---|---|
DE60023736T2 (de) | 2006-08-10 |
WO2001041126A1 (fr) | 2001-06-07 |
JP2003515778A (ja) | 2003-05-07 |
CN1254787C (zh) | 2006-05-03 |
AU2181601A (en) | 2001-06-12 |
CN1402868A (zh) | 2003-03-12 |
EP1234303B1 (fr) | 2005-11-02 |
DE60023736D1 (de) | 2005-12-08 |
US20030093272A1 (en) | 2003-05-15 |
EP1234303A1 (fr) | 2002-08-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MXPA02005387A (es) | Proceso y dispositivo para reconocimiento de voz que utiliza modelos de lenguaje desarticulados. | |
Kanthak et al. | Context-dependent acoustic modeling using graphemes for large vocabulary speech recognition | |
US7280968B2 (en) | Synthetically generated speech responses including prosodic characteristics of speech inputs | |
EP1629464A4 (en) | LANGUAGE RECOGNITION SYSTEM AND PHONETIC BASIC PROCEDURE | |
EP0977174A3 (en) | Search optimization system and method for continuous speech recognition | |
DE69427083D1 (de) | Spracherkennungssystem für mehrere sprachen | |
WO2007118100A3 (en) | Automatic language model update | |
MX9505299A (es) | Sistemas, metodos y articulos de fabricacion para realizar la hipotesizacion de n-cadenas optimas de alta resolucion. | |
ATE183010T1 (de) | Auf mikrosegmenten basierendes sprachsyntheseverfahren | |
US20030154080A1 (en) | Method and apparatus for modification of audio input to a data processing system | |
DE69827667D1 (de) | Vokoder basierter spracherkenner | |
JP4684409B2 (ja) | 音声認識方法及び音声認識装置 | |
US20020010575A1 (en) | Method and system for the automatic segmentation of an audio stream into semantic or syntactic units | |
WO1996000962A3 (en) | Method and device for adapting a speech recognition equipment for dialectal variations in a language | |
US10143027B1 (en) | Device selection for routing of communications | |
WO2004012183A3 (en) | Concatenative text-to-speech conversion | |
ATE172317T1 (de) | Sprachumsetzungsverfahren | |
Price et al. | Combining linguistic with statistical methods in modeling prosody | |
Geutner et al. | Transcribing multilingual broadcast news using hypothesis driven lexical adaptation | |
Lööf et al. | Evaluation of automatic transcription systems for the judicial domain | |
Rahul et al. | Design of Manipuri keywords spotting system using HMM | |
US10674552B1 (en) | Routing of communications to a device | |
ES2169572T3 (es) | Procedimiento de reconocimiento de voz empleando una gramatica. | |
Álvarez et al. | Improving a long audio aligner through phone-relatedness matrices for english, spanish and basque | |
JPH10133678A (ja) | 音声再生装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG | Grant or registration |