DE69709539T2 - Verfahren und system zur erkennung eines gesprochenen textes - Google Patents
Verfahren und system zur erkennung eines gesprochenen textesInfo
- Publication number
- DE69709539T2 DE69709539T2 DE69709539T DE69709539T DE69709539T2 DE 69709539 T2 DE69709539 T2 DE 69709539T2 DE 69709539 T DE69709539 T DE 69709539T DE 69709539 T DE69709539 T DE 69709539T DE 69709539 T2 DE69709539 T2 DE 69709539T2
- Authority
- DE
- Germany
- Prior art keywords
- data
- text
- speech recognition
- language model
- depending
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1815—Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP96890151 | 1996-09-27 | ||
PCT/IB1997/000833 WO1998013822A1 (en) | 1996-09-27 | 1997-07-04 | Method of and system for recognizing a spoken text |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69709539D1 DE69709539D1 (de) | 2002-02-14 |
DE69709539T2 true DE69709539T2 (de) | 2002-08-29 |
Family
ID=8226210
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE69709539T Expired - Lifetime DE69709539T2 (de) | 1996-09-27 | 1997-07-04 | Verfahren und system zur erkennung eines gesprochenen textes |
Country Status (7)
Country | Link |
---|---|
US (1) | US6101467A (de) |
EP (1) | EP0865651B1 (de) |
JP (1) | JP4339931B2 (de) |
KR (1) | KR100453021B1 (de) |
AT (1) | ATE211847T1 (de) |
DE (1) | DE69709539T2 (de) |
WO (1) | WO1998013822A1 (de) |
Families Citing this family (49)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SE514872C2 (sv) * | 1998-09-09 | 2001-05-07 | Sandvik Ab | Skär för spårsvarvning |
JP2001100781A (ja) | 1999-09-30 | 2001-04-13 | Sony Corp | 音声処理装置および音声処理方法、並びに記録媒体 |
US6925436B1 (en) * | 2000-01-28 | 2005-08-02 | International Business Machines Corporation | Indexing with translation model for feature regularization |
US7146321B2 (en) * | 2001-10-31 | 2006-12-05 | Dictaphone Corporation | Distributed speech recognition system |
US7133829B2 (en) * | 2001-10-31 | 2006-11-07 | Dictaphone Corporation | Dynamic insertion of a speech recognition engine within a distributed speech recognition system |
US6785654B2 (en) * | 2001-11-30 | 2004-08-31 | Dictaphone Corporation | Distributed speech recognition system with speech recognition engines offering multiple functionalities |
US20030115169A1 (en) * | 2001-12-17 | 2003-06-19 | Hongzhuan Ye | System and method for management of transcribed documents |
US6990445B2 (en) * | 2001-12-17 | 2006-01-24 | Xl8 Systems, Inc. | System and method for speech recognition and transcription |
DE10208466A1 (de) * | 2002-02-27 | 2004-01-29 | BSH Bosch und Siemens Hausgeräte GmbH | Elektrisches Haushaltsgerät |
US20030167174A1 (en) * | 2002-03-01 | 2003-09-04 | Koninlijke Philips Electronics N.V. | Automatic audio recorder-player and operating method therefor |
US7292975B2 (en) * | 2002-05-01 | 2007-11-06 | Nuance Communications, Inc. | Systems and methods for evaluating speaker suitability for automatic speech recognition aided transcription |
US7236931B2 (en) | 2002-05-01 | 2007-06-26 | Usb Ag, Stamford Branch | Systems and methods for automatic acoustic speaker adaptation in computer-assisted transcription systems |
EP1363271A1 (de) * | 2002-05-08 | 2003-11-19 | Sap Ag | Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs |
DE10220522B4 (de) * | 2002-05-08 | 2005-11-17 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten mittels Spracherkennung und Frequenzanalyse |
DE10220524B4 (de) * | 2002-05-08 | 2006-08-10 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache |
DE10220520A1 (de) * | 2002-05-08 | 2003-11-20 | Sap Ag | Verfahren zur Erkennung von Sprachinformation |
DE10220521B4 (de) * | 2002-05-08 | 2005-11-24 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten und Klassifizierung von Gesprächen |
US7895039B2 (en) * | 2005-02-04 | 2011-02-22 | Vocollect, Inc. | Methods and systems for optimizing model adaptation for a speech recognition system |
US8200495B2 (en) | 2005-02-04 | 2012-06-12 | Vocollect, Inc. | Methods and systems for considering information about an expected response when performing speech recognition |
US7949533B2 (en) * | 2005-02-04 | 2011-05-24 | Vococollect, Inc. | Methods and systems for assessing and improving the performance of a speech recognition system |
US7827032B2 (en) | 2005-02-04 | 2010-11-02 | Vocollect, Inc. | Methods and systems for adapting a model for a speech recognition system |
US7865362B2 (en) | 2005-02-04 | 2011-01-04 | Vocollect, Inc. | Method and system for considering information about an expected response when performing speech recognition |
ES2237345B1 (es) * | 2005-02-28 | 2006-06-16 | Prous Institute For Biomedical Research S.A. | Procedimiento de conversion de fonemas a texto escrito y sistema informatico y programa informatico correspondientes. |
US8032372B1 (en) | 2005-09-13 | 2011-10-04 | Escription, Inc. | Dictation selection |
US7756708B2 (en) * | 2006-04-03 | 2010-07-13 | Google Inc. | Automatic language model update |
US20080221900A1 (en) * | 2007-03-07 | 2008-09-11 | Cerra Joseph P | Mobile local search environment speech processing facility |
US20110054900A1 (en) * | 2007-03-07 | 2011-03-03 | Phillips Michael S | Hybrid command and control between resident and remote speech recognition facilities in a mobile voice-to-speech application |
US10056077B2 (en) | 2007-03-07 | 2018-08-21 | Nuance Communications, Inc. | Using speech recognition results based on an unstructured language model with a music system |
US20110054897A1 (en) * | 2007-03-07 | 2011-03-03 | Phillips Michael S | Transmitting signal quality information in mobile dictation application |
US20080312934A1 (en) * | 2007-03-07 | 2008-12-18 | Cerra Joseph P | Using results of unstructured language model based speech recognition to perform an action on a mobile communications facility |
US8886545B2 (en) | 2007-03-07 | 2014-11-11 | Vlingo Corporation | Dealing with switch latency in speech recognition |
US8886540B2 (en) | 2007-03-07 | 2014-11-11 | Vlingo Corporation | Using speech recognition results based on an unstructured language model in a mobile communication facility application |
US8635243B2 (en) | 2007-03-07 | 2014-01-21 | Research In Motion Limited | Sending a communications header with voice recording to send metadata for use in speech recognition, formatting, and search mobile search application |
US8949130B2 (en) | 2007-03-07 | 2015-02-03 | Vlingo Corporation | Internal and external speech recognition use with a mobile communication facility |
US8838457B2 (en) | 2007-03-07 | 2014-09-16 | Vlingo Corporation | Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility |
US20080288252A1 (en) * | 2007-03-07 | 2008-11-20 | Cerra Joseph P | Speech recognition of speech recorded by a mobile communication facility |
US20080221884A1 (en) * | 2007-03-07 | 2008-09-11 | Cerra Joseph P | Mobile environment speech processing facility |
US8949266B2 (en) | 2007-03-07 | 2015-02-03 | Vlingo Corporation | Multiple web-based content category searching in mobile search application |
US20110054894A1 (en) * | 2007-03-07 | 2011-03-03 | Phillips Michael S | Speech recognition through the collection of contact information in mobile dictation application |
TWI319563B (en) * | 2007-05-31 | 2010-01-11 | Cyberon Corp | Method and module for improving personal speech recognition capability |
US9128981B1 (en) | 2008-07-29 | 2015-09-08 | James L. Geer | Phone assisted ‘photographic memory’ |
US8379801B2 (en) | 2009-11-24 | 2013-02-19 | Sorenson Communications, Inc. | Methods and systems related to text caption error correction |
KR20120046627A (ko) * | 2010-11-02 | 2012-05-10 | 삼성전자주식회사 | 화자 적응 방법 및 장치 |
KR101197010B1 (ko) | 2011-03-30 | 2012-11-05 | 포항공과대학교 산학협력단 | 음성 처리 장치 및 방법 |
US8914290B2 (en) | 2011-05-20 | 2014-12-16 | Vocollect, Inc. | Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment |
US9978395B2 (en) | 2013-03-15 | 2018-05-22 | Vocollect, Inc. | Method and system for mitigating delay in receiving audio stream during production of sound from audio stream |
US9558747B2 (en) * | 2014-12-10 | 2017-01-31 | Honeywell International Inc. | High intelligibility voice announcement system |
US10714121B2 (en) | 2016-07-27 | 2020-07-14 | Vocollect, Inc. | Distinguishing user speech from background speech in speech-dense environments |
US9741337B1 (en) * | 2017-04-03 | 2017-08-22 | Green Key Technologies Llc | Adaptive self-trained computer engines with associated databases and methods of use thereof |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5315689A (en) * | 1988-05-27 | 1994-05-24 | Kabushiki Kaisha Toshiba | Speech recognition system having word-based and phoneme-based recognition means |
AT390685B (de) * | 1988-10-25 | 1990-06-11 | Philips Nv | System zur textverarbeitung |
US5027406A (en) * | 1988-12-06 | 1991-06-25 | Dragon Systems, Inc. | Method for interactive speech recognition and training |
AT391035B (de) * | 1988-12-07 | 1990-08-10 | Philips Nv | System zur spracherkennung |
US5983179A (en) * | 1992-11-13 | 1999-11-09 | Dragon Systems, Inc. | Speech recognition system which turns its voice response on for confirmation when it has been turned off without confirmation |
US5615296A (en) * | 1993-11-12 | 1997-03-25 | International Business Machines Corporation | Continuous speech recognition and voice response system and method to enable conversational dialogues with microprocessors |
JP2692581B2 (ja) * | 1994-06-07 | 1997-12-17 | 日本電気株式会社 | 音響カテゴリ平均値計算装置及び適応化装置 |
US5787230A (en) * | 1994-12-09 | 1998-07-28 | Lee; Lin-Shan | System and method of intelligent Mandarin speech input for Chinese computers |
DE69517705T2 (de) * | 1995-11-04 | 2000-11-23 | Ibm | Verfahren und vorrichtung zur anpassung der grösse eines sprachmodells in einem spracherkennungssystem |
US5857099A (en) * | 1996-09-27 | 1999-01-05 | Allvoice Computing Plc | Speech-to-text dictation system with audio message capability |
US5864805A (en) * | 1996-12-20 | 1999-01-26 | International Business Machines Corporation | Method and apparatus for error correction in a continuous dictation system |
-
1997
- 1997-07-04 DE DE69709539T patent/DE69709539T2/de not_active Expired - Lifetime
- 1997-07-04 JP JP51543998A patent/JP4339931B2/ja not_active Expired - Lifetime
- 1997-07-04 KR KR10-1998-0703881A patent/KR100453021B1/ko not_active IP Right Cessation
- 1997-07-04 AT AT97927323T patent/ATE211847T1/de not_active IP Right Cessation
- 1997-07-04 WO PCT/IB1997/000833 patent/WO1998013822A1/en active IP Right Grant
- 1997-07-04 EP EP97927323A patent/EP0865651B1/de not_active Expired - Lifetime
- 1997-09-29 US US08/939,548 patent/US6101467A/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
US6101467A (en) | 2000-08-08 |
JP4339931B2 (ja) | 2009-10-07 |
JP2000502470A (ja) | 2000-02-29 |
ATE211847T1 (de) | 2002-01-15 |
KR100453021B1 (ko) | 2005-04-08 |
WO1998013822A1 (en) | 1998-04-02 |
EP0865651B1 (de) | 2002-01-09 |
DE69709539D1 (de) | 2002-02-14 |
EP0865651A1 (de) | 1998-09-23 |
KR19990071605A (ko) | 1999-09-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE211847T1 (de) | Verfahren und system zur erkennung eines gesprochenen textes | |
EP2126900B1 (de) | Verfahren und system zur erstellung von einträgen in einem spracherkennungs-lexikon | |
CN1183510C (zh) | 根据基音信息识别声调语言的方法与设备 | |
ATE325413T1 (de) | Verfahren und vorrichtung zur wandlung gesprochener in geschriebene texte und korrektur der erkannten texte | |
ATE297588T1 (de) | Anpassung des phonetischen kontextes zur verbesserung der spracherkennung | |
SE500277C2 (sv) | Anordning för att öka talförståelsen vid översätttning av tal från ett första språk till ett andra språk | |
JPS6466698A (en) | Voice recognition equipment | |
AU2002233237A1 (en) | Mobile terminal controllable by spoken utterances | |
EP0664535A3 (de) | Spracherkennungssystem für zusammenhängende Sätze mit grossem Wortschatz sowie Verfahren zur Sprachdarstellung mittels evolutionärer Grammatik als kontextfreie Grammatik. | |
EP1022722A3 (de) | Sprecheradaptation auf der Basis von Stimm-Eigenvektoren | |
WO1996023298A3 (en) | System amd method for generating and using context dependent sub-syllable models to recognize a tonal language | |
MX9505299A (es) | Sistemas, metodos y articulos de fabricacion para realizar la hipotesizacion de n-cadenas optimas de alta resolucion. | |
JPH10198396A (ja) | ユーザが定義したフレーズの話者に依存しない認識方法及びシステム | |
GB2309563A (en) | Information processing system | |
WO1999034353A1 (en) | Feedback modification for accent reduction | |
JPH10504404A (ja) | 音声認識のための方法および装置 | |
AU2002233238A1 (en) | Mobile terminal controllable by spoken utterances | |
EP0071716B1 (de) | Allophonvokoder | |
ATE216118T1 (de) | Verfahren zur automatischen erkennung eines gesprochenen textes | |
Seresangtakul et al. | Analysis of pitch contour of Thai tone using Fujisaki's model | |
WO2004008433A3 (en) | System and method for mandarin chinese speech recognition using an optimized phone set | |
JPH01202798A (ja) | 音声認識方法 | |
ATE378673T1 (de) | System und verfahren zur sprecherunabhängigen echtzeitspracherkennung | |
KR100322202B1 (ko) | 신경망을 이용한 음성인식장치 및 그 방법 | |
WO1994002936A1 (en) | Voice recognition apparatus and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition | ||
8327 | Change in the person/name/address of the patent owner |
Owner name: NUANCE COMMUNICATIONS AUSTRIA GMBH, WIEN, AT |
|
8328 | Change in the person/name/address of the agent |
Representative=s name: VOSSIUS & PARTNER, 81675 MUENCHEN |