DE69709539T2 - Verfahren und system zur erkennung eines gesprochenen textes - Google Patents

Verfahren und system zur erkennung eines gesprochenen textes

Info

Publication number
DE69709539T2
DE69709539T2 DE69709539T DE69709539T DE69709539T2 DE 69709539 T2 DE69709539 T2 DE 69709539T2 DE 69709539 T DE69709539 T DE 69709539T DE 69709539 T DE69709539 T DE 69709539T DE 69709539 T2 DE69709539 T2 DE 69709539T2
Authority
DE
Germany
Prior art keywords
data
text
speech recognition
language model
depending
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69709539T
Other languages
English (en)
Other versions
DE69709539D1 (de
Inventor
Heinrich Bartosik
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nuance Communications Austria GmbH
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV filed Critical Koninklijke Philips Electronics NV
Application granted granted Critical
Publication of DE69709539D1 publication Critical patent/DE69709539D1/de
Publication of DE69709539T2 publication Critical patent/DE69709539T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1815Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
DE69709539T 1996-09-27 1997-07-04 Verfahren und system zur erkennung eines gesprochenen textes Expired - Lifetime DE69709539T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP96890151 1996-09-27
PCT/IB1997/000833 WO1998013822A1 (en) 1996-09-27 1997-07-04 Method of and system for recognizing a spoken text

Publications (2)

Publication Number Publication Date
DE69709539D1 DE69709539D1 (de) 2002-02-14
DE69709539T2 true DE69709539T2 (de) 2002-08-29

Family

ID=8226210

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69709539T Expired - Lifetime DE69709539T2 (de) 1996-09-27 1997-07-04 Verfahren und system zur erkennung eines gesprochenen textes

Country Status (7)

Country Link
US (1) US6101467A (de)
EP (1) EP0865651B1 (de)
JP (1) JP4339931B2 (de)
KR (1) KR100453021B1 (de)
AT (1) ATE211847T1 (de)
DE (1) DE69709539T2 (de)
WO (1) WO1998013822A1 (de)

Families Citing this family (49)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE514872C2 (sv) * 1998-09-09 2001-05-07 Sandvik Ab Skär för spårsvarvning
JP2001100781A (ja) 1999-09-30 2001-04-13 Sony Corp 音声処理装置および音声処理方法、並びに記録媒体
US6925436B1 (en) * 2000-01-28 2005-08-02 International Business Machines Corporation Indexing with translation model for feature regularization
US7146321B2 (en) * 2001-10-31 2006-12-05 Dictaphone Corporation Distributed speech recognition system
US7133829B2 (en) * 2001-10-31 2006-11-07 Dictaphone Corporation Dynamic insertion of a speech recognition engine within a distributed speech recognition system
US6785654B2 (en) * 2001-11-30 2004-08-31 Dictaphone Corporation Distributed speech recognition system with speech recognition engines offering multiple functionalities
US20030115169A1 (en) * 2001-12-17 2003-06-19 Hongzhuan Ye System and method for management of transcribed documents
US6990445B2 (en) * 2001-12-17 2006-01-24 Xl8 Systems, Inc. System and method for speech recognition and transcription
DE10208466A1 (de) * 2002-02-27 2004-01-29 BSH Bosch und Siemens Hausgeräte GmbH Elektrisches Haushaltsgerät
US20030167174A1 (en) * 2002-03-01 2003-09-04 Koninlijke Philips Electronics N.V. Automatic audio recorder-player and operating method therefor
US7292975B2 (en) * 2002-05-01 2007-11-06 Nuance Communications, Inc. Systems and methods for evaluating speaker suitability for automatic speech recognition aided transcription
US7236931B2 (en) 2002-05-01 2007-06-26 Usb Ag, Stamford Branch Systems and methods for automatic acoustic speaker adaptation in computer-assisted transcription systems
EP1363271A1 (de) * 2002-05-08 2003-11-19 Sap Ag Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs
DE10220522B4 (de) * 2002-05-08 2005-11-17 Sap Ag Verfahren und System zur Verarbeitung von Sprachdaten mittels Spracherkennung und Frequenzanalyse
DE10220524B4 (de) * 2002-05-08 2006-08-10 Sap Ag Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache
DE10220520A1 (de) * 2002-05-08 2003-11-20 Sap Ag Verfahren zur Erkennung von Sprachinformation
DE10220521B4 (de) * 2002-05-08 2005-11-24 Sap Ag Verfahren und System zur Verarbeitung von Sprachdaten und Klassifizierung von Gesprächen
US7895039B2 (en) * 2005-02-04 2011-02-22 Vocollect, Inc. Methods and systems for optimizing model adaptation for a speech recognition system
US8200495B2 (en) 2005-02-04 2012-06-12 Vocollect, Inc. Methods and systems for considering information about an expected response when performing speech recognition
US7949533B2 (en) * 2005-02-04 2011-05-24 Vococollect, Inc. Methods and systems for assessing and improving the performance of a speech recognition system
US7827032B2 (en) 2005-02-04 2010-11-02 Vocollect, Inc. Methods and systems for adapting a model for a speech recognition system
US7865362B2 (en) 2005-02-04 2011-01-04 Vocollect, Inc. Method and system for considering information about an expected response when performing speech recognition
ES2237345B1 (es) * 2005-02-28 2006-06-16 Prous Institute For Biomedical Research S.A. Procedimiento de conversion de fonemas a texto escrito y sistema informatico y programa informatico correspondientes.
US8032372B1 (en) 2005-09-13 2011-10-04 Escription, Inc. Dictation selection
US7756708B2 (en) * 2006-04-03 2010-07-13 Google Inc. Automatic language model update
US20080221900A1 (en) * 2007-03-07 2008-09-11 Cerra Joseph P Mobile local search environment speech processing facility
US20110054900A1 (en) * 2007-03-07 2011-03-03 Phillips Michael S Hybrid command and control between resident and remote speech recognition facilities in a mobile voice-to-speech application
US10056077B2 (en) 2007-03-07 2018-08-21 Nuance Communications, Inc. Using speech recognition results based on an unstructured language model with a music system
US20110054897A1 (en) * 2007-03-07 2011-03-03 Phillips Michael S Transmitting signal quality information in mobile dictation application
US20080312934A1 (en) * 2007-03-07 2008-12-18 Cerra Joseph P Using results of unstructured language model based speech recognition to perform an action on a mobile communications facility
US8886545B2 (en) 2007-03-07 2014-11-11 Vlingo Corporation Dealing with switch latency in speech recognition
US8886540B2 (en) 2007-03-07 2014-11-11 Vlingo Corporation Using speech recognition results based on an unstructured language model in a mobile communication facility application
US8635243B2 (en) 2007-03-07 2014-01-21 Research In Motion Limited Sending a communications header with voice recording to send metadata for use in speech recognition, formatting, and search mobile search application
US8949130B2 (en) 2007-03-07 2015-02-03 Vlingo Corporation Internal and external speech recognition use with a mobile communication facility
US8838457B2 (en) 2007-03-07 2014-09-16 Vlingo Corporation Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility
US20080288252A1 (en) * 2007-03-07 2008-11-20 Cerra Joseph P Speech recognition of speech recorded by a mobile communication facility
US20080221884A1 (en) * 2007-03-07 2008-09-11 Cerra Joseph P Mobile environment speech processing facility
US8949266B2 (en) 2007-03-07 2015-02-03 Vlingo Corporation Multiple web-based content category searching in mobile search application
US20110054894A1 (en) * 2007-03-07 2011-03-03 Phillips Michael S Speech recognition through the collection of contact information in mobile dictation application
TWI319563B (en) * 2007-05-31 2010-01-11 Cyberon Corp Method and module for improving personal speech recognition capability
US9128981B1 (en) 2008-07-29 2015-09-08 James L. Geer Phone assisted ‘photographic memory’
US8379801B2 (en) 2009-11-24 2013-02-19 Sorenson Communications, Inc. Methods and systems related to text caption error correction
KR20120046627A (ko) * 2010-11-02 2012-05-10 삼성전자주식회사 화자 적응 방법 및 장치
KR101197010B1 (ko) 2011-03-30 2012-11-05 포항공과대학교 산학협력단 음성 처리 장치 및 방법
US8914290B2 (en) 2011-05-20 2014-12-16 Vocollect, Inc. Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US9978395B2 (en) 2013-03-15 2018-05-22 Vocollect, Inc. Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
US9558747B2 (en) * 2014-12-10 2017-01-31 Honeywell International Inc. High intelligibility voice announcement system
US10714121B2 (en) 2016-07-27 2020-07-14 Vocollect, Inc. Distinguishing user speech from background speech in speech-dense environments
US9741337B1 (en) * 2017-04-03 2017-08-22 Green Key Technologies Llc Adaptive self-trained computer engines with associated databases and methods of use thereof

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5315689A (en) * 1988-05-27 1994-05-24 Kabushiki Kaisha Toshiba Speech recognition system having word-based and phoneme-based recognition means
AT390685B (de) * 1988-10-25 1990-06-11 Philips Nv System zur textverarbeitung
US5027406A (en) * 1988-12-06 1991-06-25 Dragon Systems, Inc. Method for interactive speech recognition and training
AT391035B (de) * 1988-12-07 1990-08-10 Philips Nv System zur spracherkennung
US5983179A (en) * 1992-11-13 1999-11-09 Dragon Systems, Inc. Speech recognition system which turns its voice response on for confirmation when it has been turned off without confirmation
US5615296A (en) * 1993-11-12 1997-03-25 International Business Machines Corporation Continuous speech recognition and voice response system and method to enable conversational dialogues with microprocessors
JP2692581B2 (ja) * 1994-06-07 1997-12-17 日本電気株式会社 音響カテゴリ平均値計算装置及び適応化装置
US5787230A (en) * 1994-12-09 1998-07-28 Lee; Lin-Shan System and method of intelligent Mandarin speech input for Chinese computers
DE69517705T2 (de) * 1995-11-04 2000-11-23 Ibm Verfahren und vorrichtung zur anpassung der grösse eines sprachmodells in einem spracherkennungssystem
US5857099A (en) * 1996-09-27 1999-01-05 Allvoice Computing Plc Speech-to-text dictation system with audio message capability
US5864805A (en) * 1996-12-20 1999-01-26 International Business Machines Corporation Method and apparatus for error correction in a continuous dictation system

Also Published As

Publication number Publication date
US6101467A (en) 2000-08-08
JP4339931B2 (ja) 2009-10-07
JP2000502470A (ja) 2000-02-29
ATE211847T1 (de) 2002-01-15
KR100453021B1 (ko) 2005-04-08
WO1998013822A1 (en) 1998-04-02
EP0865651B1 (de) 2002-01-09
DE69709539D1 (de) 2002-02-14
EP0865651A1 (de) 1998-09-23
KR19990071605A (ko) 1999-09-27

Similar Documents

Publication Publication Date Title
ATE211847T1 (de) Verfahren und system zur erkennung eines gesprochenen textes
EP2126900B1 (de) Verfahren und system zur erstellung von einträgen in einem spracherkennungs-lexikon
CN1183510C (zh) 根据基音信息识别声调语言的方法与设备
ATE325413T1 (de) Verfahren und vorrichtung zur wandlung gesprochener in geschriebene texte und korrektur der erkannten texte
ATE297588T1 (de) Anpassung des phonetischen kontextes zur verbesserung der spracherkennung
SE500277C2 (sv) Anordning för att öka talförståelsen vid översätttning av tal från ett första språk till ett andra språk
JPS6466698A (en) Voice recognition equipment
AU2002233237A1 (en) Mobile terminal controllable by spoken utterances
EP0664535A3 (de) Spracherkennungssystem für zusammenhängende Sätze mit grossem Wortschatz sowie Verfahren zur Sprachdarstellung mittels evolutionärer Grammatik als kontextfreie Grammatik.
EP1022722A3 (de) Sprecheradaptation auf der Basis von Stimm-Eigenvektoren
WO1996023298A3 (en) System amd method for generating and using context dependent sub-syllable models to recognize a tonal language
MX9505299A (es) Sistemas, metodos y articulos de fabricacion para realizar la hipotesizacion de n-cadenas optimas de alta resolucion.
JPH10198396A (ja) ユーザが定義したフレーズの話者に依存しない認識方法及びシステム
GB2309563A (en) Information processing system
WO1999034353A1 (en) Feedback modification for accent reduction
JPH10504404A (ja) 音声認識のための方法および装置
AU2002233238A1 (en) Mobile terminal controllable by spoken utterances
EP0071716B1 (de) Allophonvokoder
ATE216118T1 (de) Verfahren zur automatischen erkennung eines gesprochenen textes
Seresangtakul et al. Analysis of pitch contour of Thai tone using Fujisaki's model
WO2004008433A3 (en) System and method for mandarin chinese speech recognition using an optimized phone set
JPH01202798A (ja) 音声認識方法
ATE378673T1 (de) System und verfahren zur sprecherunabhängigen echtzeitspracherkennung
KR100322202B1 (ko) 신경망을 이용한 음성인식장치 및 그 방법
WO1994002936A1 (en) Voice recognition apparatus and method

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8327 Change in the person/name/address of the patent owner

Owner name: NUANCE COMMUNICATIONS AUSTRIA GMBH, WIEN, AT

8328 Change in the person/name/address of the agent

Representative=s name: VOSSIUS & PARTNER, 81675 MUENCHEN