DE10204924A1 - Verfahren und Vorrichtung zur schnellen mustererkennungsunterstützten Transkription gesprochener und schriftlicher Äußerungen - Google Patents

Verfahren und Vorrichtung zur schnellen mustererkennungsunterstützten Transkription gesprochener und schriftlicher Äußerungen

Info

Publication number
DE10204924A1
DE10204924A1 DE10204924A DE10204924A DE10204924A1 DE 10204924 A1 DE10204924 A1 DE 10204924A1 DE 10204924 A DE10204924 A DE 10204924A DE 10204924 A DE10204924 A DE 10204924A DE 10204924 A1 DE10204924 A1 DE 10204924A1
Authority
DE
Germany
Prior art keywords
speech recognition
recognition result
transcription
text
transcribed text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
DE10204924A
Other languages
German (de)
English (en)
Inventor
Eric Thelen
Dietrich Klakow
Holger Scholl
Uli Waibel
Josef Reisinger
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Philips Intellectual Property and Standards GmbH
Original Assignee
Philips Intellectual Property and Standards GmbH
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Philips Intellectual Property and Standards GmbH filed Critical Philips Intellectual Property and Standards GmbH
Priority to DE10204924A priority Critical patent/DE10204924A1/de
Priority to EP03702838A priority patent/EP1479070B1/en
Priority to AT03702838T priority patent/ATE358869T1/de
Priority to JP2003566843A priority patent/JP2005517216A/ja
Priority to AU2003205955A priority patent/AU2003205955A1/en
Priority to PCT/IB2003/000374 priority patent/WO2003067573A1/en
Priority to US10/503,420 priority patent/US20060167685A1/en
Priority to DE60312963T priority patent/DE60312963T2/de
Publication of DE10204924A1 publication Critical patent/DE10204924A1/de
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/24Speech recognition using non-acoustical features
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/142Image acquisition using hand-held instruments; Constructional details of the instruments
    • G06V30/1423Image acquisition using hand-held instruments; Constructional details of the instruments the instrument generating sequences of position coordinates corresponding to handwriting

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Machine Translation (AREA)
  • Image Processing (AREA)
  • Radar Systems Or Details Thereof (AREA)
DE10204924A 2002-02-07 2002-02-07 Verfahren und Vorrichtung zur schnellen mustererkennungsunterstützten Transkription gesprochener und schriftlicher Äußerungen Withdrawn DE10204924A1 (de)

Priority Applications (8)

Application Number Priority Date Filing Date Title
DE10204924A DE10204924A1 (de) 2002-02-07 2002-02-07 Verfahren und Vorrichtung zur schnellen mustererkennungsunterstützten Transkription gesprochener und schriftlicher Äußerungen
EP03702838A EP1479070B1 (en) 2002-02-07 2003-01-30 Method and device for the rapid, pattern-recognition-supported transcription of spoken and written utterances
AT03702838T ATE358869T1 (de) 2002-02-07 2003-01-30 Methode und gerät zur schnellen, durch mustererkennung gestützen transkription von gesprochenen und geschriebenen äusserungen
JP2003566843A JP2005517216A (ja) 2002-02-07 2003-01-30 話されたおよび書かれたことばの高速かつパターン認識に支援された書き起こし方法および装置
AU2003205955A AU2003205955A1 (en) 2002-02-07 2003-01-30 Method and device for the rapid, pattern-recognition-supported transcription of spoken and written utterances
PCT/IB2003/000374 WO2003067573A1 (en) 2002-02-07 2003-01-30 Method and device for the rapid, pattern-recognition-supported transcription of spoken and written utterances
US10/503,420 US20060167685A1 (en) 2002-02-07 2003-01-30 Method and device for the rapid, pattern-recognition-supported transcription of spoken and written utterances
DE60312963T DE60312963T2 (de) 2002-02-07 2003-01-30 Methode und gerät zur schnellen, durch mustererkennung gestützen transkription von gesprochenen und geschriebenen äusserungen

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
DE10204924A DE10204924A1 (de) 2002-02-07 2002-02-07 Verfahren und Vorrichtung zur schnellen mustererkennungsunterstützten Transkription gesprochener und schriftlicher Äußerungen

Publications (1)

Publication Number Publication Date
DE10204924A1 true DE10204924A1 (de) 2003-08-21

Family

ID=27618362

Family Applications (2)

Application Number Title Priority Date Filing Date
DE10204924A Withdrawn DE10204924A1 (de) 2002-02-07 2002-02-07 Verfahren und Vorrichtung zur schnellen mustererkennungsunterstützten Transkription gesprochener und schriftlicher Äußerungen
DE60312963T Expired - Lifetime DE60312963T2 (de) 2002-02-07 2003-01-30 Methode und gerät zur schnellen, durch mustererkennung gestützen transkription von gesprochenen und geschriebenen äusserungen

Family Applications After (1)

Application Number Title Priority Date Filing Date
DE60312963T Expired - Lifetime DE60312963T2 (de) 2002-02-07 2003-01-30 Methode und gerät zur schnellen, durch mustererkennung gestützen transkription von gesprochenen und geschriebenen äusserungen

Country Status (7)

Country Link
US (1) US20060167685A1 (https=)
EP (1) EP1479070B1 (https=)
JP (1) JP2005517216A (https=)
AT (1) ATE358869T1 (https=)
AU (1) AU2003205955A1 (https=)
DE (2) DE10204924A1 (https=)
WO (1) WO2003067573A1 (https=)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050273337A1 (en) * 2004-06-02 2005-12-08 Adoram Erell Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition
US20070011012A1 (en) * 2005-07-11 2007-01-11 Steve Yurick Method, system, and apparatus for facilitating captioning of multi-media content
KR100654183B1 (ko) * 2005-11-07 2006-12-08 한국전자통신연구원 음성 인식을 이용한 문자 입력 시스템 및 그 방법
US9230222B2 (en) * 2008-07-23 2016-01-05 The Quantum Group, Inc. System and method enabling bi-translation for improved prescription accuracy
JP2013025299A (ja) * 2011-07-26 2013-02-04 Toshiba Corp 書き起こし支援システムおよび書き起こし支援方法
JP6165619B2 (ja) * 2013-12-13 2017-07-19 株式会社東芝 情報処理装置、情報処理方法、および情報処理プログラム
CN109285548A (zh) * 2017-07-19 2019-01-29 阿里巴巴集团控股有限公司 信息处理方法、系统、电子设备、和计算机存储介质
US11017778B1 (en) 2018-12-04 2021-05-25 Sorenson Ip Holdings, Llc Switching between speech recognition systems
US10573312B1 (en) 2018-12-04 2020-02-25 Sorenson Ip Holdings, Llc Transcription generation from multiple speech recognition systems
CN110956959B (zh) * 2019-11-25 2023-07-25 科大讯飞股份有限公司 语音识别纠错方法、相关设备及可读存储介质
US11488604B2 (en) 2020-08-19 2022-11-01 Sorenson Ip Holdings, Llc Transcription of audio

Family Cites Families (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0122880A2 (en) * 1983-04-19 1984-10-24 E.S.P. Elektronische Spezialprojekte Aktiengesellschaft Electronic apparatus for high-speed writing on electronic typewriters, printers, photocomposers, processors and the like
JPS6091435A (ja) * 1983-10-25 1985-05-22 Fujitsu Ltd 文字入力装置
JPS62229300A (ja) * 1986-03-31 1987-10-08 キヤノン株式会社 音声認識装置
US5027406A (en) * 1988-12-06 1991-06-25 Dragon Systems, Inc. Method for interactive speech recognition and training
EP0505621A3 (en) * 1991-03-28 1993-06-02 International Business Machines Corporation Improved message recognition employing integrated speech and handwriting information
US5502774A (en) * 1992-06-09 1996-03-26 International Business Machines Corporation Automatic recognition of a consistent message using multiple complimentary sources of information
JP2986345B2 (ja) * 1993-10-18 1999-12-06 インターナショナル・ビジネス・マシーンズ・コーポレイション 音声記録指標化装置及び方法
JPH0883092A (ja) * 1994-09-14 1996-03-26 Nippon Telegr & Teleph Corp <Ntt> 情報入力装置及び情報入力方法
US5818437A (en) * 1995-07-26 1998-10-06 Tegic Communications, Inc. Reduced keyboard disambiguating computer
JP3254977B2 (ja) * 1995-08-31 2002-02-12 松下電器産業株式会社 音声認識方法及び音声認識装置
US5855000A (en) * 1995-09-08 1998-12-29 Carnegie Mellon University Method and apparatus for correcting and repairing machine-transcribed input using independent or cross-modal secondary input
US5960447A (en) * 1995-11-13 1999-09-28 Holt; Douglas Word tagging and editing system for speech recognition
US6122613A (en) * 1997-01-30 2000-09-19 Dragon Systems, Inc. Speech recognition using multiple recognizers (selectively) applied to the same input sample
WO1999000790A1 (en) * 1997-06-27 1999-01-07 M.H. Segan Limited Partnership Speech recognition computer input and device
US6219453B1 (en) * 1997-08-11 2001-04-17 At&T Corp. Method and apparatus for performing an automatic correction of misrecognized words produced by an optical character recognition technique by using a Hidden Markov Model based algorithm
US6418431B1 (en) * 1998-03-30 2002-07-09 Microsoft Corporation Information retrieval and speech recognition based on language models
US6078885A (en) * 1998-05-08 2000-06-20 At&T Corp Verbal, fully automatic dictionary updates by end-users of speech synthesis and recognition systems
US6438523B1 (en) * 1998-05-20 2002-08-20 John A. Oberteuffer Processing handwritten and hand-drawn input and speech input
FI981154A7 (fi) * 1998-05-25 1999-11-26 Nokia Mobile Phones Ltd Menetelmä ja laite puheen tunnistamiseksi
JP2000056796A (ja) * 1998-08-07 2000-02-25 Asahi Chem Ind Co Ltd 音声入力装置および方法
US6457031B1 (en) * 1998-09-02 2002-09-24 International Business Machines Corp. Method of marking previously dictated text for deferred correction in a speech recognition proofreader
US6167376A (en) * 1998-12-21 2000-12-26 Ditzik; Richard Joseph Computer system with integrated telephony, handwriting and speech recognition functions
JP2000339305A (ja) * 1999-05-31 2000-12-08 Toshiba Corp 文書作成装置、及び文書作成方法
US6442518B1 (en) * 1999-07-14 2002-08-27 Compaq Information Technologies Group, L.P. Method for refining time alignments of closed captions
US6904405B2 (en) * 1999-07-17 2005-06-07 Edwin A. Suominen Message recognition using shared language model
JP2001042996A (ja) * 1999-07-28 2001-02-16 Toshiba Corp 文書作成装置、文書作成方法
US6789231B1 (en) * 1999-10-05 2004-09-07 Microsoft Corporation Method and system for providing alternatives for text derived from stochastic input sources
JP2001159896A (ja) * 1999-12-02 2001-06-12 Nec Software Okinawa Ltd 音声認識機能を利用した簡易文字入力方法
US7149970B1 (en) * 2000-06-23 2006-12-12 Microsoft Corporation Method and system for filtering and selecting from a candidate list generated by a stochastic input method
US7243069B2 (en) * 2000-07-28 2007-07-10 International Business Machines Corporation Speech recognition by automated context creation
US6836759B1 (en) * 2000-08-22 2004-12-28 Microsoft Corporation Method and system of handling the selection of alternates for recognized words
US6788815B2 (en) * 2000-11-10 2004-09-07 Microsoft Corporation System and method for accepting disparate types of user input
US20020152071A1 (en) * 2001-04-12 2002-10-17 David Chaiken Human-augmented, automatic speech recognition engine
US20020152075A1 (en) * 2001-04-16 2002-10-17 Shao-Tsu Kung Composite input method
US6839667B2 (en) * 2001-05-16 2005-01-04 International Business Machines Corporation Method of speech recognition by presenting N-best word candidates
US6996525B2 (en) * 2001-06-15 2006-02-07 Intel Corporation Selecting one of multiple speech recognizers in a system based on performance predections resulting from experience
US7058575B2 (en) * 2001-06-27 2006-06-06 Intel Corporation Integrating keyword spotting with graph decoder to improve the robustness of speech recognition
US7467089B2 (en) * 2001-09-05 2008-12-16 Roth Daniel L Combined speech and handwriting recognition
US6708148B2 (en) * 2001-10-12 2004-03-16 Koninklijke Philips Electronics N.V. Correction device to mark parts of a recognized text
US7124085B2 (en) * 2001-12-13 2006-10-17 Matsushita Electric Industrial Co., Ltd. Constraint-based speech recognition system and method
US7103542B2 (en) * 2001-12-14 2006-09-05 Ben Franklin Patent Holding Llc Automatically improving a voice recognition system
US20030112277A1 (en) * 2001-12-14 2003-06-19 Koninklijke Philips Electronics N.V. Input of data using a combination of data input systems
US6986106B2 (en) * 2002-05-13 2006-01-10 Microsoft Corporation Correction widget
US7137076B2 (en) * 2002-07-30 2006-11-14 Microsoft Corporation Correcting recognition results associated with user input
US7228275B1 (en) * 2002-10-21 2007-06-05 Toyota Infotechnology Center Co., Ltd. Speech recognition system having multiple speech recognizers

Also Published As

Publication number Publication date
WO2003067573A1 (en) 2003-08-14
DE60312963T2 (de) 2007-12-13
EP1479070B1 (en) 2007-04-04
ATE358869T1 (de) 2007-04-15
DE60312963D1 (de) 2007-05-16
AU2003205955A1 (en) 2003-09-02
EP1479070A1 (en) 2004-11-24
JP2005517216A (ja) 2005-06-09
US20060167685A1 (en) 2006-07-27

Similar Documents

Publication Publication Date Title
DE60207742T2 (de) Korrektur eines von einer spracherkennung erkannten textes mittels vergleich der phonemfolgen des erkannten textes mit einer phonetischen transkription eines manuell eingegebenen korrekturwortes
DE69914131T2 (de) Positionshandhabung bei der Spracherkennung
US20260051317A1 (en) Training of speech recognition systems
DE69923379T2 (de) Nicht-interaktive Registrierung zur Spracherkennung
DE3910467C2 (de) Verfahren und Vorrichtung zur Erzeugung von Berichten
DE69737987T2 (de) Verfahren und System zum Puffern erkannter Wörter während der Spracherkennung
DE68928097T2 (de) Spracherkennungssystem
DE60215272T2 (de) Verfahren und Vorrichtung zur sprachlichen Dateneingabe bei ungünstigen Bedingungen
DE69922104T2 (de) Spracherkenner mit durch buchstabierte Worteingabe adaptierbarem Wortschatz
DE69721938T2 (de) Verfahren und System zum Anzeigen einer variabelen Anzahl alternativer Wörter während der Spracherkennung
US20210233530A1 (en) Transcription generation from multiple speech recognition systems
DE60209103T2 (de) Texteditierung von erkannter sprache bei gleichzeitiger wiedergabe
DE69514382T2 (de) Spracherkennung
DE69632517T2 (de) Erkennung kontinuierlicher Sprache
DE102017124264B4 (de) Computerimplementiertes Verfahren und Rechensystem zum Bestimmen phonetischer Beziehungen
DE60010827T2 (de) Hintergrundsystem für Audiodatenbeseitigung
DE69725091T2 (de) Verfahren und System zum Editieren von Sätzen während der kontinuierlichen Spracherkennung
US20090037171A1 (en) Real-time voice transcription system
DE19847419A1 (de) Verfahren zur automatischen Erkennung einer buchstabierten sprachlichen Äußerung
DE60318385T2 (de) Sprachverarbeitungseinrichtung und -verfahren, aufzeichnungsmedium und programm
DE60312963T2 (de) Methode und gerät zur schnellen, durch mustererkennung gestützen transkription von gesprochenen und geschriebenen äusserungen
EP1058235B1 (de) Wiedergabeverfahren für sprachgesteuerte Systeme mit text-basierter Sprachsynthese
EP1282897B1 (de) Verfahren zum erzeugen einer sprachdatenbank für einen zielwortschatz zum trainieren eines spracherkennungssystems
DE112006000322T5 (de) Audioerkennungssystem zur Erzeugung von Antwort-Audio unter Verwendung extrahierter Audiodaten
DE112015003357T5 (de) Verfahren und System zum Erkennen einer eine Wortabfolge enthaltenden Sprachansage

Legal Events

Date Code Title Description
8139 Disposal/non-payment of the annual fee