DE60209518D1 - Korrekturvorrichtung mit markierung von teilen eines erkannten textes - Google Patents

Korrekturvorrichtung mit markierung von teilen eines erkannten textes

Info

Publication number
DE60209518D1
DE60209518D1 DE60209518T DE60209518T DE60209518D1 DE 60209518 D1 DE60209518 D1 DE 60209518D1 DE 60209518 T DE60209518 T DE 60209518T DE 60209518 T DE60209518 T DE 60209518T DE 60209518 D1 DE60209518 D1 DE 60209518D1
Authority
DE
Germany
Prior art keywords
text
correction device
reproduced
recognized
parts
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60209518T
Other languages
English (en)
Other versions
DE60209518T2 (de
Inventor
Wolfgang Gschwendtner
Kresimir Rajic
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nuance Communications Austria GmbH
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV filed Critical Koninklijke Philips Electronics NV
Publication of DE60209518D1 publication Critical patent/DE60209518D1/de
Application granted granted Critical
Publication of DE60209518T2 publication Critical patent/DE60209518T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0631Creating reference templates; Clustering

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Document Processing Apparatus (AREA)
  • Facsimile Heads (AREA)
  • Image Processing (AREA)
DE60209518T 2001-10-12 2002-10-10 Korrekturvorrichtung, die Teile eines erkannten Texts kennzeichnet Expired - Lifetime DE60209518T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP01000534 2001-10-12
EP01000534 2001-10-12
PCT/IB2002/004178 WO2003034405A1 (en) 2001-10-12 2002-10-10 Correction device marking parts of a recognized text

Publications (2)

Publication Number Publication Date
DE60209518D1 true DE60209518D1 (de) 2006-04-27
DE60209518T2 DE60209518T2 (de) 2006-08-24

Family

ID=8176072

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60209518T Expired - Lifetime DE60209518T2 (de) 2001-10-12 2002-10-10 Korrekturvorrichtung, die Teile eines erkannten Texts kennzeichnet

Country Status (7)

Country Link
US (1) US6708148B2 (de)
EP (1) EP1442452B1 (de)
JP (1) JP4336580B2 (de)
CN (1) CN1312612C (de)
AT (1) ATE319161T1 (de)
DE (1) DE60209518T2 (de)
WO (1) WO2003034405A1 (de)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4850322B2 (ja) * 1998-03-03 2012-01-11 ニュアンス コミュニケーションズ オーストリア ゲーエムベーハー テキストブロックデータ変更用の音声認識装置及びテキスト変更手段を具備したテキスト処理システム
DE10204924A1 (de) * 2002-02-07 2003-08-21 Philips Intellectual Property Verfahren und Vorrichtung zur schnellen mustererkennungsunterstützten Transkription gesprochener und schriftlicher Äußerungen
US20040024598A1 (en) * 2002-07-03 2004-02-05 Amit Srivastava Thematic segmentation of speech
US20040006628A1 (en) * 2002-07-03 2004-01-08 Scott Shepard Systems and methods for providing real-time alerting
US20040021765A1 (en) * 2002-07-03 2004-02-05 Francis Kubala Speech recognition system for managing telemeetings
US20040204939A1 (en) * 2002-10-17 2004-10-14 Daben Liu Systems and methods for speaker change detection
US8849648B1 (en) 2002-12-24 2014-09-30 At&T Intellectual Property Ii, L.P. System and method of extracting clauses for spoken language understanding
US8818793B1 (en) 2002-12-24 2014-08-26 At&T Intellectual Property Ii, L.P. System and method of extracting clauses for spoken language understanding
US7263483B2 (en) * 2003-04-28 2007-08-28 Dictaphone Corporation USB dictation device
JP4972645B2 (ja) * 2005-08-26 2012-07-11 ニュアンス コミュニケーションズ オーストリア ゲーエムベーハー サウンド及び手作業により転写されるテキストを同期させるシステム及び方法
US20070067348A1 (en) * 2005-09-18 2007-03-22 Andreyev Dmitriy S Repeated Segment Manager
US20070094022A1 (en) * 2005-10-20 2007-04-26 Hahn Koo Method and device for recognizing human intent
US8036889B2 (en) * 2006-02-27 2011-10-11 Nuance Communications, Inc. Systems and methods for filtering dictated and non-dictated sections of documents
US7716040B2 (en) * 2006-06-22 2010-05-11 Multimodal Technologies, Inc. Verification of extracted data
US7925986B2 (en) 2006-10-06 2011-04-12 Veveo, Inc. Methods and systems for a linear character selection display interface for ambiguous text input
US20080313574A1 (en) * 2007-05-25 2008-12-18 Veveo, Inc. System and method for search with reduced physical interaction requirements
JP2009169139A (ja) * 2008-01-17 2009-07-30 Alpine Electronics Inc 音声認識装置
US8121842B2 (en) 2008-12-12 2012-02-21 Microsoft Corporation Audio output of a document from mobile device
CN102955767A (zh) * 2011-08-29 2013-03-06 王道平 一种修改文字的方法
KR20140008835A (ko) * 2012-07-12 2014-01-22 삼성전자주식회사 음성 인식 오류 수정 방법 및 이를 적용한 방송 수신 장치
JP2014240940A (ja) * 2013-06-12 2014-12-25 株式会社東芝 書き起こし支援装置、方法、及びプログラム
JP6417104B2 (ja) * 2014-04-16 2018-10-31 株式会社日立システムズ テキスト編集装置、テキスト編集方法、及びプログラム
CN105702252B (zh) * 2016-03-31 2019-09-17 海信集团有限公司 一种语音识别方法及装置
CN106710597B (zh) * 2017-01-04 2020-12-11 广东小天才科技有限公司 语音数据的录音方法及装置
US10229685B2 (en) * 2017-01-18 2019-03-12 International Business Machines Corporation Symbol sequence estimation in speech
CN108364653B (zh) * 2018-02-12 2021-08-13 王磊 语音数据处理方法及处理装置
KR20210047173A (ko) * 2019-10-21 2021-04-29 엘지전자 주식회사 오인식된 단어를 바로잡아 음성을 인식하는 인공 지능 장치 및 그 방법
CN111460765B (zh) * 2020-03-30 2020-12-29 掌阅科技股份有限公司 电子书籍标注处理方法、电子设备及存储介质

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AT390685B (de) * 1988-10-25 1990-06-11 Philips Nv System zur textverarbeitung
JP2619962B2 (ja) * 1990-02-28 1997-06-11 株式会社日立製作所 図形編集方法および装置
US5855000A (en) * 1995-09-08 1998-12-29 Carnegie Mellon University Method and apparatus for correcting and repairing machine-transcribed input using independent or cross-modal secondary input
US5960447A (en) * 1995-11-13 1999-09-28 Holt; Douglas Word tagging and editing system for speech recognition
GB2302199B (en) * 1996-09-24 1997-05-14 Allvoice Computing Plc Data processing method and apparatus
US5909667A (en) * 1997-03-05 1999-06-01 International Business Machines Corporation Method and apparatus for fast voice selection of error words in dictated text
KR100223300B1 (ko) * 1997-09-10 1999-10-15 서평원 분산 제어와 난블로킹 교환 시스템
US6457031B1 (en) * 1998-09-02 2002-09-24 International Business Machines Corp. Method of marking previously dictated text for deferred correction in a speech recognition proofreader
US6161087A (en) * 1998-10-05 2000-12-12 Lernout & Hauspie Speech Products N.V. Speech-recognition-assisted selective suppression of silent and filled speech pauses during playback of an audio recording
US6360237B1 (en) * 1998-10-05 2002-03-19 Lernout & Hauspie Speech Products N.V. Method and system for performing text edits during audio recording playback
US6611802B2 (en) * 1999-06-11 2003-08-26 International Business Machines Corporation Method and system for proofreading and correcting dictated text
CN1207664C (zh) * 1999-07-27 2005-06-22 国际商业机器公司 对语音识别结果中的错误进行校正的方法和语音识别系统
EP2261893B1 (de) * 1999-12-20 2016-03-30 Nuance Communications Austria GmbH Audiowiedergabe für texteingabe in einem spracherkennungssystem

Also Published As

Publication number Publication date
US6708148B2 (en) 2004-03-16
CN1312612C (zh) 2007-04-25
EP1442452B1 (de) 2006-03-01
JP4336580B2 (ja) 2009-09-30
ATE319161T1 (de) 2006-03-15
JP2005505805A (ja) 2005-02-24
WO2003034405A1 (en) 2003-04-24
US20030110030A1 (en) 2003-06-12
CN1568501A (zh) 2005-01-19
DE60209518T2 (de) 2006-08-24
EP1442452A1 (de) 2004-08-04

Similar Documents

Publication Publication Date Title
DE60209518D1 (de) Korrekturvorrichtung mit markierung von teilen eines erkannten textes
DE60209103D1 (de) Texteditierung von erkannter sprache bei gleichzeitiger wiedergabe
ATE496363T1 (de) Spracherkennungsvorrichtung mit markierung von erkannten textteilen
WO2004003688A8 (en) A method for comparing a transcribed text file with a previously created file
EP1205898A3 (de) Leselehrtechnik für Kinder im Leselernalter
KR970004468A (ko) 피압축 음성 정보의 제1 및 제2연속적인 각 프레임의 적어도 일부를 신뢰성있게 수신하지 못한 경우, 상기 벡터 신호를 디코드된 음성 신호를 발생하는데 사용하는, 음성 디코더내에서 이용하기 위한 방법
WO2004070701A3 (en) Linguistic prosodic model-based text to speech
EP1455268A3 (de) Auf Benutzereingang basierte Datendarstellung
DE60111329D1 (de) Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung
EP1081586A3 (de) Sprachsteuerung eines elektronischen Post-Klient (E-Mail)
CA2307300A1 (en) Method and system for proofreading and correcting dictated text
TR200102364T2 (tr) Otomatikleştirilmiş transkripsiyon sistemi ve iki konuşma dönüştürme seferini ve bilgisayar-yardımlı düzeltme kullanan yöntem.
DE60225348D1 (de) Auswahl eines Musikstücks anhand von Metadaten und einer externen Tempo-Eingabe
WO2006031609A3 (en) Machine learning
HK1063373A1 (en) Musical tone and voice reproduction device and control method thereof, and server device.
WO2004027651A3 (en) Information research initiated from a scanned image media
EP1050872A3 (de) Verfahren und System zur Auswahl erkannter Wörter bei der Korrektur erkannter Sprache
EP1168306A3 (de) Verfahren und Vorrichtung zur Verbesserung von der Verständlichkeit eines digital komprimierten Sprachsignals
SE9502202D0 (sv) Metod vid tal-till-textomvandling
CA2366892A1 (en) Method and apparatus for speaker recognition using a speaker dependent transform
ATE407411T1 (de) Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text
WO2007005098A3 (en) Method and apparatus for generating and updating a voice tag
ATE239966T1 (de) Anwendung von referenzdaten für spracherkennung
AU2002233238A1 (en) Mobile terminal controllable by spoken utterances
DE60020504D1 (de) Anpassung eines spracherkenners an korrigierte texte

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8327 Change in the person/name/address of the patent owner

Owner name: NUANCE COMMUNICATIONS AUSTRIA GMBH, WIEN, AT

8328 Change in the person/name/address of the agent

Representative=s name: VOSSIUS & PARTNER, 81675 MUENCHEN