ATE325413T1 - Verfahren und vorrichtung zur wandlung gesprochener in geschriebene texte und korrektur der erkannten texte - Google Patents

Verfahren und vorrichtung zur wandlung gesprochener in geschriebene texte und korrektur der erkannten texte

Info

Publication number
ATE325413T1
ATE325413T1 AT02777662T AT02777662T ATE325413T1 AT E325413 T1 ATE325413 T1 AT E325413T1 AT 02777662 T AT02777662 T AT 02777662T AT 02777662 T AT02777662 T AT 02777662T AT E325413 T1 ATE325413 T1 AT E325413T1
Authority
AT
Austria
Prior art keywords
text
file
texts
recognized
dictation
Prior art date
Application number
AT02777662T
Other languages
English (en)
Inventor
Kwaku Frimpong-Ansah
Original Assignee
Koninkl Philips Electronics Nv
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninkl Philips Electronics Nv filed Critical Koninkl Philips Electronics Nv
Application granted granted Critical
Publication of ATE325413T1 publication Critical patent/ATE325413T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0631Creating reference templates; Clustering

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)
  • Character Discrimination (AREA)
AT02777662T 2001-10-31 2002-10-24 Verfahren und vorrichtung zur wandlung gesprochener in geschriebene texte und korrektur der erkannten texte ATE325413T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP01890304 2001-10-31

Publications (1)

Publication Number Publication Date
ATE325413T1 true ATE325413T1 (de) 2006-06-15

Family

ID=8185163

Family Applications (1)

Application Number Title Priority Date Filing Date
AT02777662T ATE325413T1 (de) 2001-10-31 2002-10-24 Verfahren und vorrichtung zur wandlung gesprochener in geschriebene texte und korrektur der erkannten texte

Country Status (7)

Country Link
US (1) US7184956B2 (de)
EP (1) EP1442451B1 (de)
JP (1) JP4145796B2 (de)
CN (1) CN1269105C (de)
AT (1) ATE325413T1 (de)
DE (1) DE60211197T2 (de)
WO (1) WO2003038808A1 (de)

Families Citing this family (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ATE286294T1 (de) 2001-03-29 2005-01-15 Koninkl Philips Electronics Nv Synchronisierung eines audio- und eines textcursors während der editierung
DE10142232B4 (de) 2001-08-29 2021-04-29 Roche Diabetes Care Gmbh Verfahren zur Herstellung eines analytischen Hilfsmittels mit Lanzette und Testelement
JP5025261B2 (ja) * 2003-03-31 2012-09-12 ニュアンス コミュニケーションズ オーストリア ゲーエムベーハー 信頼水準の指示により音声認識の結果を訂正するためのシステム
EP1471502A1 (de) 2003-04-25 2004-10-27 Sony International (Europe) GmbH Verfahren zur Korrektur eines spracherkannten Textes
JP2005301953A (ja) * 2004-04-12 2005-10-27 Kenichi Asano 聞き手の側のペースで音声とそれに対応する文章を関連させる方法
JP2005301811A (ja) * 2004-04-14 2005-10-27 Olympus Corp データ処理装置、関連データ生成装置、データ処理システム、データ処理ソフトウェア、関連データ生成ソフトウェア、データ処理方法、及び、関連データ生成方法
US8504369B1 (en) 2004-06-02 2013-08-06 Nuance Communications, Inc. Multi-cursor transcription editing
US7844464B2 (en) * 2005-07-22 2010-11-30 Multimodal Technologies, Inc. Content-based audio playback emphasis
US7836412B1 (en) 2004-12-03 2010-11-16 Escription, Inc. Transcription editing
US7640158B2 (en) 2005-11-08 2009-12-29 Multimodal Technologies, Inc. Automatic detection and application of editing patterns in draft documents
US7708702B2 (en) * 2006-01-26 2010-05-04 Roche Diagnostics Operations, Inc. Stack magazine system
US20070208567A1 (en) * 2006-03-01 2007-09-06 At&T Corp. Error Correction In Automatic Speech Recognition Transcripts
US7831423B2 (en) * 2006-05-25 2010-11-09 Multimodal Technologies, Inc. Replacing text representing a concept with an alternate written form of the concept
WO2007150004A2 (en) * 2006-06-22 2007-12-27 Multimodal Technologies, Inc. Verification of extracted data
US8286071B1 (en) * 2006-06-29 2012-10-09 Escription, Inc. Insertion of standard text in transcriptions
US8521510B2 (en) * 2006-08-31 2013-08-27 At&T Intellectual Property Ii, L.P. Method and system for providing an automated web transcription service
US8943394B2 (en) * 2008-11-19 2015-01-27 Robert Bosch Gmbh System and method for interacting with live agents in an automated call center
US8572488B2 (en) * 2010-03-29 2013-10-29 Avid Technology, Inc. Spot dialog editor
US8831940B2 (en) 2010-03-30 2014-09-09 Nvoq Incorporated Hierarchical quick note to allow dictated code phrases to be transcribed to standard clauses
US9760920B2 (en) 2011-03-23 2017-09-12 Audible, Inc. Synchronizing digital content
US9703781B2 (en) 2011-03-23 2017-07-11 Audible, Inc. Managing related digital content
US9706247B2 (en) 2011-03-23 2017-07-11 Audible, Inc. Synchronized digital content samples
US8855797B2 (en) 2011-03-23 2014-10-07 Audible, Inc. Managing playback of synchronized content
US8948892B2 (en) 2011-03-23 2015-02-03 Audible, Inc. Managing playback of synchronized content
US8862255B2 (en) * 2011-03-23 2014-10-14 Audible, Inc. Managing playback of synchronized content
US9697871B2 (en) 2011-03-23 2017-07-04 Audible, Inc. Synchronizing recorded audio content and companion content
US9734153B2 (en) 2011-03-23 2017-08-15 Audible, Inc. Managing related digital content
DE102011080145A1 (de) 2011-07-29 2013-01-31 Robert Bosch Gmbh Verfahren und Vorrichtung zur Verarbeitung von Befindlichkeitsdaten eines Patienten
US9037956B2 (en) 2012-03-29 2015-05-19 Audible, Inc. Content customization
US8849676B2 (en) 2012-03-29 2014-09-30 Audible, Inc. Content customization
GB2502944A (en) * 2012-03-30 2013-12-18 Jpal Ltd Segmentation and transcription of speech
US9075760B2 (en) 2012-05-07 2015-07-07 Audible, Inc. Narration settings distribution for content customization
US9317500B2 (en) 2012-05-30 2016-04-19 Audible, Inc. Synchronizing translated digital content
US9141257B1 (en) 2012-06-18 2015-09-22 Audible, Inc. Selecting and conveying supplemental content
US8972265B1 (en) 2012-06-18 2015-03-03 Audible, Inc. Multiple voices in audio content
US9536439B1 (en) 2012-06-27 2017-01-03 Audible, Inc. Conveying questions with content
US9679608B2 (en) 2012-06-28 2017-06-13 Audible, Inc. Pacing content
US9099089B2 (en) 2012-08-02 2015-08-04 Audible, Inc. Identifying corresponding regions of content
US9367196B1 (en) 2012-09-26 2016-06-14 Audible, Inc. Conveying branched content
US9632647B1 (en) 2012-10-09 2017-04-25 Audible, Inc. Selecting presentation positions in dynamic content
US9223830B1 (en) 2012-10-26 2015-12-29 Audible, Inc. Content presentation analysis
US9280906B2 (en) 2013-02-04 2016-03-08 Audible. Inc. Prompting a user for input during a synchronous presentation of audio content and textual content
US9472113B1 (en) 2013-02-05 2016-10-18 Audible, Inc. Synchronizing playback of digital content with physical content
US9317486B1 (en) 2013-06-07 2016-04-19 Audible, Inc. Synchronizing playback of digital content with captured physical content
US9489360B2 (en) 2013-09-05 2016-11-08 Audible, Inc. Identifying extra material in companion content
CN106782627B (zh) * 2015-11-23 2019-08-27 广州酷狗计算机科技有限公司 音频文件的重录方法及装置
JP2018091954A (ja) 2016-12-01 2018-06-14 オリンパス株式会社 音声認識装置、及び音声認識方法
CN108647190B (zh) * 2018-04-25 2022-04-29 北京华夏电通科技股份有限公司 一种语音识别文本插入笔录文档的方法、装置及系统
CN108984529B (zh) * 2018-07-16 2022-06-03 北京华宇信息技术有限公司 实时庭审语音识别自动纠错方法、存储介质及计算装置
CN110889309A (zh) * 2018-09-07 2020-03-17 上海怀若智能科技有限公司 金融单据分类管理系统及方法

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5566272A (en) * 1993-10-27 1996-10-15 Lucent Technologies Inc. Automatic speech recognition (ASR) processing using confidence measures
US5712957A (en) 1995-09-08 1998-01-27 Carnegie Mellon University Locating and correcting erroneously recognized portions of utterances by rescoring based on two n-best lists
US5960447A (en) * 1995-11-13 1999-09-28 Holt; Douglas Word tagging and editing system for speech recognition
GB2302199B (en) * 1996-09-24 1997-05-14 Allvoice Computing Plc Data processing method and apparatus
US6006183A (en) * 1997-12-16 1999-12-21 International Business Machines Corp. Speech recognition confidence level display
DE19821422A1 (de) * 1998-05-13 1999-11-18 Philips Patentverwaltung Verfahren zum Darstellen von aus einem Sprachsignal ermittelten Wörtern
US6064961A (en) 1998-09-02 2000-05-16 International Business Machines Corporation Display for proofreading text
US6366296B1 (en) * 1998-09-11 2002-04-02 Xerox Corporation Media browser using multimodal analysis
DE19842405A1 (de) * 1998-09-16 2000-03-23 Philips Corp Intellectual Pty Spracherkennungsverfahren mit Konfidenzmaßbewertung
US6219638B1 (en) * 1998-11-03 2001-04-17 International Business Machines Corporation Telephone messaging and editing system
FI116991B (fi) * 1999-01-18 2006-04-28 Nokia Corp Menetelmä puheen tunnistamisessa, puheentunnistuslaite ja puheella ohjattava langaton viestin
EP1169678B1 (de) * 1999-12-20 2015-01-21 Nuance Communications Austria GmbH Audiowiedergabe für texteingabe in einem spracherkennungssystem
US7092496B1 (en) * 2000-09-18 2006-08-15 International Business Machines Corporation Method and apparatus for processing information signals based on content
US6973428B2 (en) * 2001-05-24 2005-12-06 International Business Machines Corporation System and method for searching, analyzing and displaying text transcripts of speech after imperfect speech recognition

Also Published As

Publication number Publication date
EP1442451A1 (de) 2004-08-04
DE60211197T2 (de) 2007-05-03
DE60211197D1 (de) 2006-06-08
CN1578976A (zh) 2005-02-09
US7184956B2 (en) 2007-02-27
CN1269105C (zh) 2006-08-09
JP2005507536A (ja) 2005-03-17
JP4145796B2 (ja) 2008-09-03
EP1442451B1 (de) 2006-05-03
US20030083885A1 (en) 2003-05-01
WO2003038808A1 (en) 2003-05-08

Similar Documents

Publication Publication Date Title
ATE325413T1 (de) Verfahren und vorrichtung zur wandlung gesprochener in geschriebene texte und korrektur der erkannten texte
US7881930B2 (en) ASR-aided transcription with segmented feedback training
DE602004018290D1 (de) Spracherkennungs- und korrektursystem, korrekturvorrichtung und verfahren zur erstellung eines lexikons von alternativen
DE60111329D1 (de) Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung
ATE404967T1 (de) Text-zu-sprache-system und verfahren, computerprogramm dafür
WO2003065349A3 (en) Text to speech
US9240181B2 (en) Automatic collection of speaker name pronunciations
AP2001002243A0 (en) Automated transcription system and method using two speech converting instances and computer-assisted correction.
WO2006023631A3 (en) Document transcription system training
ATE524777T1 (de) Automatische aktualisierung eines sprachmodells
DE60134044D1 (de) Spracherkennung durch wort-in-phrase-befehl
DE602006011622D1 (de) Inhaltbasierte audiowiedergabebetonung
ATE407411T1 (de) Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text
JP2001296880A (ja) 固有名の複数のもっともらしい発音を生成する方法および装置
DE60207742D1 (de) Korrektur eines von einer spracherkennung erkannten textes mittels vergleich der phonemfolgen des erkannten textes mit einer phonetischen transkription eines manuell eingegebenen korrekturwortes
Ostendorf et al. A sequential repetition model for improved disfluency detection.
IL131712A (en) Automatic update to language templates
ATE363120T1 (de) Audio-dialogsystem und sprachgesteuertes browsing-verfahren
ATE514162T1 (de) Dynamische erzeugung von kontexten zur spracherkennung
ATE449401T1 (de) Automatische erzeugung einer wortaussprache für die spracherkennung
ATE405920T1 (de) Erzeugen einer spracherkennungsgrammatik für alphanumerische ausdrücke
JPWO2021181451A5 (de)
Demuynck et al. Automatic generation of phonetic transcriptions for large speech corpora.
JP2004271895A (ja) 複数言語音声認識システムおよび発音学習システム
JP2021009253A (ja) プログラム、情報処理装置、及び情報処理方法

Legal Events

Date Code Title Description
UEP Publication of translation of european patent specification

Ref document number: 1442451

Country of ref document: EP

EEIH Change in the person of patent owner