JP5025261B2 - 信頼水準の指示により音声認識の結果を訂正するためのシステム - Google Patents

信頼水準の指示により音声認識の結果を訂正するためのシステム Download PDF

Info

Publication number
JP5025261B2
JP5025261B2 JP2006506791A JP2006506791A JP5025261B2 JP 5025261 B2 JP5025261 B2 JP 5025261B2 JP 2006506791 A JP2006506791 A JP 2006506791A JP 2006506791 A JP2006506791 A JP 2006506791A JP 5025261 B2 JP5025261 B2 JP 5025261B2
Authority
JP
Japan
Prior art keywords
information
confidence level
words
recognized
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2006506791A
Other languages
English (en)
Japanese (ja)
Other versions
JP2006522363A (ja
JP2006522363A5 (enExample
Inventor
スタングルマイヤー,クラウス
Original Assignee
ニュアンス コミュニケーションズ オーストリア ゲーエムベーハー
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ニュアンス コミュニケーションズ オーストリア ゲーエムベーハー filed Critical ニュアンス コミュニケーションズ オーストリア ゲーエムベーハー
Publication of JP2006522363A publication Critical patent/JP2006522363A/ja
Publication of JP2006522363A5 publication Critical patent/JP2006522363A5/ja
Application granted granted Critical
Publication of JP5025261B2 publication Critical patent/JP5025261B2/ja
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/221Announcement of recognition results

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Document Processing Apparatus (AREA)
JP2006506791A 2003-03-31 2004-03-30 信頼水準の指示により音声認識の結果を訂正するためのシステム Expired - Fee Related JP5025261B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP03100853.5 2003-03-31
EP03100853 2003-03-31
PCT/IB2004/050360 WO2004088635A1 (en) 2003-03-31 2004-03-30 System for correction of speech recognition results with confidence level indication

Publications (3)

Publication Number Publication Date
JP2006522363A JP2006522363A (ja) 2006-09-28
JP2006522363A5 JP2006522363A5 (enExample) 2012-06-14
JP5025261B2 true JP5025261B2 (ja) 2012-09-12

Family

ID=33104160

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2006506791A Expired - Fee Related JP5025261B2 (ja) 2003-03-31 2004-03-30 信頼水準の指示により音声認識の結果を訂正するためのシステム

Country Status (4)

Country Link
US (1) US20060195318A1 (enExample)
EP (1) EP1611570B1 (enExample)
JP (1) JP5025261B2 (enExample)
WO (1) WO2004088635A1 (enExample)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2575373C (en) * 2004-07-30 2015-09-08 Dictaphone Corporation A system and method for report level confidence
US7844464B2 (en) * 2005-07-22 2010-11-30 Multimodal Technologies, Inc. Content-based audio playback emphasis
JP4659681B2 (ja) * 2005-06-13 2011-03-30 パナソニック株式会社 コンテンツタグ付け支援装置およびコンテンツタグ付け支援方法
US8032372B1 (en) 2005-09-13 2011-10-04 Escription, Inc. Dictation selection
US8510109B2 (en) 2007-08-22 2013-08-13 Canyon Ip Holdings Llc Continuous speech transcription performance indication
WO2007150005A2 (en) * 2006-06-22 2007-12-27 Multimodal Technologies, Inc. Automatic decision support
US9973450B2 (en) 2007-09-17 2018-05-15 Amazon Technologies, Inc. Methods and systems for dynamically updating web service profile information by parsing transcribed message strings
US8667532B2 (en) * 2007-04-18 2014-03-04 Google Inc. Content recognition for targeting video advertisements
US9064024B2 (en) 2007-08-21 2015-06-23 Google Inc. Bundle generation
KR20090047159A (ko) * 2007-11-07 2009-05-12 삼성전자주식회사 오디오-북 재생 방법 및 장치
US9824372B1 (en) 2008-02-11 2017-11-21 Google Llc Associating advertisements with videos
US9152708B1 (en) 2009-12-14 2015-10-06 Google Inc. Target-video specific co-watched video clusters
US20110313762A1 (en) * 2010-06-20 2011-12-22 International Business Machines Corporation Speech output with confidence indication
US8554558B2 (en) * 2010-07-12 2013-10-08 Nuance Communications, Inc. Visualizing automatic speech recognition and machine translation output
US8639508B2 (en) * 2011-02-14 2014-01-28 General Motors Llc User-specific confidence thresholds for speech recognition
US9064492B2 (en) 2012-07-09 2015-06-23 Nuance Communications, Inc. Detecting potential significant errors in speech recognition results
JP2014202848A (ja) * 2013-04-03 2014-10-27 株式会社東芝 テキスト生成装置、方法、及びプログラム
US11169773B2 (en) 2014-04-01 2021-11-09 TekWear, LLC Systems, methods, and apparatuses for agricultural data collection, analysis, and management via a mobile device
WO2018022301A1 (en) * 2016-07-12 2018-02-01 TekWear, LLC Systems, methods, and apparatuses for agricultural data collection, analysis, and management via a mobile device
CN106409296A (zh) * 2016-09-14 2017-02-15 安徽声讯信息技术有限公司 基于分核处理技术的语音快速转写校正系统
US12387720B2 (en) 2020-11-20 2025-08-12 SoundHound AI IP, LLC. Neural sentence generator for virtual assistants
US12223948B2 (en) * 2022-02-03 2025-02-11 Soundhound, Inc. Token confidence scores for automatic speech recognition
US12394411B2 (en) 2022-10-27 2025-08-19 SoundHound AI IP, LLC. Domain specific neural sentence generator for multi-domain virtual assistants

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5975299A (ja) * 1982-10-25 1984-04-27 株式会社日立製作所 音声認識装置
JPS63269200A (ja) * 1987-04-28 1988-11-07 キヤノン株式会社 音声認識装置
AT390685B (de) * 1988-10-25 1990-06-11 Philips Nv System zur textverarbeitung
US5960447A (en) * 1995-11-13 1999-09-28 Holt; Douglas Word tagging and editing system for speech recognition
GB2303955B (en) * 1996-09-24 1997-05-14 Allvoice Computing Plc Data processing method and apparatus
US5950160A (en) * 1996-10-31 1999-09-07 Microsoft Corporation Method and system for displaying a variable number of alternative words during speech recognition
US6122613A (en) * 1997-01-30 2000-09-19 Dragon Systems, Inc. Speech recognition using multiple recognizers (selectively) applied to the same input sample
US6173259B1 (en) * 1997-03-27 2001-01-09 Speech Machines Plc Speech to text conversion
US6006183A (en) * 1997-12-16 1999-12-21 International Business Machines Corp. Speech recognition confidence level display
US6195637B1 (en) * 1998-03-25 2001-02-27 International Business Machines Corp. Marking and deferring correction of misrecognition errors
DE19821422A1 (de) * 1998-05-13 1999-11-18 Philips Patentverwaltung Verfahren zum Darstellen von aus einem Sprachsignal ermittelten Wörtern
US6611802B2 (en) * 1999-06-11 2003-08-26 International Business Machines Corporation Method and system for proofreading and correcting dictated text
JP2001142482A (ja) * 1999-11-10 2001-05-25 Nippon Hoso Kyokai <Nhk> 音声字幕化装置
EP1169678B1 (en) * 1999-12-20 2015-01-21 Nuance Communications Austria GmbH Audio playback for text edition in a speech recognition system
WO2002009093A1 (en) * 2000-07-20 2002-01-31 Koninklijke Philips Electronics N.V. Feedback of recognized command confidence level
US7092496B1 (en) * 2000-09-18 2006-08-15 International Business Machines Corporation Method and apparatus for processing information signals based on content
US20020152071A1 (en) * 2001-04-12 2002-10-17 David Chaiken Human-augmented, automatic speech recognition engine
EP1262954A1 (en) * 2001-05-30 2002-12-04 Telefonaktiebolaget L M Ericsson (Publ) Method and apparatus for verbal entry of digits or commands
US20020184022A1 (en) * 2001-06-05 2002-12-05 Davenport Gary F. Proofreading assistance techniques for a voice recognition system
JP4145796B2 (ja) * 2001-10-31 2008-09-03 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ テキストファイルのディクテーションを筆記するための及びテキストを修正するための方法及びシステム

Also Published As

Publication number Publication date
EP1611570A1 (en) 2006-01-04
JP2006522363A (ja) 2006-09-28
US20060195318A1 (en) 2006-08-31
EP1611570B1 (en) 2017-06-28
WO2004088635A1 (en) 2004-10-14

Similar Documents

Publication Publication Date Title
JP5025261B2 (ja) 信頼水準の指示により音声認識の結果を訂正するためのシステム
JP4173371B2 (ja) 認識音声に対する同期再生中の文字編集
EP1430474B1 (en) Correcting a text recognized by speech recognition through comparison of phonetic sequences in the recognized text with a phonetic transcription of a manually input correction word
JP6463825B2 (ja) 多重話者音声認識修正システム
US8706495B2 (en) Synchronise an audio cursor and a text cursor during editing
US8924216B2 (en) System and method for synchronizing sound and manually transcribed text
JP2006351028A (ja) 音声認識中に可変数の代替ワードを表示する方法及びシステム
JP2019148681A (ja) テキスト修正装置、テキスト修正方法およびテキスト修正プログラム
JP2013152365A (ja) 書き起こし支援システムおよび書き起こし支援方法
US20140303974A1 (en) Text generator, text generating method, and computer program product
EP2682931B1 (en) Method and apparatus for recording and playing user voice in mobile terminal
JP2002132287A (ja) 音声収録方法および音声収録装置および記憶媒体
JP2013025299A (ja) 書き起こし支援システムおよび書き起こし支援方法
JP2005141089A (ja) 情報処理装置、情報処理方法ならびに記録媒体、プログラム
US8140338B2 (en) Method and system for speech based document history tracking
CN114125184B (zh) 一种提词方法、装置、终端及存储介质
JP2005509906A (ja) 所定ウィンドウにてテキストを編集する装置
JP5892598B2 (ja) 音声文字変換作業支援装置、音声文字変換システム、音声文字変換作業支援方法及びプログラム
JP4272611B2 (ja) 映像処理方法、映像処理装置、映像処理用プログラム及びそのプログラムを記録したコンピュータ読み取り可能な記録媒体
JP2020017885A (ja) 情報処理装置およびプログラム
JP2004288008A (ja) プレゼンテーション用プログラム及びプレゼンテーション用装置
KR101501705B1 (ko) 음성 데이터를 이용한 문서 생성 장치, 방법 및 컴퓨터 판독 가능 기록 매체
JP2002268683A (ja) 情報処理方法及び装置
JP6387044B2 (ja) テキスト処理装置、テキスト処理方法およびテキスト処理プログラム
JP2012190088A (ja) 音声記録装置、方法及びプログラム

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20070328

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20070328

A711 Notification of change in applicant

Free format text: JAPANESE INTERMEDIATE CODE: A711

Effective date: 20090715

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20100330

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20100630

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20100907

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20110107

A911 Transfer to examiner for re-examination before appeal (zenchi)

Free format text: JAPANESE INTERMEDIATE CODE: A911

Effective date: 20110118

A912 Re-examination (zenchi) completed and case transferred to appeal board

Free format text: JAPANESE INTERMEDIATE CODE: A912

Effective date: 20110401

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20120423

A524 Written submission of copy of amendment under article 19 pct

Free format text: JAPANESE INTERMEDIATE CODE: A524

Effective date: 20120423

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20120619

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20150629

Year of fee payment: 3

R150 Certificate of patent or registration of utility model

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

LAPS Cancellation because of no payment of annual fees