JP5025261B2 - System for correction of speech recognition results with confidence level indication - Google Patents
- Publication number
- JP5025261B2 (application JP2006506791A)
- Authority
- JP
- Japan
- Prior art keywords
- information
- confidence level
- words
- recognized
- speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/221—Announcement of recognition results
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Document Processing Apparatus (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP03100853.5 | 2003-03-31 | ||
| EP03100853 | 2003-03-31 | ||
| PCT/IB2004/050360 WO2004088635A1 (en) | 2003-03-31 | 2004-03-30 | System for correction of speech recognition results with confidence level indication |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2006522363A (ja) | 2006-09-28 |
| JP2006522363A5 | 2012-06-14 |
| JP5025261B2 (ja) | 2012-09-12 |
Family
ID=33104160
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2006506791A (Expired - Fee Related; published as JP5025261B2, ja) | 2003-03-31 | 2004-03-30 | System for correction of speech recognition results with confidence level indication |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US20060195318A1 |
| EP (1) | EP1611570B1 |
| JP (1) | JP5025261B2 |
| WO (1) | WO2004088635A1 |
Families Citing this family (23)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2575373C (en) * | 2004-07-30 | 2015-09-08 | Dictaphone Corporation | A system and method for report level confidence |
| US7844464B2 (en) * | 2005-07-22 | 2010-11-30 | Multimodal Technologies, Inc. | Content-based audio playback emphasis |
| JP4659681B2 (ja) * | 2005-06-13 | 2011-03-30 | パナソニック株式会社 | Content tagging support device and content tagging support method |
| US8032372B1 (en) | 2005-09-13 | 2011-10-04 | Escription, Inc. | Dictation selection |
| US8510109B2 (en) | 2007-08-22 | 2013-08-13 | Canyon Ip Holdings Llc | Continuous speech transcription performance indication |
| WO2007150005A2 (en) * | 2006-06-22 | 2007-12-27 | Multimodal Technologies, Inc. | Automatic decision support |
| US9973450B2 (en) | 2007-09-17 | 2018-05-15 | Amazon Technologies, Inc. | Methods and systems for dynamically updating web service profile information by parsing transcribed message strings |
| US8667532B2 (en) * | 2007-04-18 | 2014-03-04 | Google Inc. | Content recognition for targeting video advertisements |
| US9064024B2 (en) | 2007-08-21 | 2015-06-23 | Google Inc. | Bundle generation |
| KR20090047159A (ko) * | 2007-11-07 | 2009-05-12 | 삼성전자주식회사 | Audio-book playback method and apparatus |
| US9824372B1 (en) | 2008-02-11 | 2017-11-21 | Google Llc | Associating advertisements with videos |
| US9152708B1 (en) | 2009-12-14 | 2015-10-06 | Google Inc. | Target-video specific co-watched video clusters |
| US20110313762A1 (en) * | 2010-06-20 | 2011-12-22 | International Business Machines Corporation | Speech output with confidence indication |
| US8554558B2 (en) * | 2010-07-12 | 2013-10-08 | Nuance Communications, Inc. | Visualizing automatic speech recognition and machine translation output |
| US8639508B2 (en) * | 2011-02-14 | 2014-01-28 | General Motors Llc | User-specific confidence thresholds for speech recognition |
| US9064492B2 (en) | 2012-07-09 | 2015-06-23 | Nuance Communications, Inc. | Detecting potential significant errors in speech recognition results |
| JP2014202848A (ja) * | 2013-04-03 | 2014-10-27 | 株式会社東芝 | Text generation device, method, and program |
| US11169773B2 (en) | 2014-04-01 | 2021-11-09 | TekWear, LLC | Systems, methods, and apparatuses for agricultural data collection, analysis, and management via a mobile device |
| WO2018022301A1 (en) * | 2016-07-12 | 2018-02-01 | TekWear, LLC | Systems, methods, and apparatuses for agricultural data collection, analysis, and management via a mobile device |
| CN106409296A (zh) * | 2016-09-14 | 2017-02-15 | 安徽声讯信息技术有限公司 | Fast speech transcription and correction system based on split-core processing technology |
| US12387720B2 (en) | 2020-11-20 | 2025-08-12 | SoundHound AI IP, LLC. | Neural sentence generator for virtual assistants |
| US12223948B2 (en) * | 2022-02-03 | 2025-02-11 | Soundhound, Inc. | Token confidence scores for automatic speech recognition |
| US12394411B2 (en) | 2022-10-27 | 2025-08-19 | SoundHound AI IP, LLC. | Domain specific neural sentence generator for multi-domain virtual assistants |
Family Cites Families (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS5975299A (ja) * | 1982-10-25 | 1984-04-27 | 株式会社日立製作所 | Speech recognition device |
| JPS63269200A (ja) * | 1987-04-28 | 1988-11-07 | キヤノン株式会社 | Speech recognition device |
| AT390685B (de) * | 1988-10-25 | 1990-06-11 | Philips Nv | Text processing system |
| US5960447A (en) * | 1995-11-13 | 1999-09-28 | Holt; Douglas | Word tagging and editing system for speech recognition |
| GB2303955B (en) * | 1996-09-24 | 1997-05-14 | Allvoice Computing Plc | Data processing method and apparatus |
| US5950160A (en) * | 1996-10-31 | 1999-09-07 | Microsoft Corporation | Method and system for displaying a variable number of alternative words during speech recognition |
| US6122613A (en) * | 1997-01-30 | 2000-09-19 | Dragon Systems, Inc. | Speech recognition using multiple recognizers (selectively) applied to the same input sample |
| US6173259B1 (en) * | 1997-03-27 | 2001-01-09 | Speech Machines Plc | Speech to text conversion |
| US6006183A (en) * | 1997-12-16 | 1999-12-21 | International Business Machines Corp. | Speech recognition confidence level display |
| US6195637B1 (en) * | 1998-03-25 | 2001-02-27 | International Business Machines Corp. | Marking and deferring correction of misrecognition errors |
| DE19821422A1 (de) * | 1998-05-13 | 1999-11-18 | Philips Patentverwaltung | Method for displaying words determined from a speech signal |
| US6611802B2 (en) * | 1999-06-11 | 2003-08-26 | International Business Machines Corporation | Method and system for proofreading and correcting dictated text |
| JP2001142482A (ja) * | 1999-11-10 | 2001-05-25 | Nippon Hoso Kyokai <Nhk> | Speech captioning device |
| EP1169678B1 (en) * | 1999-12-20 | 2015-01-21 | Nuance Communications Austria GmbH | Audio playback for text edition in a speech recognition system |
| WO2002009093A1 (en) * | 2000-07-20 | 2002-01-31 | Koninklijke Philips Electronics N.V. | Feedback of recognized command confidence level |
| US7092496B1 (en) * | 2000-09-18 | 2006-08-15 | International Business Machines Corporation | Method and apparatus for processing information signals based on content |
| US20020152071A1 (en) * | 2001-04-12 | 2002-10-17 | David Chaiken | Human-augmented, automatic speech recognition engine |
| EP1262954A1 (en) * | 2001-05-30 | 2002-12-04 | Telefonaktiebolaget L M Ericsson (Publ) | Method and apparatus for verbal entry of digits or commands |
| US20020184022A1 (en) * | 2001-06-05 | 2002-12-05 | Davenport Gary F. | Proofreading assistance techniques for a voice recognition system |
| JP4145796B2 (ja) * | 2001-10-31 | 2008-09-03 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | Method and system for transcribing dictations in text files and for correcting text |
2004
- 2004-03-30: JP, application JP2006506791A, publication JP5025261B2 (ja), not active (Expired - Fee Related)
- 2004-03-30: US, application US10/550,877, publication US20060195318A1 (en), not active (Abandoned)
- 2004-03-30: WO, application PCT/IB2004/050360, publication WO2004088635A1 (en), not active (Ceased)
- 2004-03-30: EP, application EP04724340.7A, publication EP1611570B1 (en), not active (Expired - Lifetime)
Also Published As
| Publication number | Publication date |
|---|---|
| EP1611570A1 (en) | 2006-01-04 |
| JP2006522363A (ja) | 2006-09-28 |
| US20060195318A1 (en) | 2006-08-31 |
| EP1611570B1 (en) | 2017-06-28 |
| WO2004088635A1 (en) | 2004-10-14 |
Similar Documents
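| Publication | Publication Date | Title |
|---|---|---|
| JP5025261B2 (ja) | | System for correction of speech recognition results with confidence level indication |
| JP4173371B2 (ja) | | Character editing during synchronized playback of recognized speech |
| EP1430474B1 (en) | | Correcting a text recognized by speech recognition through comparison of phonetic sequences in the recognized text with a phonetic transcription of a manually input correction word |
| JP6463825B2 (ja) | | Multi-speaker speech recognition correction system |
| US8706495B2 (en) | | Synchronise an audio cursor and a text cursor during editing |
| US8924216B2 (en) | | System and method for synchronizing sound and manually transcribed text |
| JP2006351028A (ja) | | Method and system for displaying a variable number of alternative words during speech recognition |
| JP2019148681A (ja) | | Text correction device, text correction method, and text correction program |
| JP2013152365A (ja) | | Transcription support system and transcription support method |
| US20140303974A1 (en) | | Text generator, text generating method, and computer program product |
| EP2682931B1 (en) | | Method and apparatus for recording and playing user voice in mobile terminal |
| JP2002132287A (ja) | | Audio recording method, audio recording device, and storage medium |
| JP2013025299A (ja) | | Transcription support system and transcription support method |
| JP2005141089A (ja) | | Information processing device, information processing method, recording medium, and program |
| US8140338B2 (en) | | Method and system for speech based document history tracking |
| CN114125184B (zh) | | Teleprompting method, device, terminal, and storage medium |
| JP2005509906A (ja) | | Device for editing text in a predetermined window |
| JP5892598B2 (ja) | | Speech-to-text conversion work support device, speech-to-text conversion system, speech-to-text conversion work support method, and program |
| JP4272611B2 (ja) | | Video processing method, video processing device, video processing program, and computer-readable recording medium storing the program |
| JP2020017885A (ja) | | Information processing device and program |
| JP2004288008A (ja) | | Presentation program and presentation device |
| KR101501705B1 (ko) | | Apparatus and method for generating documents using speech data, and computer-readable recording medium |
| JP2002268683A (ja) | | Information processing method and device |
| JP6387044B2 (ja) | | Text processing device, text processing method, and text processing program |
| JP2012190088A (ja) | | Audio recording device, method, and program |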
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20070328 |
| | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621; Effective date: 20070328 |
| | A711 | Notification of change in applicant | Free format text: JAPANESE INTERMEDIATE CODE: A711; Effective date: 20090715 |
| | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131; Effective date: 20100330 |
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20100630 |
| | A02 | Decision of refusal | Free format text: JAPANESE INTERMEDIATE CODE: A02; Effective date: 20100907 |
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20110107 |
| | A911 | Transfer to examiner for re-examination before appeal (zenchi) | Free format text: JAPANESE INTERMEDIATE CODE: A911; Effective date: 20110118 |
| | A912 | Re-examination (zenchi) completed and case transferred to appeal board | Free format text: JAPANESE INTERMEDIATE CODE: A912; Effective date: 20110401 |
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20120423 |
| | A524 | Written submission of copy of amendment under article 19 pct | Free format text: JAPANESE INTERMEDIATE CODE: A524; Effective date: 20120423 |
| | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 |
| | A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61; Effective date: 20120619 |
| | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20150629; Year of fee payment: 3 |
| | R150 | Certificate of patent or registration of utility model | Free format text: JAPANESE INTERMEDIATE CODE: R150 |
| | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
| | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
| | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
| | LAPS | Cancellation because of no payment of annual fees | |