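JP2005517216A - Method and device for the rapid, pattern-recognition-supported transcription of spoken and written utterances - Google Patents
Method and device for the rapid, pattern-recognition-supported transcription of spoken and written utterances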
- Publication number
- JP2005517216A
- Authority
- JP
- Japan
- Prior art keywords
- text
- manually
- speech recognition
- word
- transcription
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/24—Speech recognition using non-acoustical features
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/142—Image acquisition using hand-held instruments; Constructional details of the instruments
- G06V30/1423—Image acquisition using hand-held instruments; Constructional details of the instruments the instrument generating sequences of position coordinates corresponding to handwriting
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Machine Translation (AREA)
- Image Processing (AREA)
- Radar Systems Or Details Thereof (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DE10204924A DE10204924A1 (de) | 2002-02-07 | 2002-02-07 | Method and device for the rapid, pattern-recognition-supported transcription of spoken and written utterances |
| PCT/IB2003/000374 WO2003067573A1 (en) | 2002-02-07 | 2003-01-30 | Method and device for the rapid, pattern-recognition-supported transcription of spoken and written utterances |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2005517216A (ja) | 2005-06-09 |
| JP2005517216A5 | 2010-05-27 |
Family
ID=27618362
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2003566843A (Pending), JP2005517216A (ja) | 2002-02-07 | 2003-01-30 | Method and device for the rapid, pattern-recognition-supported transcription of spoken and written utterances |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US20060167685A1 |
| EP (1) | EP1479070B1 |
| JP (1) | JP2005517216A |
| AT (1) | ATE358869T1 |
| AU (1) | AU2003205955A1 |
| DE (2) | DE10204924A1 |
| WO (1) | WO2003067573A1 |
Families Citing this family (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050273337A1 (en) * | 2004-06-02 | 2005-12-08 | Adoram Erell | Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition |
| US20070011012A1 (en) * | 2005-07-11 | 2007-01-11 | Steve Yurick | Method, system, and apparatus for facilitating captioning of multi-media content |
| KR100654183B1 (ko) * | 2005-11-07 | 2006-12-08 | 한국전자통신연구원 | Character input system using speech recognition and method thereof |
| US9230222B2 (en) * | 2008-07-23 | 2016-01-05 | The Quantum Group, Inc. | System and method enabling bi-translation for improved prescription accuracy |
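| JP2013025299A (ja) * | 2011-07-26 | 2013-02-04 | Toshiba Corp | Transcription support system and transcription support method |
| JP6165619B2 (ja) * | 2013-12-13 | 2017-07-19 | 株式会社東芝 | Information processing device, information processing method, and information processing program |
| CN109285548A (zh) * | 2017-07-19 | 2019-01-29 | 阿里巴巴集团控股有限公司 | Information processing method, system, electronic device, and computer storage medium |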
| US11017778B1 (en) | 2018-12-04 | 2021-05-25 | Sorenson Ip Holdings, Llc | Switching between speech recognition systems |
| US10573312B1 (en) | 2018-12-04 | 2020-02-25 | Sorenson Ip Holdings, Llc | Transcription generation from multiple speech recognition systems |
| CN110956959B (zh) * | 2019-11-25 | 2023-07-25 | 科大讯飞股份有限公司 | Speech recognition error correction method, related device, and readable storage medium |
| US11488604B2 (en) | 2020-08-19 | 2022-11-01 | Sorenson Ip Holdings, Llc | Transcription of audio |
Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS6091435A (ja) * | 1983-10-25 | 1985-05-22 | Fujitsu Ltd | Character input device |
| JPS62229300A (ja) * | 1986-03-31 | 1987-10-08 | キヤノン株式会社 | Speech recognition device |
| JPH0883092A (ja) * | 1994-09-14 | 1996-03-26 | Nippon Telegr & Teleph Corp <Ntt> | Information input device and information input method |
| JPH0968998A (ja) * | 1995-08-31 | 1997-03-11 | Matsushita Electric Ind Co Ltd | Speech recognition method and speech recognition device |
| US5855000A (en) * | 1995-09-08 | 1998-12-29 | Carnegie Mellon University | Method and apparatus for correcting and repairing machine-transcribed input using independent or cross-modal secondary input |
| JP2000056792A (ja) * | 1998-05-25 | 2000-02-25 | Nokia Mobile Phones Ltd | Method and device for recognizing a user's utterance |
| JP2000056796A (ja) * | 1998-08-07 | 2000-02-25 | Asahi Chem Ind Co Ltd | Speech input device and method |
| JP2000339305A (ja) * | 1999-05-31 | 2000-12-08 | Toshiba Corp | Document creation device and document creation method |
| JP2001042996A (ja) * | 1999-07-28 | 2001-02-16 | Toshiba Corp | Document creation device and document creation method |
| JP2001159896A (ja) * | 1999-12-02 | 2001-06-12 | Nec Software Okinawa Ltd | Simple character input method using a speech recognition function |
Family Cites Families (35)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0122880A2 (en) * | 1983-04-19 | 1984-10-24 | E.S.P. Elektronische Spezialprojekte Aktiengesellschaft | Electronic apparatus for high-speed writing on electronic typewriters, printers, photocomposers, processors and the like |
| US5027406A (en) * | 1988-12-06 | 1991-06-25 | Dragon Systems, Inc. | Method for interactive speech recognition and training |
| EP0505621A3 (en) * | 1991-03-28 | 1993-06-02 | International Business Machines Corporation | Improved message recognition employing integrated speech and handwriting information |
| US5502774A (en) * | 1992-06-09 | 1996-03-26 | International Business Machines Corporation | Automatic recognition of a consistent message using multiple complimentary sources of information |
| JP2986345B2 (ja) * | 1993-10-18 | 1999-12-06 | インターナショナル・ビジネス・マシーンズ・コーポレイション | 音声記録指標化装置及び方法 |
| US5818437A (en) * | 1995-07-26 | 1998-10-06 | Tegic Communications, Inc. | Reduced keyboard disambiguating computer |
| US5960447A (en) * | 1995-11-13 | 1999-09-28 | Holt; Douglas | Word tagging and editing system for speech recognition |
| US6122613A (en) * | 1997-01-30 | 2000-09-19 | Dragon Systems, Inc. | Speech recognition using multiple recognizers (selectively) applied to the same input sample |
| WO1999000790A1 (en) * | 1997-06-27 | 1999-01-07 | M.H. Segan Limited Partnership | Speech recognition computer input and device |
| US6219453B1 (en) * | 1997-08-11 | 2001-04-17 | At&T Corp. | Method and apparatus for performing an automatic correction of misrecognized words produced by an optical character recognition technique by using a Hidden Markov Model based algorithm |
| US6418431B1 (en) * | 1998-03-30 | 2002-07-09 | Microsoft Corporation | Information retrieval and speech recognition based on language models |
| US6078885A (en) * | 1998-05-08 | 2000-06-20 | At&T Corp | Verbal, fully automatic dictionary updates by end-users of speech synthesis and recognition systems |
| US6438523B1 (en) * | 1998-05-20 | 2002-08-20 | John A. Oberteuffer | Processing handwritten and hand-drawn input and speech input |
| US6457031B1 (en) * | 1998-09-02 | 2002-09-24 | International Business Machines Corp. | Method of marking previously dictated text for deferred correction in a speech recognition proofreader |
| US6167376A (en) * | 1998-12-21 | 2000-12-26 | Ditzik; Richard Joseph | Computer system with integrated telephony, handwriting and speech recognition functions |
| US6442518B1 (en) * | 1999-07-14 | 2002-08-27 | Compaq Information Technologies Group, L.P. | Method for refining time alignments of closed captions |
| US6904405B2 (en) * | 1999-07-17 | 2005-06-07 | Edwin A. Suominen | Message recognition using shared language model |
| US6789231B1 (en) * | 1999-10-05 | 2004-09-07 | Microsoft Corporation | Method and system for providing alternatives for text derived from stochastic input sources |
| US7149970B1 (en) * | 2000-06-23 | 2006-12-12 | Microsoft Corporation | Method and system for filtering and selecting from a candidate list generated by a stochastic input method |
| US7243069B2 (en) * | 2000-07-28 | 2007-07-10 | International Business Machines Corporation | Speech recognition by automated context creation |
| US6836759B1 (en) * | 2000-08-22 | 2004-12-28 | Microsoft Corporation | Method and system of handling the selection of alternates for recognized words |
| US6788815B2 (en) * | 2000-11-10 | 2004-09-07 | Microsoft Corporation | System and method for accepting disparate types of user input |
| US20020152071A1 (en) * | 2001-04-12 | 2002-10-17 | David Chaiken | Human-augmented, automatic speech recognition engine |
| US20020152075A1 (en) * | 2001-04-16 | 2002-10-17 | Shao-Tsu Kung | Composite input method |
| US6839667B2 (en) * | 2001-05-16 | 2005-01-04 | International Business Machines Corporation | Method of speech recognition by presenting N-best word candidates |
| US6996525B2 (en) * | 2001-06-15 | 2006-02-07 | Intel Corporation | Selecting one of multiple speech recognizers in a system based on performance predections resulting from experience |
| US7058575B2 (en) * | 2001-06-27 | 2006-06-06 | Intel Corporation | Integrating keyword spotting with graph decoder to improve the robustness of speech recognition |
| US7467089B2 (en) * | 2001-09-05 | 2008-12-16 | Roth Daniel L | Combined speech and handwriting recognition |
| US6708148B2 (en) * | 2001-10-12 | 2004-03-16 | Koninklijke Philips Electronics N.V. | Correction device to mark parts of a recognized text |
| US7124085B2 (en) * | 2001-12-13 | 2006-10-17 | Matsushita Electric Industrial Co., Ltd. | Constraint-based speech recognition system and method |
| US7103542B2 (en) * | 2001-12-14 | 2006-09-05 | Ben Franklin Patent Holding Llc | Automatically improving a voice recognition system |
| US20030112277A1 (en) * | 2001-12-14 | 2003-06-19 | Koninklijke Philips Electronics N.V. | Input of data using a combination of data input systems |
| US6986106B2 (en) * | 2002-05-13 | 2006-01-10 | Microsoft Corporation | Correction widget |
| US7137076B2 (en) * | 2002-07-30 | 2006-11-14 | Microsoft Corporation | Correcting recognition results associated with user input |
| US7228275B1 (en) * | 2002-10-21 | 2007-06-05 | Toyota Infotechnology Center Co., Ltd. | Speech recognition system having multiple speech recognizers |
2002
- 2002-02-07 DE: application DE10204924A, patent DE10204924A1 (de), Withdrawn

2003
- 2003-01-30 US: application US10/503,420, patent US20060167685A1 (en), Abandoned
- 2003-01-30 AT: application AT03702838T, patent ATE358869T1 (de), IP Right Cessation
- 2003-01-30 JP: application JP2003566843A, patent JP2005517216A (ja), Pending
- 2003-01-30 AU: application AU2003205955A, patent AU2003205955A1 (en), Abandoned
- 2003-01-30 DE: application DE60312963T, patent DE60312963T2 (de), Expired - Lifetime
- 2003-01-30 WO: application PCT/IB2003/000374, patent WO2003067573A1 (en), Ceased
- 2003-01-30 EP: application EP03702838A, patent EP1479070B1 (en), Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| WO2003067573A1 (en) | 2003-08-14 |
| DE60312963T2 (de) | 2007-12-13 |
| EP1479070B1 (en) | 2007-04-04 |
| ATE358869T1 (de) | 2007-04-15 |
| DE60312963D1 (de) | 2007-05-16 |
| AU2003205955A1 (en) | 2003-09-02 |
| EP1479070A1 (en) | 2004-11-24 |
| DE10204924A1 (de) | 2003-08-21 |
| US20060167685A1 (en) | 2006-07-27 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20180143956A1 (en) | | Real-time caption correction by audience |
| US11922944B2 (en) | | Phrase alternatives representation for automatic speech recognition and methods of use |
| KR100996212B1 (ko) | | Method, system, and program for speech recognition |
| JP4829901B2 (ja) | | Method and device for confirming manually entered, uncertain text input by using speech input |
| US7848926B2 (en) | | System, method, and program for correcting misrecognized spoken words by selecting appropriate correction word from one or more competitive words |
| US7505911B2 (en) | | Combined speech recognition and sound recording |
| US7526431B2 (en) | | Speech recognition using ambiguous or phone key spelling and/or filtering |
| RU2379767C2 (ru) | | Error correction for speech recognition systems |
| US7809574B2 (en) | | Word recognition using choice lists |
| US7983912B2 (en) | | Apparatus, method, and computer program product for correcting a misrecognized utterance using a whole or a partial re-utterance |
| US6735565B2 (en) | | Select a recognition error by comparing the phonetic |
| US20090326938A1 (en) | | Multiword text correction |
| US20180144747A1 (en) | | Real-time caption correction by moderator |
| US20050038657A1 (en) | | Combined speech recongnition and text-to-speech generation |
| US20050159948A1 (en) | | Combined speech and handwriting recognition |
| US20110208507A1 (en) | | Speech Correction for Typed Input |
| JP5787780B2 (ja) | | Transcription support system and transcription support method |
| CN101415259A (zh) | | Information retrieval system and method based on bilingual voice queries on embedded devices |
| EP1479070B1 (en) | | Method and device for the rapid, pattern-recognition-supported transcription of spoken and written utterances |
| Marx et al. | | Putting people first: Specifying proper names in speech interfaces |
| JP2002189490A (ja) | | Method of pinyin speech input |
| JP2001013992A (ja) | | Speech understanding device |
| JP2686085B2 (ja) | | Speech recognition system |
| Scott | | A Comparative Analysis of Transcription Errors from Major Commercial Automatic Speech Recognition Systems on Speakers of Four Ethnic Backgrounds in the Pacific Northwest |
| JP2547611B2 (ja) | | Text creation system |
Legal Events
| Effective date | Code | Title | Description |
|---|---|---|---|
| 2006-01-27 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 |
| 2009-06-02 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
| 2009-07-15 | A711 | Notification of change in applicant | Free format text: JAPANESE INTERMEDIATE CODE: A711 |
| 2009-09-02 | A601 | Written request for extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A601 |
| 2009-09-09 | A602 | Written permission of extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A602 |
| 2009-10-02 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
| 2009-12-01 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
| 2010-03-01 | A601 | Written request for extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A601 |
| 2010-03-08 | A602 | Written permission of extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A602 |
| 2010-04-01 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
| 2010-04-01 | A524 | Written submission of copy of amendment under article 19 pct | Free format text: JAPANESE INTERMEDIATE CODE: A524 |
| 2010-05-11 | A02 | Decision of refusal | Free format text: JAPANESE INTERMEDIATE CODE: A02 |