JP2003518266A - 音声認識システムのテキスト編集用音声再生 - Google Patents

音声認識システムのテキスト編集用音声再生

Info

Publication number
JP2003518266A
JP2003518266A JP2001547300A JP2001547300A JP2003518266A JP 2003518266 A JP2003518266 A JP 2003518266A JP 2001547300 A JP2001547300 A JP 2001547300A JP 2001547300 A JP2001547300 A JP 2001547300A JP 2003518266 A JP2003518266 A JP 2003518266A
Authority
JP
Japan
Prior art keywords
word
recognized
user
sequence
words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2001547300A
Other languages
English (en)
Japanese (ja)
Other versions
JP2003518266A5 (enExample
Inventor
へリバート ウッテ
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Philips Electronics NV filed Critical Philips Electronics NV
Publication of JP2003518266A publication Critical patent/JP2003518266A/ja
Publication of JP2003518266A5 publication Critical patent/JP2003518266A5/ja
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Document Processing Apparatus (AREA)
JP2001547300A 1999-12-20 2000-12-07 音声認識システムのテキスト編集用音声再生 Pending JP2003518266A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP99204400.8 1999-12-20
EP99204400 1999-12-20
PCT/EP2000/012447 WO2001046853A1 (en) 1999-12-20 2000-12-07 Audio playback for text edition in a speech recognition system

Publications (2)

Publication Number Publication Date
JP2003518266A true JP2003518266A (ja) 2003-06-03
JP2003518266A5 JP2003518266A5 (enExample) 2008-02-14

Family

ID=8241027

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2001547300A Pending JP2003518266A (ja) 1999-12-20 2000-12-07 音声認識システムのテキスト編集用音声再生

Country Status (4)

Country Link
US (1) US6792409B2 (enExample)
EP (2) EP2261893B1 (enExample)
JP (1) JP2003518266A (enExample)
WO (1) WO2001046853A1 (enExample)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004529381A (ja) * 2001-03-29 2004-09-24 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 認識音声に対する同期再生中の文字編集
JP2015203835A (ja) * 2014-04-16 2015-11-16 株式会社日立システムズ テキスト編集装置、テキスト編集方法、及びプログラム
KR20210115671A (ko) * 2020-03-16 2021-09-27 주식회사 한글과컴퓨터 문서 작성 프로그램에서 자주 사용되는 편집 명령에 대한 음성 인식을 가능하게 하는 전자 장치 및 그 동작 방법

Families Citing this family (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7047192B2 (en) * 2000-06-28 2006-05-16 Poirier Darrell A Simultaneous multi-user real-time speech recognition system
GB0018733D0 (en) * 2000-07-31 2000-09-20 Nissen John C D Serial presentation system
WO2002080143A1 (en) 2001-03-29 2002-10-10 Koninklijke Philips Electronics N.V. Synchronise an audio cursor and a text cursor during editing
US7225126B2 (en) * 2001-06-12 2007-05-29 At&T Corp. System and method for processing speech files
DE60207742T2 (de) * 2001-09-17 2006-08-03 Koninklijke Philips Electronics N.V. Korrektur eines von einer spracherkennung erkannten textes mittels vergleich der phonemfolgen des erkannten textes mit einer phonetischen transkription eines manuell eingegebenen korrekturwortes
US6708148B2 (en) * 2001-10-12 2004-03-16 Koninklijke Philips Electronics N.V. Correction device to mark parts of a recognized text
JP4145796B2 (ja) * 2001-10-31 2008-09-03 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ テキストファイルのディクテーションを筆記するための及びテキストを修正するための方法及びシステム
WO2003042975A1 (en) * 2001-11-16 2003-05-22 Koninklijke Philips Electronics N.V. Device to edit a text in predefined windows
TWI225640B (en) * 2002-06-28 2004-12-21 Samsung Electronics Co Ltd Voice recognition device, observation probability calculating device, complex fast fourier transform calculation device and method, cache device, and method of controlling the cache device
US20040024598A1 (en) * 2002-07-03 2004-02-05 Amit Srivastava Thematic segmentation of speech
US20040006628A1 (en) * 2002-07-03 2004-01-08 Scott Shepard Systems and methods for providing real-time alerting
US20040021765A1 (en) * 2002-07-03 2004-02-05 Francis Kubala Speech recognition system for managing telemeetings
CN100383864C (zh) * 2002-10-17 2008-04-23 皇家飞利浦电子股份有限公司 用于重现音频数据的装置和方法以及用于其中的计算机程序产品
US20040083090A1 (en) * 2002-10-17 2004-04-29 Daniel Kiecza Manager for integrating language technology components
DE10251112A1 (de) * 2002-11-02 2004-05-19 Philips Intellectual Property & Standards Gmbh Verfahren und System zur Spracherkennung
EP1611570B1 (en) * 2003-03-31 2017-06-28 Nuance Communications Austria GmbH System for correction of speech recognition results with confidence level indication
EP1692610A2 (en) * 2003-11-28 2006-08-23 Koninklijke Philips Electronics N.V. Method and device for transcribing an audio signal
JP4558308B2 (ja) * 2003-12-03 2010-10-06 ニュアンス コミュニケーションズ,インコーポレイテッド 音声認識システム、データ処理装置、そのデータ処理方法及びプログラム
US8504369B1 (en) 2004-06-02 2013-08-06 Nuance Communications, Inc. Multi-cursor transcription editing
US7836412B1 (en) 2004-12-03 2010-11-16 Escription, Inc. Transcription editing
US20070011012A1 (en) * 2005-07-11 2007-01-11 Steve Yurick Method, system, and apparatus for facilitating captioning of multi-media content
US20080256071A1 (en) * 2005-10-31 2008-10-16 Prasad Datta G Method And System For Selection Of Text For Editing
US20100324895A1 (en) * 2009-01-15 2010-12-23 K-Nfb Reading Technology, Inc. Synchronization for document narration
JP5278064B2 (ja) * 2009-03-16 2013-09-04 富士通モバイルコミュニケーションズ株式会社 携帯端末および音声認識されたテキストデータの利用方法
US20110046950A1 (en) * 2009-08-18 2011-02-24 Priyamvada Sinvhal-Sharma Wireless Dictaphone Features and Interface
US8630971B2 (en) * 2009-11-20 2014-01-14 Indian Institute Of Science System and method of using Multi Pattern Viterbi Algorithm for joint decoding of multiple patterns
US9031831B1 (en) * 2010-01-14 2015-05-12 Abbyy Development Llc Method and system for looking up words on a display screen by OCR comprising a set of base forms of recognized inflected words
US9053098B2 (en) * 2010-01-14 2015-06-09 Abbyy Development Llc Insertion of translation in displayed text consisting of grammatical variations pertaining to gender, number and tense
US8392186B2 (en) 2010-05-18 2013-03-05 K-Nfb Reading Technology, Inc. Audio synchronization for document narration with user-selected playback
WO2012161359A1 (ko) * 2011-05-24 2012-11-29 엘지전자 주식회사 사용자 인터페이스 방법 및 장치
US20130035936A1 (en) * 2011-08-02 2013-02-07 Nexidia Inc. Language transcription
US8255218B1 (en) 2011-09-26 2012-08-28 Google Inc. Directing dictation into input fields
JP5404726B2 (ja) * 2011-09-26 2014-02-05 株式会社東芝 情報処理装置、情報処理方法およびプログラム
US8543397B1 (en) 2012-10-11 2013-09-24 Google Inc. Mobile device voice activation
CN103885743A (zh) * 2012-12-24 2014-06-25 大陆汽车投资(上海)有限公司 结合注视跟踪技术的语音文本输入方法和系统
FR3011353B1 (fr) * 2013-09-27 2016-12-02 Peugeot Citroen Automobiles Sa Procede et dispositif de traitement de la parole d'un utilisateur
EP2866153A1 (en) 2013-10-22 2015-04-29 Agfa Healthcare Speech recognition method and system with simultaneous text editing
US10776419B2 (en) 2014-05-16 2020-09-15 Gracenote Digital Ventures, Llc Audio file quality and accuracy assessment
JP2017199057A (ja) * 2016-04-25 2017-11-02 京セラ株式会社 電子機器、電子機器の制御方法、電子機器の制御装置、制御プログラムおよび電子機器システム
US10269376B1 (en) * 2018-06-28 2019-04-23 Invoca, Inc. Desired signal spotting in noisy, flawed environments
US11289092B2 (en) 2019-09-25 2022-03-29 International Business Machines Corporation Text editing using speech recognition

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6184771A (ja) * 1984-10-03 1986-04-30 Hitachi Ltd 音声入力装置
JPS63269200A (ja) * 1987-04-28 1988-11-07 キヤノン株式会社 音声認識装置
JPH02230199A (ja) * 1989-03-02 1990-09-12 Nec Corp 音声変換装置
US5857099A (en) * 1996-09-27 1999-01-05 Allvoice Computing Plc Speech-to-text dictation system with audio message capability
US5960447A (en) * 1995-11-13 1999-09-28 Holt; Douglas Word tagging and editing system for speech recognition

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4748670A (en) * 1985-05-29 1988-05-31 International Business Machines Corporation Apparatus and method for determining a likely word sequence from labels generated by an acoustic processor
JP2558682B2 (ja) * 1987-03-13 1996-11-27 株式会社東芝 知的ワ−クステ−シヨン
AT390685B (de) 1988-10-25 1990-06-11 Philips Nv System zur textverarbeitung
JPH0362800A (ja) * 1989-07-31 1991-03-18 Toshiba Corp 音声リモートコントロール装置
US5199077A (en) * 1991-09-19 1993-03-30 Xerox Corporation Wordspotting for voice editing and indexing
WO1994006086A1 (en) * 1992-09-04 1994-03-17 Caterpillar Inc. Integrated authoring and translation system
US5369704A (en) * 1993-03-24 1994-11-29 Engate Incorporated Down-line transcription system for manipulating real-time testimony
JP2986345B2 (ja) * 1993-10-18 1999-12-06 インターナショナル・ビジネス・マシーンズ・コーポレイション 音声記録指標化装置及び方法
US5729741A (en) * 1995-04-10 1998-03-17 Golden Enterprises, Inc. System for storage and retrieval of diverse types of information obtained from different media sources which includes video, audio, and text transcriptions
US5850629A (en) * 1996-09-09 1998-12-15 Matsushita Electric Industrial Co., Ltd. User interface controller for text-to-speech synthesizer
US5995936A (en) * 1997-02-04 1999-11-30 Brais; Louis Report generation system and method for capturing prose, audio, and video by voice command and automatically linking sound and image to formatted text locations
US5909667A (en) * 1997-03-05 1999-06-01 International Business Machines Corporation Method and apparatus for fast voice selection of error words in dictated text
GB2323694B (en) * 1997-03-27 2001-07-18 Forum Technology Ltd Adaptation in speech to text conversion
US5970460A (en) * 1997-12-05 1999-10-19 Lernout & Hauspie Speech Products N.V. Speech recognition and editing system
ATE451652T1 (de) * 1998-03-03 2009-12-15 Koninkl Philips Electronics Nv Textverarbeitungssystem mit spracherkennungsvorrichtung und textänderungsmöglichkeit zum verändern von textblockdaten
US6181351B1 (en) * 1998-04-13 2001-01-30 Microsoft Corporation Synchronizing the moveable mouths of animated characters with recorded speech
US6064965A (en) * 1998-09-02 2000-05-16 International Business Machines Corporation Combined audio playback in speech recognition proofreader
US6611802B2 (en) * 1999-06-11 2003-08-26 International Business Machines Corporation Method and system for proofreading and correcting dictated text

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6184771A (ja) * 1984-10-03 1986-04-30 Hitachi Ltd 音声入力装置
JPS63269200A (ja) * 1987-04-28 1988-11-07 キヤノン株式会社 音声認識装置
JPH02230199A (ja) * 1989-03-02 1990-09-12 Nec Corp 音声変換装置
US5960447A (en) * 1995-11-13 1999-09-28 Holt; Douglas Word tagging and editing system for speech recognition
US5857099A (en) * 1996-09-27 1999-01-05 Allvoice Computing Plc Speech-to-text dictation system with audio message capability

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004529381A (ja) * 2001-03-29 2004-09-24 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 認識音声に対する同期再生中の文字編集
JP2015203835A (ja) * 2014-04-16 2015-11-16 株式会社日立システムズ テキスト編集装置、テキスト編集方法、及びプログラム
KR20210115671A (ko) * 2020-03-16 2021-09-27 주식회사 한글과컴퓨터 문서 작성 프로그램에서 자주 사용되는 편집 명령에 대한 음성 인식을 가능하게 하는 전자 장치 및 그 동작 방법
KR102375508B1 (ko) * 2020-03-16 2022-03-17 주식회사 한글과컴퓨터 문서 작성 프로그램에서 자주 사용되는 편집 명령에 대한 음성 인식을 가능하게 하는 전자 장치 및 그 동작 방법

Also Published As

Publication number Publication date
EP1169678B1 (en) 2015-01-21
EP2261893B1 (en) 2016-03-30
EP1169678A1 (en) 2002-01-09
US6792409B2 (en) 2004-09-14
US20010018653A1 (en) 2001-08-30
WO2001046853A1 (en) 2001-06-28
EP2261893A1 (en) 2010-12-15

Similar Documents

Publication Publication Date Title
US6792409B2 (en) Synchronous reproduction in a speech recognition system
US9774747B2 (en) Transcription system
US10522133B2 (en) Methods and apparatus for correcting recognition errors
JP4444396B2 (ja) 音声認識におけるポジション操作
JP4987623B2 (ja) ユーザと音声により対話する装置および方法
US8311832B2 (en) Hybrid-captioning system
US6801897B2 (en) Method of providing concise forms of natural commands
JP3232289B2 (ja) 記号挿入装置およびその方法
US20130035936A1 (en) Language transcription
US8155958B2 (en) Speech-to-text system, speech-to-text method, and speech-to-text program
EP0965978A1 (en) Non-interactive enrollment in speech recognition
JP3065924B2 (ja) 音声注釈方法、テキスト入力ストリームの音声注釈を機能強化するための方法および装置
US20090138266A1 (en) Apparatus, method, and computer program product for recognizing speech
WO2016139670A1 (en) System and method for generating accurate speech transcription from natural speech audio signals
US6345249B1 (en) Automatic analysis of a speech dictated document
KR100845428B1 (ko) 휴대용 단말기의 음성 인식 시스템
JP5451982B2 (ja) 支援装置、プログラムおよび支援方法
JP2002132287A (ja) 音声収録方法および音声収録装置および記憶媒体
EP0899737A2 (en) Script recognition using speech recognition
JP5819147B2 (ja) 音声合成装置、音声合成方法およびプログラム
JP3846300B2 (ja) 録音原稿作成装置および方法
JP4736478B2 (ja) 音声書き起こし支援装置およびその方法ならびにプログラム
JP3958908B2 (ja) 書き起こしテキスト自動生成装置、音声認識装置および記録媒体
JP2003241787A (ja) 音声認識装置および方法、並びにプログラム
JP3277579B2 (ja) 音声認識方法および装置

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20071206

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20071206

RD02 Notification of acceptance of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7422

Effective date: 20090519

A711 Notification of change in applicant

Free format text: JAPANESE INTERMEDIATE CODE: A711

Effective date: 20090715

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20101124

A601 Written request for extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A601

Effective date: 20110222

A602 Written permission of extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A602

Effective date: 20110301

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20110628