AU776890B2 - System and method for improving the accuracy of a speech recognition program - Google Patents

System and method for improving the accuracy of a speech recognition program Download PDF

Info

Publication number
AU776890B2
AU776890B2 AU63835/00A AU6383500A AU776890B2 AU 776890 B2 AU776890 B2 AU 776890B2 AU 63835/00 A AU63835/00 A AU 63835/00A AU 6383500 A AU6383500 A AU 6383500A AU 776890 B2 AU776890 B2 AU 776890B2
Authority
AU
Australia
Prior art keywords
speech recognition
recognition program
written text
invention according
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU63835/00A
Other languages
English (en)
Other versions
AU6383500A (en
Inventor
Thomas P. Flynn
Jonathan Kahn
Nicholas J. Linden
Charles Qin
James A. Sells
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Custom Speech USA Inc
Original Assignee
Custom Speech USA Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/362,255 external-priority patent/US6490558B1/en
Priority claimed from US09/625,657 external-priority patent/US6704709B1/en
Application filed by Custom Speech USA Inc filed Critical Custom Speech USA Inc
Publication of AU6383500A publication Critical patent/AU6383500A/en
Application granted granted Critical
Publication of AU776890B2 publication Critical patent/AU776890B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0631Creating reference templates; Clustering

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
AU63835/00A 1999-07-28 2000-07-27 System and method for improving the accuracy of a speech recognition program Ceased AU776890B2 (en)

Applications Claiming Priority (9)

Application Number Priority Date Filing Date Title
US09/362255 1999-07-28
US09/362,255 US6490558B1 (en) 1999-07-28 1999-07-28 System and method for improving the accuracy of a speech recognition program through repetitive training
US09/430,144 US6421643B1 (en) 1999-07-28 1999-10-29 Method and apparatus for directing an audio file to a speech recognition program that does not accept such files
US09/430144 1999-10-29
US20887800P 2000-06-01 2000-06-01
US60/208878 2000-06-01
US09/625,657 US6704709B1 (en) 1999-07-28 2000-07-26 System and method for improving the accuracy of a speech recognition program
US09/625657 2000-07-26
PCT/US2000/020467 WO2001009877A2 (fr) 1999-07-28 2000-07-27 Systeme et procede pour ameliorer la precision d'un programme de reconnaissance vocale

Publications (2)

Publication Number Publication Date
AU6383500A AU6383500A (en) 2001-02-19
AU776890B2 true AU776890B2 (en) 2004-09-23

Family

ID=27498742

Family Applications (1)

Application Number Title Priority Date Filing Date
AU63835/00A Ceased AU776890B2 (en) 1999-07-28 2000-07-27 System and method for improving the accuracy of a speech recognition program

Country Status (5)

Country Link
EP (1) EP1509902A4 (fr)
AU (1) AU776890B2 (fr)
CA (1) CA2380433A1 (fr)
NZ (1) NZ516956A (fr)
WO (1) WO2001009877A2 (fr)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2885247B1 (fr) * 2005-04-27 2007-08-31 Marc Bendayan Equipement de reconnaissance de la parole.
US8521510B2 (en) * 2006-08-31 2013-08-27 At&T Intellectual Property Ii, L.P. Method and system for providing an automated web transcription service
JP2012189930A (ja) 2011-03-14 2012-10-04 Seiko Epson Corp プロジェクター
CN112329926A (zh) * 2020-11-30 2021-02-05 珠海采筑电子商务有限公司 智能机器人的质量改善方法及系统

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4914704A (en) * 1984-10-30 1990-04-03 International Business Machines Corporation Text editor for speech input
US5960447A (en) * 1995-11-13 1999-09-28 Holt; Douglas Word tagging and editing system for speech recognition
US6064957A (en) * 1997-08-15 2000-05-16 General Electric Company Improving speech recognition through text-based linguistic post-processing

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4994966A (en) * 1988-03-31 1991-02-19 Emerson & Stern Associates, Inc. System and method for natural language parsing by initiating processing prior to entry of complete sentences
JP2986345B2 (ja) * 1993-10-18 1999-12-06 インターナショナル・ビジネス・マシーンズ・コーポレイション 音声記録指標化装置及び方法
US5883986A (en) * 1995-06-02 1999-03-16 Xerox Corporation Method and system for automatic transcription correction
US5712957A (en) * 1995-09-08 1998-01-27 Carnegie Mellon University Locating and correcting erroneously recognized portions of utterances by rescoring based on two n-best lists
GB9709341D0 (en) * 1997-05-08 1997-06-25 British Broadcasting Corp Method of and apparatus for editing audio or audio-visual recordings
US6353809B2 (en) * 1997-06-06 2002-03-05 Olympus Optical, Ltd. Speech recognition with text generation from portions of voice data preselected by manual-input commands

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4914704A (en) * 1984-10-30 1990-04-03 International Business Machines Corporation Text editor for speech input
US5960447A (en) * 1995-11-13 1999-09-28 Holt; Douglas Word tagging and editing system for speech recognition
US6064957A (en) * 1997-08-15 2000-05-16 General Electric Company Improving speech recognition through text-based linguistic post-processing

Also Published As

Publication number Publication date
WO2001009877A9 (fr) 2002-07-11
EP1509902A4 (fr) 2005-08-17
EP1509902A2 (fr) 2005-03-02
WO2001009877A3 (fr) 2004-10-28
CA2380433A1 (fr) 2001-02-08
NZ516956A (en) 2004-11-26
AU6383500A (en) 2001-02-19
WO2001009877A2 (fr) 2001-02-08

Similar Documents

Publication Publication Date Title
US6704709B1 (en) System and method for improving the accuracy of a speech recognition program
US6490558B1 (en) System and method for improving the accuracy of a speech recognition program through repetitive training
US6122614A (en) System and method for automating transcription services
US6961699B1 (en) Automated transcription system and method using two speech converting instances and computer-assisted correction
US4866778A (en) Interactive speech recognition apparatus
EP1183680B1 (fr) Systeme de transcription automatique et procede utilisant deux instances de conversion vocale et une correction assistee par ordinateur
JP3610083B2 (ja) マルチメディアプレゼンテーション装置および方法
US6161087A (en) Speech-recognition-assisted selective suppression of silent and filled speech pauses during playback of an audio recording
US20080255837A1 (en) Method for locating an audio segment within an audio file
US6535848B1 (en) Method and apparatus for transcribing multiple files into a single document
US20030004724A1 (en) Speech recognition program mapping tool to align an audio file to verbatim text
US20050131559A1 (en) Method for locating an audio segment within an audio file
JP3065924B2 (ja) 音声注釈方法、テキスト入力ストリームの音声注釈を機能強化するための方法および装置
KR20000057795A (ko) 음독이 미숙한 자용 및 표시기가 없는 장치용 음성 인식등록 방법 및 장치
US7120581B2 (en) System and method for identifying an identical audio segment using text comparison
AU776890B2 (en) System and method for improving the accuracy of a speech recognition program
AU3588200A (en) System and method for automating transcription services
WO2001093058A1 (fr) Systeme et procede servant a comparer un texte genere en association avec un programme de reconnaissance vocale
US20050125236A1 (en) Automatic capture of intonation cues in audio segments for speech applications
JP2723214B2 (ja) 音声文書作成装置
AU2004233462B2 (en) Automated transcription system and method using two speech converting instances and computer-assisted correction
US9684437B2 (en) Memorization system and method
JP2835320B2 (ja) 音声文書作成装置
Hewitt et al. Real-Time Speech-Generated Subtitles: Problems and Solutions
JPH0644060A (ja) プログラム開発支援方法およびその装置