NZ516956A - System and method for improving the accuracy of a speech recognition program - Google Patents
System and method for improving the accuracy of a speech recognition programInfo
- Publication number
- NZ516956A NZ516956A NZ516956A NZ51695600A NZ516956A NZ 516956 A NZ516956 A NZ 516956A NZ 516956 A NZ516956 A NZ 516956A NZ 51695600 A NZ51695600 A NZ 51695600A NZ 516956 A NZ516956 A NZ 516956A
- Authority
- NZ
- New Zealand
- Prior art keywords
- speech recognition
- recognition program
- written text
- speech
- text
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 53
- 238000006243 chemical reaction Methods 0.000 claims abstract description 20
- 239000000872 buffer Substances 0.000 claims description 18
- 241000282414 Homo sapiens Species 0.000 claims description 15
- 230000001360 synchronised effect Effects 0.000 claims description 12
- 230000006870 function Effects 0.000 claims description 11
- 230000003213 activating effect Effects 0.000 claims description 6
- 230000000977 initiatory effect Effects 0.000 claims description 5
- 238000012937 correction Methods 0.000 description 42
- 238000013459 approach Methods 0.000 description 33
- 238000012549 training Methods 0.000 description 22
- 238000013518 transcription Methods 0.000 description 11
- 230000035897 transcription Effects 0.000 description 11
- 230000008569 process Effects 0.000 description 7
- 238000010586 diagram Methods 0.000 description 6
- 230000011218 segmentation Effects 0.000 description 5
- 230000008859 change Effects 0.000 description 3
- 230000002452 interceptive effect Effects 0.000 description 3
- 230000003278 mimic effect Effects 0.000 description 3
- 238000013522 software testing Methods 0.000 description 3
- TVZRAEYQIKYCPH-UHFFFAOYSA-N 3-(trimethylsilyl)propane-1-sulfonic acid Chemical compound C[Si](C)(C)CCCS(O)(=O)=O TVZRAEYQIKYCPH-UHFFFAOYSA-N 0.000 description 2
- 241000233805 Phoenix Species 0.000 description 2
- 239000006227 byproduct Substances 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000007781 pre-processing Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 206010011732 Cyst Diseases 0.000 description 1
- 241000270666 Testudines Species 0.000 description 1
- 241000736774 Uria aalge Species 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 235000014510 cooky Nutrition 0.000 description 1
- 208000031513 cyst Diseases 0.000 description 1
- 229910003460 diamond Inorganic materials 0.000 description 1
- 239000010432 diamond Substances 0.000 description 1
- 238000012553 document review Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- PWPJGUXAGUPAHP-UHFFFAOYSA-N lufenuron Chemical compound C1=C(Cl)C(OC(F)(F)C(C(F)(F)F)F)=CC(Cl)=C1NC(=O)NC(=O)C1=C(F)C=CC=C1F PWPJGUXAGUPAHP-UHFFFAOYSA-N 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 239000012536 storage buffer Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0631—Creating reference templates; Clustering
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/362,255 US6490558B1 (en) | 1999-07-28 | 1999-07-28 | System and method for improving the accuracy of a speech recognition program through repetitive training |
US09/430,144 US6421643B1 (en) | 1999-07-28 | 1999-10-29 | Method and apparatus for directing an audio file to a speech recognition program that does not accept such files |
US20887800P | 2000-06-01 | 2000-06-01 | |
US09/625,657 US6704709B1 (en) | 1999-07-28 | 2000-07-26 | System and method for improving the accuracy of a speech recognition program |
PCT/US2000/020467 WO2001009877A2 (fr) | 1999-07-28 | 2000-07-27 | Systeme et procede pour ameliorer la precision d'un programme de reconnaissance vocale |
Publications (1)
Publication Number | Publication Date |
---|---|
NZ516956A true NZ516956A (en) | 2004-11-26 |
Family
ID=27498742
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
NZ516956A NZ516956A (en) | 1999-07-28 | 2000-07-27 | System and method for improving the accuracy of a speech recognition program |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP1509902A4 (fr) |
AU (1) | AU776890B2 (fr) |
CA (1) | CA2380433A1 (fr) |
NZ (1) | NZ516956A (fr) |
WO (1) | WO2001009877A2 (fr) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2885247B1 (fr) * | 2005-04-27 | 2007-08-31 | Marc Bendayan | Equipement de reconnaissance de la parole. |
US8521510B2 (en) * | 2006-08-31 | 2013-08-27 | At&T Intellectual Property Ii, L.P. | Method and system for providing an automated web transcription service |
JP2012189930A (ja) | 2011-03-14 | 2012-10-04 | Seiko Epson Corp | プロジェクター |
CN112329926A (zh) * | 2020-11-30 | 2021-02-05 | 珠海采筑电子商务有限公司 | 智能机器人的质量改善方法及系统 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4914704A (en) * | 1984-10-30 | 1990-04-03 | International Business Machines Corporation | Text editor for speech input |
US4994966A (en) * | 1988-03-31 | 1991-02-19 | Emerson & Stern Associates, Inc. | System and method for natural language parsing by initiating processing prior to entry of complete sentences |
JP2986345B2 (ja) * | 1993-10-18 | 1999-12-06 | インターナショナル・ビジネス・マシーンズ・コーポレイション | 音声記録指標化装置及び方法 |
US5883986A (en) * | 1995-06-02 | 1999-03-16 | Xerox Corporation | Method and system for automatic transcription correction |
US5712957A (en) * | 1995-09-08 | 1998-01-27 | Carnegie Mellon University | Locating and correcting erroneously recognized portions of utterances by rescoring based on two n-best lists |
US5960447A (en) * | 1995-11-13 | 1999-09-28 | Holt; Douglas | Word tagging and editing system for speech recognition |
GB9709341D0 (en) * | 1997-05-08 | 1997-06-25 | British Broadcasting Corp | Method of and apparatus for editing audio or audio-visual recordings |
US6353809B2 (en) * | 1997-06-06 | 2002-03-05 | Olympus Optical, Ltd. | Speech recognition with text generation from portions of voice data preselected by manual-input commands |
US6064957A (en) * | 1997-08-15 | 2000-05-16 | General Electric Company | Improving speech recognition through text-based linguistic post-processing |
-
2000
- 2000-07-27 EP EP00950784A patent/EP1509902A4/fr not_active Withdrawn
- 2000-07-27 NZ NZ516956A patent/NZ516956A/en unknown
- 2000-07-27 CA CA002380433A patent/CA2380433A1/fr not_active Abandoned
- 2000-07-27 WO PCT/US2000/020467 patent/WO2001009877A2/fr not_active Application Discontinuation
- 2000-07-27 AU AU63835/00A patent/AU776890B2/en not_active Ceased
Also Published As
Publication number | Publication date |
---|---|
WO2001009877A9 (fr) | 2002-07-11 |
WO2001009877A3 (fr) | 2004-10-28 |
EP1509902A4 (fr) | 2005-08-17 |
CA2380433A1 (fr) | 2001-02-08 |
EP1509902A2 (fr) | 2005-03-02 |
AU776890B2 (en) | 2004-09-23 |
AU6383500A (en) | 2001-02-19 |
WO2001009877A2 (fr) | 2001-02-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6704709B1 (en) | System and method for improving the accuracy of a speech recognition program | |
US6490558B1 (en) | System and method for improving the accuracy of a speech recognition program through repetitive training | |
US6122614A (en) | System and method for automating transcription services | |
US6961699B1 (en) | Automated transcription system and method using two speech converting instances and computer-assisted correction | |
US6161087A (en) | Speech-recognition-assisted selective suppression of silent and filled speech pauses during playback of an audio recording | |
EP1183680B1 (fr) | Systeme de transcription automatique et procede utilisant deux instances de conversion vocale et une correction assistee par ordinateur | |
US4866778A (en) | Interactive speech recognition apparatus | |
US6535848B1 (en) | Method and apparatus for transcribing multiple files into a single document | |
US7006967B1 (en) | System and method for automating transcription services | |
US20080255837A1 (en) | Method for locating an audio segment within an audio file | |
US20050222843A1 (en) | System for permanent alignment of text utterances to their associated audio utterances | |
US20050131559A1 (en) | Method for locating an audio segment within an audio file | |
US7120581B2 (en) | System and method for identifying an identical audio segment using text comparison | |
AU776890B2 (en) | System and method for improving the accuracy of a speech recognition program | |
AU3588200A (en) | System and method for automating transcription services | |
US20050125236A1 (en) | Automatic capture of intonation cues in audio segments for speech applications | |
WO2001093058A1 (fr) | Systeme et procede servant a comparer un texte genere en association avec un programme de reconnaissance vocale | |
AU2004233462B2 (en) | Automated transcription system and method using two speech converting instances and computer-assisted correction | |
JP3285145B2 (ja) | 録音音声データベース検証方法 | |
US9684437B2 (en) | Memorization system and method | |
JP2835320B2 (ja) | 音声文書作成装置 | |
Hewitt et al. | Real-Time Speech-Generated Subtitles: Problems and Solutions | |
JPH0644060A (ja) | プログラム開発支援方法およびその装置 | |
JP2002049389A (ja) | 音声認識方法およびそのプログラム記録媒体 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PSEA | Patent sealed | ||
RENW | Renewal (renewal fees accepted) |