ATE417347T1 - Fehlerdetektion für sprach-zu-text- transkriptionssysteme - Google Patents
Fehlerdetektion für sprach-zu-text- transkriptionssystemeInfo
- Publication number
- ATE417347T1 ATE417347T1 AT04791820T AT04791820T ATE417347T1 AT E417347 T1 ATE417347 T1 AT E417347T1 AT 04791820 T AT04791820 T AT 04791820T AT 04791820 T AT04791820 T AT 04791820T AT E417347 T1 ATE417347 T1 AT E417347T1
- Authority
- AT
- Austria
- Prior art keywords
- speech
- text
- proof
- transcribed
- signal
- Prior art date
Links
- 238000013518 transcription Methods 0.000 title abstract 4
- 230000035897 transcription Effects 0.000 title abstract 4
- 238000001514 detection method Methods 0.000 title 1
- 238000000034 method Methods 0.000 abstract 4
- 230000001915 proofreading effect Effects 0.000 abstract 2
- 238000004590 computer program Methods 0.000 abstract 1
- 230000002708 enhancing effect Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G10L2021/0135—Voice conversion or morphing
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Document Processing Apparatus (AREA)
- Machine Translation (AREA)
- Debugging And Monitoring (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP03104078 | 2003-11-05 |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE417347T1 true ATE417347T1 (de) | 2008-12-15 |
Family
ID=34560196
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT04791820T ATE417347T1 (de) | 2003-11-05 | 2004-10-27 | Fehlerdetektion für sprach-zu-text- transkriptionssysteme |
Country Status (7)
Country | Link |
---|---|
US (1) | US7617106B2 (de) |
EP (1) | EP1702319B1 (de) |
JP (1) | JP4714694B2 (de) |
CN (1) | CN1879146B (de) |
AT (1) | ATE417347T1 (de) |
DE (1) | DE602004018385D1 (de) |
WO (1) | WO2005045803A1 (de) |
Families Citing this family (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6910481B2 (en) * | 2003-03-28 | 2005-06-28 | Ric Investments, Inc. | Pressure support compliance monitoring system |
US9520068B2 (en) * | 2004-09-10 | 2016-12-13 | Jtt Holdings, Inc. | Sentence level analysis in a reading tutor |
US8014650B1 (en) * | 2006-01-24 | 2011-09-06 | Adobe Systems Incorporated | Feedback of out-of-range signals |
FR2902542B1 (fr) * | 2006-06-16 | 2012-12-21 | Gilles Vessiere Consultants | Correcteur semantiques, syntaxique et/ou lexical, procede de correction, ainsi que support d'enregistrement et programme d'ordinateur pour la mise en oeuvre de ce procede |
KR101373336B1 (ko) | 2007-08-08 | 2014-03-10 | 엘지전자 주식회사 | 방송수신 휴대단말기 |
US9280971B2 (en) * | 2009-02-27 | 2016-03-08 | Blackberry Limited | Mobile wireless communications device with speech to text conversion and related methods |
CN102163379B (zh) * | 2010-02-24 | 2013-03-13 | 英业达股份有限公司 | 听写文章之校正语音的定位与播放系统及其方法 |
US20150279354A1 (en) * | 2010-05-19 | 2015-10-01 | Google Inc. | Personalization and Latency Reduction for Voice-Activated Commands |
US9236045B2 (en) * | 2011-05-23 | 2016-01-12 | Nuance Communications, Inc. | Methods and apparatus for proofing of a text input |
AU2013251457A1 (en) * | 2012-04-27 | 2014-10-09 | Interactive Intelligence, Inc. | Negative example (anti-word) based performance improvement for speech recognition |
CN102665012B (zh) * | 2012-05-02 | 2015-07-08 | 江苏南大数码科技有限公司 | 远程电话语音查询平台故障自动巡检方法 |
US9135916B2 (en) * | 2013-02-26 | 2015-09-15 | Honeywell International Inc. | System and method for correcting accent induced speech transmission problems |
CN105493425B (zh) | 2013-08-29 | 2019-04-30 | 统一有限责任两合公司 | 在拥挤的通信信道中维持音频通信 |
US10069965B2 (en) | 2013-08-29 | 2018-09-04 | Unify Gmbh & Co. Kg | Maintaining audio communication in a congested communication channel |
KR101808810B1 (ko) * | 2013-11-27 | 2017-12-14 | 한국전자통신연구원 | 음성/무음성 구간 검출 방법 및 장치 |
CN105374356B (zh) * | 2014-08-29 | 2019-07-30 | 株式会社理光 | 语音识别方法、语音评分方法、语音识别系统及语音评分系统 |
US20160379640A1 (en) * | 2015-06-24 | 2016-12-29 | Honeywell International Inc. | System and method for aircraft voice-to-text communication with message validation |
JP6605995B2 (ja) * | 2016-03-16 | 2019-11-13 | 株式会社東芝 | 音声認識誤り修正装置、方法及びプログラム |
US10650810B2 (en) | 2016-10-20 | 2020-05-12 | Google Llc | Determining phonetic relationships |
US10446138B2 (en) * | 2017-05-23 | 2019-10-15 | Verbit Software Ltd. | System and method for assessing audio files for transcription services |
CN109949828B (zh) * | 2017-12-20 | 2022-05-24 | 苏州君林智能科技有限公司 | 一种文字校验方法及装置 |
CN112567456A (zh) * | 2018-07-16 | 2021-03-26 | 万卷智能有限公司 | 学习辅助工具 |
KR102615154B1 (ko) * | 2019-02-28 | 2023-12-18 | 삼성전자주식회사 | 전자 장치 및 전자 장치의 제어 방법 |
US11410658B1 (en) * | 2019-10-29 | 2022-08-09 | Dialpad, Inc. | Maintainable and scalable pipeline for automatic speech recognition language modeling |
US20240095449A1 (en) * | 2022-09-16 | 2024-03-21 | Verizon Patent And Licensing Inc. | Systems and methods for adjusting a transcript based on output from a machine learning model |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS61233832A (ja) * | 1985-04-08 | 1986-10-18 | Toshiba Corp | 読合わせ校正装置 |
JP2585547B2 (ja) * | 1986-09-19 | 1997-02-26 | 株式会社日立製作所 | 音声入出力装置における入力音声の修正方法 |
JPH0488399A (ja) * | 1990-08-01 | 1992-03-23 | Clarion Co Ltd | 音声認識装置 |
GB2303955B (en) * | 1996-09-24 | 1997-05-14 | Allvoice Computing Plc | Data processing method and apparatus |
US6088674A (en) * | 1996-12-04 | 2000-07-11 | Justsystem Corp. | Synthesizing a voice by developing meter patterns in the direction of a time axis according to velocity and pitch of a voice |
US5987405A (en) * | 1997-06-24 | 1999-11-16 | International Business Machines Corporation | Speech compression by speech recognition |
JP3519259B2 (ja) * | 1997-12-29 | 2004-04-12 | 京セラ株式会社 | 音声認識作動装置 |
DE19824450C2 (de) | 1998-05-30 | 2001-05-31 | Grundig Ag | Verfahren und Vorrichtung zur Verarbeitung von Sprachsignalen |
US6490563B2 (en) * | 1998-08-17 | 2002-12-03 | Microsoft Corporation | Proofreading with text to speech feedback |
US6338038B1 (en) * | 1998-09-02 | 2002-01-08 | International Business Machines Corp. | Variable speed audio playback in speech recognition proofreader |
US6064965A (en) * | 1998-09-02 | 2000-05-16 | International Business Machines Corporation | Combined audio playback in speech recognition proofreader |
US6219638B1 (en) * | 1998-11-03 | 2001-04-17 | International Business Machines Corporation | Telephone messaging and editing system |
DE19920501A1 (de) * | 1999-05-05 | 2000-11-09 | Nokia Mobile Phones Ltd | Wiedergabeverfahren für sprachgesteuerte Systeme mit textbasierter Sprachsynthese |
US6611802B2 (en) * | 1999-06-11 | 2003-08-26 | International Business Machines Corporation | Method and system for proofreading and correcting dictated text |
US6370503B1 (en) * | 1999-06-30 | 2002-04-09 | International Business Machines Corp. | Method and apparatus for improving speech recognition accuracy |
US7010489B1 (en) * | 2000-03-09 | 2006-03-07 | International Business Mahcines Corporation | Method for guiding text-to-speech output timing using speech recognition markers |
DE10304229A1 (de) * | 2003-01-28 | 2004-08-05 | Deutsche Telekom Ag | Kommunikationssystem, Kommunikationsendeinrichtung und Vorrichtung zum Erkennen fehlerbehafteter Text-Nachrichten |
-
2004
- 2004-10-27 DE DE602004018385T patent/DE602004018385D1/de active Active
- 2004-10-27 CN CN200480032825.6A patent/CN1879146B/zh active Active
- 2004-10-27 US US10/578,073 patent/US7617106B2/en active Active
- 2004-10-27 EP EP04791820A patent/EP1702319B1/de active Active
- 2004-10-27 AT AT04791820T patent/ATE417347T1/de not_active IP Right Cessation
- 2004-10-27 JP JP2006537527A patent/JP4714694B2/ja not_active Expired - Fee Related
- 2004-10-27 WO PCT/IB2004/052218 patent/WO2005045803A1/en active Application Filing
Also Published As
Publication number | Publication date |
---|---|
US20070027686A1 (en) | 2007-02-01 |
EP1702319B1 (de) | 2008-12-10 |
JP2007510943A (ja) | 2007-04-26 |
WO2005045803A1 (en) | 2005-05-19 |
EP1702319A1 (de) | 2006-09-20 |
WO2005045803A8 (en) | 2006-08-10 |
CN1879146A (zh) | 2006-12-13 |
DE602004018385D1 (de) | 2009-01-22 |
CN1879146B (zh) | 2011-06-08 |
US7617106B2 (en) | 2009-11-10 |
JP4714694B2 (ja) | 2011-06-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE417347T1 (de) | Fehlerdetektion für sprach-zu-text- transkriptionssysteme | |
DE60010106D1 (de) | Verfahren und vorrichtung zum unterscheidenden training von akustischen modellen in einem spracherkennungssystem | |
CN110148402A (zh) | 语音处理方法、装置、计算机设备及存储介质 | |
Shahnawazuddin et al. | Pitch-Adaptive Front-End Features for Robust Children's ASR. | |
ATE297588T1 (de) | Anpassung des phonetischen kontextes zur verbesserung der spracherkennung | |
EP1901286A2 (de) | Sprachverbesserungsvorrichtung, Sprachaufzeichnungsvorrichtung, Sprachverbesserungsprogramm, Sprachaufzeichnungsprogramm, Sprachverbesserungsverfahren und Sprachaufzeichnungsverfahren | |
ATE235733T1 (de) | Anordnung und verfahren zur erkennung eines vorgegebenen wortschatzes in gesprochener sprache durch einen rechner | |
ATE320650T1 (de) | Verfahren zur erweiterung des wortschatzes eines spracherkennungssystems | |
CN101114447A (zh) | 语音翻译装置和方法 | |
WO2009025356A1 (ja) | 音声認識装置および音声認識方法 | |
WO2007117814A3 (en) | Voice signal perturbation for speech recognition | |
ATE492875T1 (de) | Sprachanalysesystem | |
Nanjo et al. | Speaking-rate dependent decoding and adaptation for spontaneous lecture speech recognition | |
Chandra et al. | An overview of speech recognition and speech synthesis algorithms | |
ATE441918T1 (de) | Sprachdialogverfahren und -system | |
JP2015215503A (ja) | 音声認識方法、音声認識装置および音声認識プログラム | |
O'Shaughnessy | Correcting complex false starts in spontaneous speech | |
Sudhakar et al. | Automatic speech segmentation to improve speech synthesis performance | |
Ishimitsu et al. | Construction of speech support system using body-conducted speech recognition for disorders | |
Gales | Acoustic modelling for speech recognition: Hidden Markov Models and beyond? | |
Valentini-Botinhao et al. | Non linear time compression of clear and normal speech at high rates | |
Wang et al. | Improved generation of fundamental frequency in HMM-based speech synthesis using generation process model. | |
Gurunath Shivakumar et al. | Spoken Language Intent Detection using Confusion2Vec | |
DE602004014416D1 (de) | Spracherkennung durch kontextuelle modellierung der spracheinheiten | |
KR20080030338A (ko) | 경계 휴지강도를 이용한 발음변환 방법 및 이를 기반으로하는 음성합성 시스템 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |