ATE417347T1 - Fehlerdetektion für sprach-zu-text- transkriptionssysteme - Google Patents

Fehlerdetektion für sprach-zu-text- transkriptionssysteme

Info

Publication number
ATE417347T1
ATE417347T1 AT04791820T AT04791820T ATE417347T1 AT E417347 T1 ATE417347 T1 AT E417347T1 AT 04791820 T AT04791820 T AT 04791820T AT 04791820 T AT04791820 T AT 04791820T AT E417347 T1 ATE417347 T1 AT E417347T1
Authority
AT
Austria
Prior art keywords
speech
text
proof
transcribed
signal
Prior art date
Application number
AT04791820T
Other languages
English (en)
Inventor
Hauke Schramm
Original Assignee
Koninkl Philips Electronics Nv
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninkl Philips Electronics Nv filed Critical Koninkl Philips Electronics Nv
Application granted granted Critical
Publication of ATE417347T1 publication Critical patent/ATE417347T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • G10L2021/0135Voice conversion or morphing
AT04791820T 2003-11-05 2004-10-27 Fehlerdetektion für sprach-zu-text- transkriptionssysteme ATE417347T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP03104078 2003-11-05

Publications (1)

Publication Number Publication Date
ATE417347T1 true ATE417347T1 (de) 2008-12-15

Family

ID=34560196

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04791820T ATE417347T1 (de) 2003-11-05 2004-10-27 Fehlerdetektion für sprach-zu-text- transkriptionssysteme

Country Status (7)

Country Link
US (1) US7617106B2 (de)
EP (1) EP1702319B1 (de)
JP (1) JP4714694B2 (de)
CN (1) CN1879146B (de)
AT (1) ATE417347T1 (de)
DE (1) DE602004018385D1 (de)
WO (1) WO2005045803A1 (de)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6910481B2 (en) * 2003-03-28 2005-06-28 Ric Investments, Inc. Pressure support compliance monitoring system
US9520068B2 (en) * 2004-09-10 2016-12-13 Jtt Holdings, Inc. Sentence level analysis in a reading tutor
US8014650B1 (en) * 2006-01-24 2011-09-06 Adobe Systems Incorporated Feedback of out-of-range signals
FR2902542B1 (fr) * 2006-06-16 2012-12-21 Gilles Vessiere Consultants Correcteur semantiques, syntaxique et/ou lexical, procede de correction, ainsi que support d'enregistrement et programme d'ordinateur pour la mise en oeuvre de ce procede
KR101373336B1 (ko) 2007-08-08 2014-03-10 엘지전자 주식회사 방송수신 휴대단말기
US9280971B2 (en) * 2009-02-27 2016-03-08 Blackberry Limited Mobile wireless communications device with speech to text conversion and related methods
CN102163379B (zh) * 2010-02-24 2013-03-13 英业达股份有限公司 听写文章之校正语音的定位与播放系统及其方法
US20150279354A1 (en) * 2010-05-19 2015-10-01 Google Inc. Personalization and Latency Reduction for Voice-Activated Commands
US10522133B2 (en) * 2011-05-23 2019-12-31 Nuance Communications, Inc. Methods and apparatus for correcting recognition errors
JP2015520410A (ja) * 2012-04-27 2015-07-16 インタラクティブ・インテリジェンス・インコーポレイテッド 音声認識に対する負例(アンチワード)に基づく性能改善
CN102665012B (zh) * 2012-05-02 2015-07-08 江苏南大数码科技有限公司 远程电话语音查询平台故障自动巡检方法
US9135916B2 (en) * 2013-02-26 2015-09-15 Honeywell International Inc. System and method for correcting accent induced speech transmission problems
RU2658602C2 (ru) 2013-08-29 2018-06-22 Юнифай Гмбх Унд Ко. Кг Поддержание аудиосвязи в перегруженном канале связи
US10069965B2 (en) 2013-08-29 2018-09-04 Unify Gmbh & Co. Kg Maintaining audio communication in a congested communication channel
KR101808810B1 (ko) * 2013-11-27 2017-12-14 한국전자통신연구원 음성/무음성 구간 검출 방법 및 장치
CN105374356B (zh) * 2014-08-29 2019-07-30 株式会社理光 语音识别方法、语音评分方法、语音识别系统及语音评分系统
US20160379640A1 (en) * 2015-06-24 2016-12-29 Honeywell International Inc. System and method for aircraft voice-to-text communication with message validation
JP6605995B2 (ja) * 2016-03-16 2019-11-13 株式会社東芝 音声認識誤り修正装置、方法及びプログラム
WO2018075224A1 (en) * 2016-10-20 2018-04-26 Google Llc Determining phonetic relationships
US10446138B2 (en) * 2017-05-23 2019-10-15 Verbit Software Ltd. System and method for assessing audio files for transcription services
CN109949828B (zh) * 2017-12-20 2022-05-24 苏州君林智能科技有限公司 一种文字校验方法及装置
WO2020014730A1 (en) * 2018-07-16 2020-01-23 Bookbot Pty Ltd Learning aid
KR102615154B1 (ko) * 2019-02-28 2023-12-18 삼성전자주식회사 전자 장치 및 전자 장치의 제어 방법
US11410658B1 (en) * 2019-10-29 2022-08-09 Dialpad, Inc. Maintainable and scalable pipeline for automatic speech recognition language modeling

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61233832A (ja) * 1985-04-08 1986-10-18 Toshiba Corp 読合わせ校正装置
JP2585547B2 (ja) * 1986-09-19 1997-02-26 株式会社日立製作所 音声入出力装置における入力音声の修正方法
JPH0488399A (ja) * 1990-08-01 1992-03-23 Clarion Co Ltd 音声認識装置
GB2302199B (en) * 1996-09-24 1997-05-14 Allvoice Computing Plc Data processing method and apparatus
US6088674A (en) * 1996-12-04 2000-07-11 Justsystem Corp. Synthesizing a voice by developing meter patterns in the direction of a time axis according to velocity and pitch of a voice
US5987405A (en) * 1997-06-24 1999-11-16 International Business Machines Corporation Speech compression by speech recognition
JP3519259B2 (ja) * 1997-12-29 2004-04-12 京セラ株式会社 音声認識作動装置
DE19824450C2 (de) 1998-05-30 2001-05-31 Grundig Ag Verfahren und Vorrichtung zur Verarbeitung von Sprachsignalen
US6490563B2 (en) * 1998-08-17 2002-12-03 Microsoft Corporation Proofreading with text to speech feedback
US6064965A (en) * 1998-09-02 2000-05-16 International Business Machines Corporation Combined audio playback in speech recognition proofreader
US6338038B1 (en) * 1998-09-02 2002-01-08 International Business Machines Corp. Variable speed audio playback in speech recognition proofreader
US6219638B1 (en) * 1998-11-03 2001-04-17 International Business Machines Corporation Telephone messaging and editing system
DE19920501A1 (de) * 1999-05-05 2000-11-09 Nokia Mobile Phones Ltd Wiedergabeverfahren für sprachgesteuerte Systeme mit textbasierter Sprachsynthese
US6611802B2 (en) * 1999-06-11 2003-08-26 International Business Machines Corporation Method and system for proofreading and correcting dictated text
US6370503B1 (en) * 1999-06-30 2002-04-09 International Business Machines Corp. Method and apparatus for improving speech recognition accuracy
US7010489B1 (en) * 2000-03-09 2006-03-07 International Business Mahcines Corporation Method for guiding text-to-speech output timing using speech recognition markers
DE10304229A1 (de) * 2003-01-28 2004-08-05 Deutsche Telekom Ag Kommunikationssystem, Kommunikationsendeinrichtung und Vorrichtung zum Erkennen fehlerbehafteter Text-Nachrichten

Also Published As

Publication number Publication date
US7617106B2 (en) 2009-11-10
DE602004018385D1 (de) 2009-01-22
EP1702319B1 (de) 2008-12-10
CN1879146A (zh) 2006-12-13
JP2007510943A (ja) 2007-04-26
WO2005045803A1 (en) 2005-05-19
EP1702319A1 (de) 2006-09-20
US20070027686A1 (en) 2007-02-01
JP4714694B2 (ja) 2011-06-29
CN1879146B (zh) 2011-06-08
WO2005045803A8 (en) 2006-08-10

Similar Documents

Publication Publication Date Title
ATE417347T1 (de) Fehlerdetektion für sprach-zu-text- transkriptionssysteme
DE60010106D1 (de) Verfahren und vorrichtung zum unterscheidenden training von akustischen modellen in einem spracherkennungssystem
CN110148402A (zh) 语音处理方法、装置、计算机设备及存储介质
ATE297588T1 (de) Anpassung des phonetischen kontextes zur verbesserung der spracherkennung
ATE235733T1 (de) Anordnung und verfahren zur erkennung eines vorgegebenen wortschatzes in gesprochener sprache durch einen rechner
ATE320650T1 (de) Verfahren zur erweiterung des wortschatzes eines spracherkennungssystems
CN101114447A (zh) 语音翻译装置和方法
WO2009025356A1 (ja) 音声認識装置および音声認識方法
WO2007117814A3 (en) Voice signal perturbation for speech recognition
ATE492875T1 (de) Sprachanalysesystem
CN103050116A (zh) 语音命令识别方法及系统
Chandra et al. An overview of speech recognition and speech synthesis algorithms
ATE441918T1 (de) Sprachdialogverfahren und -system
O'Shaughnessy Correcting complex false starts in spontaneous speech
DE602004006429D1 (de) Anpassung einer umgebungsfehlanpassung für spracherkennungssysteme
Ishimitsu et al. Construction of speech support system using body-conducted speech recognition for disorders
Gales Acoustic modelling for speech recognition: Hidden Markov Models and beyond?
JP2015215503A (ja) 音声認識方法、音声認識装置および音声認識プログラム
Amin et al. Nine voices, one artist: Linguistic and acoustic analysis
Valentini-Botinhao et al. Non linear time compression of clear and normal speech at high rates
Gurunath Shivakumar et al. Spoken Language Intent Detection using Confusion2Vec
DE602004014416D1 (de) Spracherkennung durch kontextuelle modellierung der spracheinheiten
KR20080030338A (ko) 경계 휴지강도를 이용한 발음변환 방법 및 이를 기반으로하는 음성합성 시스템
Dzibela et al. Hidden-Markov-model based speech enhancement
Rani et al. Reduction of confusion pairs on different rates of speech in Telugu language

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties