DE60020504D1 - Anpassung eines spracherkenners an korrigierte texte - Google Patents
Anpassung eines spracherkenners an korrigierte texteInfo
- Publication number
- DE60020504D1 DE60020504D1 DE60020504T DE60020504T DE60020504D1 DE 60020504 D1 DE60020504 D1 DE 60020504D1 DE 60020504 T DE60020504 T DE 60020504T DE 60020504 T DE60020504 T DE 60020504T DE 60020504 D1 DE60020504 D1 DE 60020504D1
- Authority
- DE
- Germany
- Prior art keywords
- text information
- indicator
- adjusting
- information
- smi
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G10L15/075—Adaptation to the speaker supervised, i.e. under machine guidance
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
- Telephonic Communication Services (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP99890232 | 1999-07-08 | ||
EP99890232 | 1999-07-08 | ||
PCT/EP2000/006167 WO2001004874A1 (en) | 1999-07-08 | 2000-06-30 | Adaptation of a speech recognizer from corrected text |
Publications (2)
Publication Number | Publication Date |
---|---|
DE60020504D1 true DE60020504D1 (de) | 2005-07-07 |
DE60020504T2 DE60020504T2 (de) | 2006-05-04 |
Family
ID=8243996
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE60020504T Expired - Lifetime DE60020504T2 (de) | 1999-07-08 | 2000-06-30 | Anpassung eines spracherkenners an korrigierte texte |
Country Status (6)
Country | Link |
---|---|
US (1) | US6725194B1 (de) |
EP (1) | EP1110204B1 (de) |
JP (1) | JP2003504674A (de) |
AT (1) | ATE297046T1 (de) |
DE (1) | DE60020504T2 (de) |
WO (1) | WO2001004874A1 (de) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ATE306116T1 (de) * | 1999-07-08 | 2005-10-15 | Koninkl Philips Electronics Nv | Spracherkennungseinrichtung mit transfermitteln |
US7689416B1 (en) | 1999-09-29 | 2010-03-30 | Poirier Darrell A | System for transferring personalize matter from one computer to another |
JP2001100781A (ja) | 1999-09-30 | 2001-04-13 | Sony Corp | 音声処理装置および音声処理方法、並びに記録媒体 |
ATE362271T1 (de) * | 2000-07-31 | 2007-06-15 | Eliza Corp | Verfahren und system zur verbesserung der genauigkeit in einem spracherkennungssystem |
US7418381B2 (en) * | 2001-09-07 | 2008-08-26 | Hewlett-Packard Development Company, L.P. | Device for automatically translating and presenting voice messages as text messages |
US20030158735A1 (en) * | 2002-02-15 | 2003-08-21 | Canon Kabushiki Kaisha | Information processing apparatus and method with speech synthesis function |
US8019602B2 (en) | 2004-01-20 | 2011-09-13 | Microsoft Corporation | Automatic speech recognition learning using user corrections |
US7590533B2 (en) * | 2004-03-10 | 2009-09-15 | Microsoft Corporation | New-word pronunciation learning using a pronunciation graph |
WO2008008730A2 (en) | 2006-07-08 | 2008-01-17 | Personics Holdings Inc. | Personal audio assistant device and method |
CN103714048B (zh) * | 2012-09-29 | 2017-07-21 | 国际商业机器公司 | 用于校正文本的方法和系统 |
KR102009423B1 (ko) * | 2012-10-08 | 2019-08-09 | 삼성전자주식회사 | 음성 인식을 이용한 미리 설정된 동작 모드의 수행 방법 및 장치 |
CN107086040B (zh) * | 2017-06-23 | 2021-03-02 | 歌尔股份有限公司 | 语音识别能力测试方法和装置 |
US10943583B1 (en) * | 2017-07-20 | 2021-03-09 | Amazon Technologies, Inc. | Creation of language models for speech recognition |
US10600408B1 (en) * | 2018-03-23 | 2020-03-24 | Amazon Technologies, Inc. | Content output management based on speech quality |
US11393471B1 (en) * | 2020-03-30 | 2022-07-19 | Amazon Technologies, Inc. | Multi-device output management based on speech characteristics |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4779209A (en) * | 1982-11-03 | 1988-10-18 | Wang Laboratories, Inc. | Editing voice data |
AT390685B (de) | 1988-10-25 | 1990-06-11 | Philips Nv | System zur textverarbeitung |
JPH08505959A (ja) * | 1993-01-21 | 1996-06-25 | アップル コンピューター インコーポレイテッド | ベクトル量子化ベース音声符号化/複号化を用いたテキスト−音声合成システム |
US5852801A (en) * | 1995-10-04 | 1998-12-22 | Apple Computer, Inc. | Method and apparatus for automatically invoking a new word module for unrecognized user input |
US5960447A (en) * | 1995-11-13 | 1999-09-28 | Holt; Douglas | Word tagging and editing system for speech recognition |
US5794189A (en) * | 1995-11-13 | 1998-08-11 | Dragon Systems, Inc. | Continuous speech recognition |
GB2302199B (en) * | 1996-09-24 | 1997-05-14 | Allvoice Computing Plc | Data processing method and apparatus |
US5857099A (en) * | 1996-09-27 | 1999-01-05 | Allvoice Computing Plc | Speech-to-text dictation system with audio message capability |
US6173259B1 (en) * | 1997-03-27 | 2001-01-09 | Speech Machines Plc | Speech to text conversion |
US6263308B1 (en) * | 2000-03-20 | 2001-07-17 | Microsoft Corporation | Methods and apparatus for performing speech recognition using acoustic models which are improved through an interactive process |
-
2000
- 2000-06-30 WO PCT/EP2000/006167 patent/WO2001004874A1/en active IP Right Grant
- 2000-06-30 EP EP00943966A patent/EP1110204B1/de not_active Expired - Lifetime
- 2000-06-30 AT AT00943966T patent/ATE297046T1/de not_active IP Right Cessation
- 2000-06-30 DE DE60020504T patent/DE60020504T2/de not_active Expired - Lifetime
- 2000-06-30 JP JP2001509020A patent/JP2003504674A/ja not_active Withdrawn
- 2000-07-06 US US09/610,714 patent/US6725194B1/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
ATE297046T1 (de) | 2005-06-15 |
US6725194B1 (en) | 2004-04-20 |
JP2003504674A (ja) | 2003-02-04 |
DE60020504T2 (de) | 2006-05-04 |
EP1110204A1 (de) | 2001-06-27 |
WO2001004874A1 (en) | 2001-01-18 |
EP1110204B1 (de) | 2005-06-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60020504D1 (de) | Anpassung eines spracherkenners an korrigierte texte | |
Kanthak et al. | Context-dependent acoustic modeling using graphemes for large vocabulary speech recognition | |
US7490039B1 (en) | Text to speech system and method having interactive spelling capabilities | |
CA2181205A1 (en) | Discriminative Utterance Verification for Connected Digits Recognition | |
JP2006058899A (ja) | 発話検索のためのラティス・ベースの検索システムおよび方法 | |
ATE395685T1 (de) | Spracherkennung durch wort-in-phrase-befehl | |
DE60004862D1 (de) | Automatische bestimmung der genauigkeit eines aussprachewörterbuchs in einem spracherkennungssystem | |
ATE311650T1 (de) | Korrektur eines von einer spracherkennung erkannten textes mittels vergleich der phonemfolgen des erkannten textes mit einer phonetischen transkription eines manuell eingegebenen korrekturwortes | |
KR19990087935A (ko) | 연속음성인식시에구두점들을자동으로발생시키기위한장치및방법 | |
CA2493265A1 (en) | System and method for augmenting spoken language understanding by correcting common errors in linguistic performance | |
DE60209103D1 (de) | Texteditierung von erkannter sprache bei gleichzeitiger wiedergabe | |
WO2007034478A3 (en) | System and method for correcting speech | |
ATE319161T1 (de) | Korrekturvorrichtung mit markierung von teilen eines erkannten textes | |
Zetterholm | Voice imitation: a phonetic study of perceptual illusions and acoustic success | |
Greenberg et al. | Linguistic dissection of switchboard-corpus automatic speech recognition systems | |
Che et al. | Speaker recognition using HMM with experiments on the YOHO database | |
US6377921B1 (en) | Identifying mismatches between assumed and actual pronunciations of words | |
Newman et al. | Speaker verification through large vocabulary continuous speech recognition | |
Amdal et al. | Joint pronunciation modelling of non-native speakers using data-driven methods. | |
Schuller et al. | Comparing one and two-stage acoustic modeling in the recognition of emotion in speech | |
US20050234724A1 (en) | System and method for improving text-to-speech software intelligibility through the detection of uncommon words and phrases | |
DE60022976D1 (de) | Spracherkennungseinrichtung mit transfermitteln | |
Gauvain et al. | Experiments with speaker verification over the telephone. | |
Schramm et al. | Filled-pause modeling for medical transcriptions | |
KR20090109501A (ko) | 언어학습용 리듬훈련 시스템 및 방법 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition | ||
8327 | Change in the person/name/address of the patent owner |
Owner name: NUANCE COMMUNICATIONS AUSTRIA GMBH, WIEN, AT |
|
8328 | Change in the person/name/address of the agent |
Representative=s name: VOSSIUS & PARTNER, 81675 MUENCHEN |