ATE385024T1 - Multilingual speech recognition - Google Patents
Multilingual speech recognition
Info
- Publication number
- ATE385024T1
- Authority
- AT
- Austria
- Prior art keywords
- subword
- speech recognition
- items
- list
- subword unit
- Prior art date
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/32—Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP05003670A EP1693828B1 (de) | 2005-02-21 | 2005-02-21 | Multilingual speech recognition |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE385024T1 (de) | 2008-02-15 |
Family
ID=34933852
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AT05003670T ATE385024T1 (de) | 2005-02-21 | 2005-02-21 | Multilingual speech recognition |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US20060206331A1 (de) |
| EP (1) | EP1693828B1 (de) |
| AT (1) | ATE385024T1 (de) |
| DE (1) | DE602005004503T2 (de) |
Families Citing this family (32)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1693829B1 (de) * | 2005-02-21 | 2018-12-05 | Harman Becker Automotive Systems GmbH | Voice-controlled data system |
| US7697827B2 (en) | 2005-10-17 | 2010-04-13 | Konicek Jeffrey C | User-friendlier interfaces for a camera |
| SG133419A1 (en) * | 2005-12-12 | 2007-07-30 | Creative Tech Ltd | A method and apparatus for accessing a digital file from a collection of digital files |
| US7873517B2 (en) | 2006-11-09 | 2011-01-18 | Volkswagen Of America, Inc. | Motor vehicle with a speech interface |
| DE102006057159A1 (de) * | 2006-12-01 | 2008-06-05 | Deutsche Telekom Ag | Method for classifying spoken language in speech dialog systems |
| EP1975923B1 (de) * | 2007-03-28 | 2016-04-27 | Nuance Communications, Inc. | Multilingual non-native speech recognition |
| US8099290B2 (en) * | 2009-01-28 | 2012-01-17 | Mitsubishi Electric Corporation | Voice recognition device |
| US9892730B2 (en) | 2009-07-01 | 2018-02-13 | Comcast Interactive Media, Llc | Generating topic-specific language models |
| US8949125B1 (en) * | 2010-06-16 | 2015-02-03 | Google Inc. | Annotating maps with user-contributed pronunciations |
| US8489398B1 (en) | 2011-01-14 | 2013-07-16 | Google Inc. | Disambiguation of spoken proper names |
| US9286894B1 (en) | 2012-01-31 | 2016-03-15 | Google Inc. | Parallel recognition |
| US9093076B2 (en) * | 2012-04-30 | 2015-07-28 | 2236008 Ontario Inc. | Multipass ASR controlling multiple applications |
| US9431012B2 (en) | 2012-04-30 | 2016-08-30 | 2236008 Ontario Inc. | Post processing of natural language automatic speech recognition |
| US20140214401A1 (en) | 2013-01-29 | 2014-07-31 | Tencent Technology (Shenzhen) Company Limited | Method and device for error correction model training and text error correction |
| US9471567B2 (en) * | 2013-01-31 | 2016-10-18 | Ncr Corporation | Automatic language recognition |
| DE102013005844B3 (de) * | 2013-03-28 | 2014-08-28 | Technische Universität Braunschweig | Method and device for measuring the quality of a speech signal |
| KR102084646B1 (ko) | 2013-07-04 | 2020-04-14 | 삼성전자주식회사 | Speech recognition apparatus and speech recognition method |
| WO2015030474A1 (ko) | 2013-08-26 | 2015-03-05 | 삼성전자 주식회사 | Electronic device and method for speech recognition |
| JP6080978B2 (ja) | 2013-11-20 | 2017-02-15 | 三菱電機株式会社 | Speech recognition device and speech recognition method |
| US9747897B2 (en) * | 2013-12-17 | 2017-08-29 | Google Inc. | Identifying substitute pronunciations |
| US10339920B2 (en) * | 2014-03-04 | 2019-07-02 | Amazon Technologies, Inc. | Predicting pronunciation in speech recognition |
| DE102014210716A1 (de) * | 2014-06-05 | 2015-12-17 | Continental Automotive Gmbh | Assistance system controllable by voice input, having a functional device and a plurality of speech recognition modules |
| US9683862B2 (en) * | 2015-08-24 | 2017-06-20 | International Business Machines Corporation | Internationalization during navigation |
| DE102015014206B4 (de) | 2015-11-04 | 2020-06-25 | Audi Ag | Method and device for selecting a navigation destination from one of several language regions by means of voice input |
| US9959887B2 (en) * | 2016-03-08 | 2018-05-01 | International Business Machines Corporation | Multi-pass speech activity detection strategy to improve automatic speech recognition |
| US10593321B2 (en) * | 2017-12-15 | 2020-03-17 | Mitsubishi Electric Research Laboratories, Inc. | Method and apparatus for multi-lingual end-to-end speech recognition |
| US10565320B1 (en) | 2018-09-28 | 2020-02-18 | International Business Machines Corporation | Dynamic multilingual speech recognition |
| US11270687B2 (en) * | 2019-05-03 | 2022-03-08 | Google Llc | Phoneme-based contextualization for cross-lingual speech recognition in end-to-end models |
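| CN112364658B (zh) | 2019-07-24 | 2024-07-26 | 阿里巴巴集团控股有限公司 | Translation and speech recognition method, apparatus, and device |
| CN110634487B (zh) * | 2019-10-24 | 2022-05-17 | 科大讯飞股份有限公司 | Bilingual mixed speech recognition method, apparatus, device, and storage medium |
| CN111798836B (zh) * | 2020-08-03 | 2023-12-05 | 上海茂声智能科技有限公司 | Method, apparatus, system, device, and storage medium for automatically switching languages |
| CN113035171B (zh) * | 2021-03-05 | 2022-09-02 | 随锐科技集团股份有限公司 | Speech recognition processing method and system |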
Family Cites Families (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5602960A (en) * | 1994-09-30 | 1997-02-11 | Apple Computer, Inc. | Continuous mandarin chinese speech recognition system having an integrated tone classifier |
| DE19636739C1 (de) * | 1996-09-10 | 1997-07-03 | Siemens Ag | Method for multilingual use of a hidden Markov sound model in a speech recognition system |
| US6085160A (en) * | 1998-07-10 | 2000-07-04 | Lernout & Hauspie Speech Products N.V. | Language independent speech recognition |
| US7120582B1 (en) * | 1999-09-07 | 2006-10-10 | Dragon Systems, Inc. | Expanding an effective vocabulary of a speech recognition system |
| US6912499B1 (en) * | 1999-08-31 | 2005-06-28 | Nortel Networks Limited | Method and apparatus for training a multilingual speech model set |
| EP1134726A1 (de) * | 2000-03-15 | 2001-09-19 | Siemens Aktiengesellschaft | Method for recognizing utterances of non-native speakers in a speech processing system |
| US7181395B1 (en) * | 2000-10-27 | 2007-02-20 | International Business Machines Corporation | Methods and apparatus for automatic generation of multiple pronunciations from acoustic data |
| DE60111329T2 (de) * | 2000-11-14 | 2006-03-16 | International Business Machines Corp. | Adaptation of phonetic context to improve speech recognition |
| GB0028277D0 (en) * | 2000-11-20 | 2001-01-03 | Canon Kk | Speech processing system |
| EP1217610A1 (de) * | 2000-11-28 | 2002-06-26 | Siemens Aktiengesellschaft | Method and system for multilingual speech recognition |
| EP1233406A1 (de) * | 2001-02-14 | 2002-08-21 | Sony International (Europe) GmbH | Adapted speech recognition for foreign speakers |
| US7043431B2 (en) * | 2001-08-31 | 2006-05-09 | Nokia Corporation | Multilingual speech recognition system using text derived recognition models |
| DE10207895B4 (de) * | 2002-02-23 | 2005-11-03 | Harman Becker Automotive Systems Gmbh | Method for speech recognition and speech recognition system |
| US7092883B1 (en) * | 2002-03-29 | 2006-08-15 | At&T | Generating confidence scores from word lattices |
| US6932873B2 (en) * | 2002-07-30 | 2005-08-23 | Applied Materials Israel, Ltd. | Managing work-piece deflection |
| US7149688B2 (en) * | 2002-11-04 | 2006-12-12 | Speechworks International, Inc. | Multi-lingual speech recognition with cross-language context modeling |
| WO2004047077A1 (en) * | 2002-11-15 | 2004-06-03 | Voice Signal Technologies, Inc. | Multilingual speech recognition |
| US8285537B2 (en) * | 2003-01-31 | 2012-10-09 | Comverse, Inc. | Recognition of proper nouns using native-language pronunciation |
| US7689404B2 (en) * | 2004-02-24 | 2010-03-30 | Arkady Khasin | Method of multilingual speech recognition by reduction to single-language recognizer engine components |
| US20050197837A1 (en) * | 2004-03-08 | 2005-09-08 | Janne Suontausta | Enhanced multilingual speech recognition system |
| US20050267755A1 (en) * | 2004-05-27 | 2005-12-01 | Nokia Corporation | Arrangement for speech recognition |
2005
- 2005-02-21: AT application AT05003670T, patent ATE385024T1 (de), not active: IP Right Cessation
- 2005-02-21: DE application DE602005004503T, patent DE602005004503T2 (de), not active: Expired - Lifetime
- 2005-02-21: EP application EP05003670A, patent EP1693828B1 (de), not active: Expired - Lifetime
2006
- 2006-02-21: US application US11/360,024, patent US20060206331A1 (en), not active: Abandoned
Also Published As
| Publication number | Publication date |
|---|---|
| DE602005004503D1 (de) | 2008-03-13 |
| US20060206331A1 (en) | 2006-09-14 |
| EP1693828A1 (de) | 2006-08-23 |
| DE602005004503T2 (de) | 2009-01-22 |
| EP1693828B1 (de) | 2008-01-23 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| ATE385024T1 (de) | Multilingual speech recognition | |
| Atmaja et al. | Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion | |
| Deng et al. | Improving accent identification and accented speech recognition under a framework of self-supervised learning | |
| ATE527652T1 (de) | Multi-stage speech recognition | |
| CN102077275B (zh) | Method and device for generating vocabulary entries from acoustic data | |
| US20160336007A1 (en) | Speech search device and speech search method | |
| Bhaykar et al. | Speaker dependent, speaker independent and cross language emotion recognition from speech using GMM and HMM | |
| ATE457510T1 (de) | Speech recognition system with huge vocabulary | |
| CN108074562B (zh) | Speech recognition device, speech recognition method, and storage medium | |
| TW200638337A (en) | Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system | |
| ATE419616T1 (de) | Method, device, and computer program for speech recognition | |
| ATE405919T1 (de) | Phonetics-based speech recognition system and method | |
| ATE524777T1 (de) | Automatic updating of a language model | |
| CN108091334B (zh) | Recognition device, recognition method, and storage medium | |
| GB2443753A (en) | Spoken language proficiency assessment by computer | |
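| WO2008096582A1 (ja) | Recognizer weight learning device, speech recognition device, and system | |
| KR20090060631A (ko) | Indirect data-driven pronunciation variation modeling system and method for improving speech recognition performance on speech from speakers of other languages | |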
| US8682668B2 (en) | Language model score look-ahead value imparting device, language model score look-ahead value imparting method, and program storage medium | |
| Rastrow et al. | Towards using hybrid word and fragment units for vocabulary independent LVCSR systems. | |
| WO2010018453A3 (en) | System and method for processing electronically generated text | |
| Marin et al. | Using syntactic and confusion network structure for out-of-vocabulary word detection | |
| Gupta et al. | A Language Independent Approach to Audio Search. | |
| Dzhambazov et al. | Automatic lyrics-to-audio alignment in classical Turkish music | |
| Peng et al. | Multilingual approach to joint speech and accent recognition with DNN-HMM framework | |
| Saikia et al. | Generating Manipuri English pronunciation dictionary using sequence labelling problem |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties | |