ATE385024T1 - Multilinguale spracherkennung - Google Patents

Multilinguale spracherkennung

Info

Publication number
ATE385024T1
ATE385024T1 AT05003670T AT05003670T ATE385024T1 AT E385024 T1 ATE385024 T1 AT E385024T1 AT 05003670 T AT05003670 T AT 05003670T AT 05003670 T AT05003670 T AT 05003670T AT E385024 T1 ATE385024 T1 AT E385024T1
Authority
AT
Austria
Prior art keywords
subword
speech recognition
items
list
subword unit
Prior art date
Application number
AT05003670T
Other languages
English (en)
Inventor
Marcus Hennecke
Thomas Krippgans
Original Assignee
Harman Becker Automotive Sys
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harman Becker Automotive Sys filed Critical Harman Becker Automotive Sys
Application granted granted Critical
Publication of ATE385024T1 publication Critical patent/ATE385024T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/32Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Machine Translation (AREA)
AT05003670T 2005-02-21 2005-02-21 Multilinguale spracherkennung ATE385024T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP05003670A EP1693828B1 (de) 2005-02-21 2005-02-21 Multilinguale Spracherkennung

Publications (1)

Publication Number Publication Date
ATE385024T1 true ATE385024T1 (de) 2008-02-15

Family

ID=34933852

Family Applications (1)

Application Number Title Priority Date Filing Date
AT05003670T ATE385024T1 (de) 2005-02-21 2005-02-21 Multilinguale spracherkennung

Country Status (4)

Country Link
US (1) US20060206331A1 (de)
EP (1) EP1693828B1 (de)
AT (1) ATE385024T1 (de)
DE (1) DE602005004503T2 (de)

Families Citing this family (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1693829B1 (de) * 2005-02-21 2018-12-05 Harman Becker Automotive Systems GmbH Sprachgesteuertes Datensystem
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
SG133419A1 (en) * 2005-12-12 2007-07-30 Creative Tech Ltd A method and apparatus for accessing a digital file from a collection of digital files
US7873517B2 (en) 2006-11-09 2011-01-18 Volkswagen Of America, Inc. Motor vehicle with a speech interface
DE102006057159A1 (de) * 2006-12-01 2008-06-05 Deutsche Telekom Ag Verfahren zur Klassifizierung der gesprochenen Sprache in Sprachdialogsystemen
EP1975923B1 (de) * 2007-03-28 2016-04-27 Nuance Communications, Inc. Mehrsprachige nicht-muttersprachliche Spracherkennung
US8099290B2 (en) * 2009-01-28 2012-01-17 Mitsubishi Electric Corporation Voice recognition device
US9892730B2 (en) 2009-07-01 2018-02-13 Comcast Interactive Media, Llc Generating topic-specific language models
US8949125B1 (en) * 2010-06-16 2015-02-03 Google Inc. Annotating maps with user-contributed pronunciations
US8489398B1 (en) 2011-01-14 2013-07-16 Google Inc. Disambiguation of spoken proper names
US9286894B1 (en) 2012-01-31 2016-03-15 Google Inc. Parallel recognition
US9093076B2 (en) * 2012-04-30 2015-07-28 2236008 Ontario Inc. Multipass ASR controlling multiple applications
US9431012B2 (en) 2012-04-30 2016-08-30 2236008 Ontario Inc. Post processing of natural language automatic speech recognition
US20140214401A1 (en) 2013-01-29 2014-07-31 Tencent Technology (Shenzhen) Company Limited Method and device for error correction model training and text error correction
US9471567B2 (en) * 2013-01-31 2016-10-18 Ncr Corporation Automatic language recognition
DE102013005844B3 (de) * 2013-03-28 2014-08-28 Technische Universität Braunschweig Verfahren und Vorrichtung zum Messen der Qualität eines Sprachsignals
KR102084646B1 (ko) 2013-07-04 2020-04-14 삼성전자주식회사 음성 인식 장치 및 음성 인식 방법
WO2015030474A1 (ko) 2013-08-26 2015-03-05 삼성전자 주식회사 음성 인식을 위한 전자 장치 및 방법
JP6080978B2 (ja) 2013-11-20 2017-02-15 三菱電機株式会社 音声認識装置および音声認識方法
US9747897B2 (en) * 2013-12-17 2017-08-29 Google Inc. Identifying substitute pronunciations
US10339920B2 (en) * 2014-03-04 2019-07-02 Amazon Technologies, Inc. Predicting pronunciation in speech recognition
DE102014210716A1 (de) * 2014-06-05 2015-12-17 Continental Automotive Gmbh Assistenzsystem, das mittels Spracheingaben steuerbar ist, mit einer Funktionseinrichtung und mehreren Spracherkennungsmodulen
US9683862B2 (en) * 2015-08-24 2017-06-20 International Business Machines Corporation Internationalization during navigation
DE102015014206B4 (de) 2015-11-04 2020-06-25 Audi Ag Verfahren und Vorrichtung zum Auswählen eines Navigationsziels aus einer von mehreren Sprachregionen mittels Spracheingabe
US9959887B2 (en) * 2016-03-08 2018-05-01 International Business Machines Corporation Multi-pass speech activity detection strategy to improve automatic speech recognition
US10593321B2 (en) * 2017-12-15 2020-03-17 Mitsubishi Electric Research Laboratories, Inc. Method and apparatus for multi-lingual end-to-end speech recognition
US10565320B1 (en) 2018-09-28 2020-02-18 International Business Machines Corporation Dynamic multilingual speech recognition
US11270687B2 (en) * 2019-05-03 2022-03-08 Google Llc Phoneme-based contextualization for cross-lingual speech recognition in end-to-end models
CN112364658B (zh) 2019-07-24 2024-07-26 阿里巴巴集团控股有限公司 翻译以及语音识别方法、装置、设备
CN110634487B (zh) * 2019-10-24 2022-05-17 科大讯飞股份有限公司 一种双语种混合语音识别方法、装置、设备及存储介质
CN111798836B (zh) * 2020-08-03 2023-12-05 上海茂声智能科技有限公司 一种自动切换语种方法、装置、系统、设备和存储介质
CN113035171B (zh) * 2021-03-05 2022-09-02 随锐科技集团股份有限公司 语音识别处理方法及系统

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5602960A (en) * 1994-09-30 1997-02-11 Apple Computer, Inc. Continuous mandarin chinese speech recognition system having an integrated tone classifier
DE19636739C1 (de) * 1996-09-10 1997-07-03 Siemens Ag Verfahren zur Mehrsprachenverwendung eines hidden Markov Lautmodelles in einem Spracherkennungssystem
US6085160A (en) * 1998-07-10 2000-07-04 Lernout & Hauspie Speech Products N.V. Language independent speech recognition
US7120582B1 (en) * 1999-09-07 2006-10-10 Dragon Systems, Inc. Expanding an effective vocabulary of a speech recognition system
US6912499B1 (en) * 1999-08-31 2005-06-28 Nortel Networks Limited Method and apparatus for training a multilingual speech model set
EP1134726A1 (de) * 2000-03-15 2001-09-19 Siemens Aktiengesellschaft Verfahren zur Erkennung von Sprachäusserungen nicht-muttersprachlicher Sprecher in einem Sprachverarbeitungssystem
US7181395B1 (en) * 2000-10-27 2007-02-20 International Business Machines Corporation Methods and apparatus for automatic generation of multiple pronunciations from acoustic data
DE60111329T2 (de) * 2000-11-14 2006-03-16 International Business Machines Corp. Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung
GB0028277D0 (en) * 2000-11-20 2001-01-03 Canon Kk Speech processing system
EP1217610A1 (de) * 2000-11-28 2002-06-26 Siemens Aktiengesellschaft Verfahren und System zur multilingualen Spracherkennung
EP1233406A1 (de) * 2001-02-14 2002-08-21 Sony International (Europe) GmbH Angepasste Spracherkennung für ausländische Sprecher
US7043431B2 (en) * 2001-08-31 2006-05-09 Nokia Corporation Multilingual speech recognition system using text derived recognition models
DE10207895B4 (de) * 2002-02-23 2005-11-03 Harman Becker Automotive Systems Gmbh Verfahren zur Spracherkennung und Spracherkennungssystem
US7092883B1 (en) * 2002-03-29 2006-08-15 At&T Generating confidence scores from word lattices
US6932873B2 (en) * 2002-07-30 2005-08-23 Applied Materials Israel, Ltd. Managing work-piece deflection
US7149688B2 (en) * 2002-11-04 2006-12-12 Speechworks International, Inc. Multi-lingual speech recognition with cross-language context modeling
WO2004047077A1 (en) * 2002-11-15 2004-06-03 Voice Signal Technologies, Inc. Multilingual speech recognition
US8285537B2 (en) * 2003-01-31 2012-10-09 Comverse, Inc. Recognition of proper nouns using native-language pronunciation
US7689404B2 (en) * 2004-02-24 2010-03-30 Arkady Khasin Method of multilingual speech recognition by reduction to single-language recognizer engine components
US20050197837A1 (en) * 2004-03-08 2005-09-08 Janne Suontausta Enhanced multilingual speech recognition system
US20050267755A1 (en) * 2004-05-27 2005-12-01 Nokia Corporation Arrangement for speech recognition

Also Published As

Publication number Publication date
DE602005004503D1 (de) 2008-03-13
US20060206331A1 (en) 2006-09-14
EP1693828A1 (de) 2006-08-23
DE602005004503T2 (de) 2009-01-22
EP1693828B1 (de) 2008-01-23

Similar Documents

Publication Publication Date Title
ATE385024T1 (de) Multilinguale spracherkennung
Atmaja et al. Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion
Deng et al. Improving accent identification and accented speech recognition under a framework of self-supervised learning
ATE527652T1 (de) Mehrstufige spracherkennung
CN102077275B (zh) 用于从声学数据生成词条的方法和设备
US20160336007A1 (en) Speech search device and speech search method
Bhaykar et al. Speaker dependent, speaker independent and cross language emotion recognition from speech using GMM and HMM
ATE457510T1 (de) Spracherkennungssystem mit riesigem vokabular
CN108074562B (zh) 语音识别装置、语音识别方法以及存储介质
TW200638337A (en) Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
ATE419616T1 (de) Verfahren, einrichtung und computerprogramm zur spracherkennung
ATE405919T1 (de) Spracherkennungssystem und verfahren auf phonetischer basis
ATE524777T1 (de) Automatische aktualisierung eines sprachmodells
CN108091334B (zh) 识别装置、识别方法以及存储介质
GB2443753A (en) Spoken language proficiency assessment by computer
WO2008096582A1 (ja) 認識器重み学習装置および音声認識装置、ならびに、システム
KR20090060631A (ko) 타 언어권 화자음성에 대한 음성인식 시스템의 성능 향상을위한 비직접적 데이터 기반 발음변이 모델링 시스템 및방법
US8682668B2 (en) Language model score look-ahead value imparting device, language model score look-ahead value imparting method, and program storage medium
Rastrow et al. Towards using hybrid word and fragment units for vocabulary independent LVCSR systems.
WO2010018453A3 (en) System and method for processing electronically generated text
Marin et al. Using syntactic and confusion network structure for out-of-vocabulary word detection
Gupta et al. A Language Independent Approach to Audio Search.
Dzhambazov et al. Automatic lyrics-to-audio alignment in classical Turkish music
Peng et al. Multilingual approach to joint speech and accent recognition with DNN-HMM framework
Saikia et al. Generating Manipuri English pronunciation dictionary using sequence labelling problem

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties