ATE445215T1 - Spracherkennung für grosse dynamische vokabulare - Google Patents

Spracherkennung für grosse dynamische vokabulare

Info

Publication number
ATE445215T1
ATE445215T1 AT04767631T AT04767631T ATE445215T1 AT E445215 T1 ATE445215 T1 AT E445215T1 AT 04767631 T AT04767631 T AT 04767631T AT 04767631 T AT04767631 T AT 04767631T AT E445215 T1 ATE445215 T1 AT E445215T1
Authority
AT
Austria
Prior art keywords
large dynamic
language recognition
markov
vocabulary
vocabularies
Prior art date
Application number
AT04767631T
Other languages
English (en)
Inventor
Laurent Cogne
Huitouze Serge Le
Frederic Soufflet
Original Assignee
Telisma
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telisma filed Critical Telisma
Application granted granted Critical
Publication of ATE445215T1 publication Critical patent/ATE445215T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • G10L15/144Training of HMMs
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/083Recognition networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Probability & Statistics with Applications (AREA)
  • Machine Translation (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Image Processing (AREA)
AT04767631T 2003-07-08 2004-07-08 Spracherkennung für grosse dynamische vokabulare ATE445215T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR0308341A FR2857528B1 (fr) 2003-07-08 2003-07-08 Reconnaissance vocale pour les larges vocabulaires dynamiques
PCT/FR2004/001799 WO2005006308A1 (fr) 2003-07-08 2004-07-08 Reconnaissance vocale pour les larges vocabulaires dynamiques

Publications (1)

Publication Number Publication Date
ATE445215T1 true ATE445215T1 (de) 2009-10-15

Family

ID=33522861

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04767631T ATE445215T1 (de) 2003-07-08 2004-07-08 Spracherkennung für grosse dynamische vokabulare

Country Status (8)

Country Link
US (1) US20070038451A1 (de)
EP (1) EP1642264B1 (de)
AT (1) ATE445215T1 (de)
AU (1) AU2004256561A1 (de)
CA (1) CA2531496C (de)
DE (1) DE602004023508D1 (de)
FR (1) FR2857528B1 (de)
WO (1) WO2005006308A1 (de)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4579595B2 (ja) * 2004-06-29 2010-11-10 キヤノン株式会社 音声認識文法作成装置、音声認識文法作成方法、プログラム、及び記憶媒体
WO2006042943A1 (fr) * 2004-10-19 2006-04-27 France Telecom Procede de reconnaissance vocale comprenant une etape d ' insertion de marqueurs temporels et systeme correspondant
DE602006012181D1 (de) 2005-04-18 2010-03-25 Koninkl Philips Electronics Nv Kaffeemaschine mit mitteln zur erzeugung einer drehung in einem getränkestrom
US8510109B2 (en) 2007-08-22 2013-08-13 Canyon Ip Holdings Llc Continuous speech transcription performance indication
US7902447B1 (en) * 2006-10-03 2011-03-08 Sony Computer Entertainment Inc. Automatic composition of sound sequences using finite state automata
US9973450B2 (en) 2007-09-17 2018-05-15 Amazon Technologies, Inc. Methods and systems for dynamically updating web service profile information by parsing transcribed message strings
US8447120B2 (en) * 2008-10-04 2013-05-21 Microsoft Corporation Incremental feature indexing for scalable location recognition
KR20110006004A (ko) * 2009-07-13 2011-01-20 삼성전자주식회사 결합인식단위 최적화 장치 및 그 방법
US9063931B2 (en) * 2011-02-16 2015-06-23 Ming-Yuan Wu Multiple language translation system
US8914286B1 (en) * 2011-04-14 2014-12-16 Canyon IP Holdings, LLC Speech recognition with hierarchical networks
WO2014189486A1 (en) * 2013-05-20 2014-11-27 Intel Corporation Natural human-computer interaction for virtual personal assistant systems
CN107293298B (zh) * 2016-04-05 2021-02-19 富泰华工业(深圳)有限公司 语音控制系统及方法

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0782348B2 (ja) * 1992-03-21 1995-09-06 株式会社エイ・ティ・アール自動翻訳電話研究所 音声認識用サブワードモデル生成方法
US6073095A (en) * 1997-10-15 2000-06-06 International Business Machines Corporation Fast vocabulary independent method and apparatus for spotting words in speech
US5983180A (en) * 1997-10-23 1999-11-09 Softsound Limited Recognition of sequential data using finite state sequence models organized in a tree structure
US6456970B1 (en) * 1998-07-31 2002-09-24 Texas Instruments Incorporated Minimization of search network in speech recognition
US6324510B1 (en) * 1998-11-06 2001-11-27 Lernout & Hauspie Speech Products N.V. Method and apparatus of hierarchically organizing an acoustic model for speech recognition and adaptation of the model to unseen domains
US6629073B1 (en) * 2000-04-27 2003-09-30 Microsoft Corporation Speech recognition method and apparatus utilizing multi-unit models
US20040034519A1 (en) * 2000-05-23 2004-02-19 Huitouze Serge Le Dynamic language models for speech recognition
US7035802B1 (en) * 2000-07-31 2006-04-25 Matsushita Electric Industrial Co., Ltd. Recognition system using lexical trees
US20020087313A1 (en) * 2000-12-29 2002-07-04 Lee Victor Wai Leung Computer-implemented intelligent speech model partitioning method and system
JP2003208195A (ja) * 2002-01-16 2003-07-25 Sharp Corp 連続音声認識装置および連続音声認識方法、連続音声認識プログラム、並びに、プログラム記録媒体

Also Published As

Publication number Publication date
CA2531496C (fr) 2014-05-06
EP1642264B1 (de) 2009-10-07
DE602004023508D1 (de) 2009-11-19
FR2857528A1 (fr) 2005-01-14
WO2005006308A1 (fr) 2005-01-20
US20070038451A1 (en) 2007-02-15
FR2857528B1 (fr) 2006-01-06
CA2531496A1 (fr) 2005-01-20
EP1642264A1 (de) 2006-04-05
AU2004256561A1 (en) 2005-01-20

Similar Documents

Publication Publication Date Title
US11676585B1 (en) Hybrid decoding using hardware and software for automatic speech recognition systems
US8972243B1 (en) Parse information encoding in a finite state transducer
CN105118501B (zh) 语音识别的方法及系统
DE69827667D1 (de) Vokoder basierter spracherkenner
ATE405920T1 (de) Erzeugen einer spracherkennungsgrammatik für alphanumerische ausdrücke
ATE457510T1 (de) Spracherkennungssystem mit riesigem vokabular
DE60111329D1 (de) Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung
ATE445215T1 (de) Spracherkennung für grosse dynamische vokabulare
WO2007034478A3 (en) System and method for correcting speech
DE59904741D1 (de) Anordnung und verfahren zur erkennung eines vorgegebenen wortschatzes in gesprochener sprache durch einen rechner
JP2000221990A (ja) 音声認識装置
US20220036893A1 (en) Language and grammar model adaptation
TW200627376A (en) Method and apparatus for constructing Chinese new words by the input voice
US10143027B1 (en) Device selection for routing of communications
ATE263997T1 (de) Zwischen-wörter verbindung phonemische modelle
JP4581549B2 (ja) 音声処理装置および方法、記録媒体、並びにプログラム
US11172527B2 (en) Routing of communications to a device
Hofmann et al. Improving spontaneous English ASR using a joint-sequence pronunciation model
Chen et al. Large vocabulary word recognition based on tree-trellis search
WO2001026092A3 (en) Attribute-based word modeling
Tolba et al. Speech recognition by intelligent machines
KR101578766B1 (ko) 음성 인식용 탐색 공간 생성 장치 및 방법
JP4972660B2 (ja) 音声学習装置及びプログラム
Kao et al. A low cost dynamic vocabulary speech recognizer on a GPP-DSP system
Song et al. Discriminative pronunciation modeling based on minimum phone error training.

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties