ATE440360T1 - Verfahren und system zur echtzeit-spracherkennung - Google Patents

Verfahren und system zur echtzeit-spracherkennung

Info

Publication number
ATE440360T1
ATE440360T1 AT02801823T AT02801823T ATE440360T1 AT E440360 T1 ATE440360 T1 AT E440360T1 AT 02801823 T AT02801823 T AT 02801823T AT 02801823 T AT02801823 T AT 02801823T AT E440360 T1 ATE440360 T1 AT E440360T1
Authority
AT
Austria
Prior art keywords
real
voice recognition
time voice
processor
processor units
Prior art date
Application number
AT02801823T
Other languages
English (en)
Inventor
Hamid Sheikhzadeh-Nadjar
Etienne Cornu
Robert Brennan
Nicolas Destrez
Alain Dufaux
Original Assignee
Emma Mixed Signal Cv
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Emma Mixed Signal Cv filed Critical Emma Mixed Signal Cv
Application granted granted Critical
Publication of ATE440360T1 publication Critical patent/ATE440360T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/34Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Computing Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Complex Calculations (AREA)
  • Multi Processors (AREA)
  • Image Processing (AREA)
  • Mobile Radio Communication Systems (AREA)
AT02801823T 2001-10-22 2002-10-22 Verfahren und system zur echtzeit-spracherkennung ATE440360T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CA002359544A CA2359544A1 (en) 2001-10-22 2001-10-22 Low-resource real-time speech recognition system using an oversampled filterbank
PCT/CA2002/001578 WO2003036618A1 (en) 2001-10-22 2002-10-22 Method and system for real-time speech recognition

Publications (1)

Publication Number Publication Date
ATE440360T1 true ATE440360T1 (de) 2009-09-15

Family

ID=4170315

Family Applications (1)

Application Number Title Priority Date Filing Date
AT02801823T ATE440360T1 (de) 2001-10-22 2002-10-22 Verfahren und system zur echtzeit-spracherkennung

Country Status (7)

Country Link
US (1) US7139707B2 (de)
EP (1) EP1449203B1 (de)
AT (1) ATE440360T1 (de)
CA (1) CA2359544A1 (de)
DE (1) DE60233426D1 (de)
DK (1) DK1449203T3 (de)
WO (1) WO2003036618A1 (de)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7610199B2 (en) * 2004-09-01 2009-10-27 Sri International Method and apparatus for obtaining complete speech signals for speech recognition applications
JP5103907B2 (ja) * 2005-01-17 2012-12-19 日本電気株式会社 音声認識システム、音声認識方法及び音声認識プログラム
US7587441B2 (en) * 2005-06-29 2009-09-08 L-3 Communications Integrated Systems L.P. Systems and methods for weighted overlap and add processing
US7249868B2 (en) * 2005-07-07 2007-07-31 Visteon Global Technologies, Inc. Lamp housing with interior cooling by a thermoelectric device
US7970613B2 (en) 2005-11-12 2011-06-28 Sony Computer Entertainment Inc. Method and system for Gaussian probability data bit reduction and computation
US8380506B2 (en) * 2006-01-27 2013-02-19 Georgia Tech Research Corporation Automatic pattern recognition using category dependent feature selection
US8195462B2 (en) * 2006-02-16 2012-06-05 At&T Intellectual Property Ii, L.P. System and method for providing large vocabulary speech processing based on fixed-point arithmetic
US8010358B2 (en) 2006-02-21 2011-08-30 Sony Computer Entertainment Inc. Voice recognition with parallel gender and age normalization
US7778831B2 (en) * 2006-02-21 2010-08-17 Sony Computer Entertainment Inc. Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
WO2010042631A2 (en) * 2008-10-10 2010-04-15 Fastow Richard M Real-time data pattern analysis system and method of operation thereof
US8818802B2 (en) 2008-10-10 2014-08-26 Spansion Llc Real-time data pattern analysis system and method of operation thereof
US8442833B2 (en) 2009-02-17 2013-05-14 Sony Computer Entertainment Inc. Speech processing with source location estimation using signals from two or more microphones
US8788256B2 (en) 2009-02-17 2014-07-22 Sony Computer Entertainment Inc. Multiple language voice recognition
US8442829B2 (en) 2009-02-17 2013-05-14 Sony Computer Entertainment Inc. Automatic computation streaming partition for voice recognition on multiple processors with limited memory
CN102256201A (zh) * 2010-05-19 2011-11-23 上海聪维声学技术有限公司 用于助听器的自动环境识别方法
US8924453B2 (en) * 2011-12-19 2014-12-30 Spansion Llc Arithmetic logic unit architecture
US9153235B2 (en) 2012-04-09 2015-10-06 Sony Computer Entertainment Inc. Text dependent speaker recognition with long-term feature based on functional data analysis
US9514739B2 (en) * 2012-06-06 2016-12-06 Cypress Semiconductor Corporation Phoneme score accelerator
US9224384B2 (en) * 2012-06-06 2015-12-29 Cypress Semiconductor Corporation Histogram based pre-pruning scheme for active HMMS
US9542933B2 (en) 2013-03-08 2017-01-10 Analog Devices Global Microphone circuit assembly and system with speech recognition
US9836450B2 (en) * 2014-12-09 2017-12-05 Sansa AI Inc. Methods and systems for providing universal portability in machine learning
US10540957B2 (en) * 2014-12-15 2020-01-21 Baidu Usa Llc Systems and methods for speech transcription
US10089989B2 (en) 2015-12-07 2018-10-02 Semiconductor Components Industries, Llc Method and apparatus for a low power voice trigger device
CN109102799B (zh) * 2018-08-17 2023-01-24 信阳师范学院 一种基于频域系数对数和的语音端点检测方法
CN108962249B (zh) * 2018-08-21 2023-03-31 广州市保伦电子有限公司 一种基于mfcc语音特征的语音匹配方法及存储介质
CN110875034B (zh) * 2018-09-03 2024-03-22 嘉楠明芯(北京)科技有限公司 用于语音识别的模板训练方法、语音识别方法及其系统
CN111354337A (zh) * 2018-12-24 2020-06-30 上海新微技术研发中心有限公司 语音识别方法以及用户终端
US20210074294A1 (en) * 2019-09-06 2021-03-11 Verbit Software Ltd. User interface to assist in hybrid transcription of audio that includes a repeated phrase

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5704004A (en) * 1993-12-01 1997-12-30 Industrial Technology Research Institute Apparatus and method for normalizing and categorizing linear prediction code vectors using Bayesian categorization technique
US6236731B1 (en) 1997-04-16 2001-05-22 Dspfactory Ltd. Filterbank structure and method for filtering and separating an information signal into different bands, particularly for audio signal in hearing aids
EP0878790A1 (de) * 1997-05-15 1998-11-18 Hewlett-Packard Company Sprachkodiersystem und Verfahren
US6249761B1 (en) 1997-09-30 2001-06-19 At&T Corp. Assigning and processing states and arcs of a speech recognition model in parallel processors
JP4197195B2 (ja) * 1998-02-27 2008-12-17 ヒューレット・パッカード・カンパニー 音声情報の提供方法
EP1082719B1 (de) 1999-04-01 2013-07-03 Koninklijke Philips Electronics N.V. Mehrstufiger spracherkenner
FI19992350A (fi) * 1999-10-29 2001-04-30 Nokia Mobile Phones Ltd Parannettu puheentunnistus

Also Published As

Publication number Publication date
US20030110033A1 (en) 2003-06-12
US7139707B2 (en) 2006-11-21
DK1449203T3 (da) 2009-11-09
EP1449203A1 (de) 2004-08-25
EP1449203B1 (de) 2009-08-19
CA2359544A1 (en) 2003-04-22
WO2003036618A1 (en) 2003-05-01
DE60233426D1 (de) 2009-10-01

Similar Documents

Publication Publication Date Title
ATE440360T1 (de) Verfahren und system zur echtzeit-spracherkennung
DE60321256D1 (de) Spracherkennungssystem, Spracherkennungsverfahren und Programmprodukt
DE69811921D1 (de) Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung
ATE410768T1 (de) System und verfahren zum betrieb eines spracherkennungssystems in einem fahrzeug
ATE361523T1 (de) Verfahren zum komprimieren von wörterbuchdaten
DE60309822D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE502006007957D1 (de) Reinigungsanlage
ATE410447T1 (de) Verfahren zur kollagenherstellung
DE60317025D1 (de) Vorrichtung und Verfahren zur Gesichtserkennung
DE60318544D1 (de) Sprachmodell für die Spracherkennung
ATE345526T1 (de) Informationsverarbeitungsvorrichtung und - verfahren und programmprodukt
DE60221530D1 (de) Verfahren und vorrichtung zum unterdrücken von tönen, die durch dem-algorithmen (cyclic dynamic element matching) verursacht werden
DE60207863D1 (de) Vorrichtung und Verfahren zur Gesichtserkennung
DE60124471D1 (de) Vorrichtung zur Spracherkennung
WO2003067572A3 (en) Speech recognition circuit using parallel processors
SG140445A1 (en) Method and apparatus for automatically recognizing audio data
ATE256329T1 (de) Verfahren zur verringerung von datenbankanforderungen für ein spracherkennungssystem
ATE358042T1 (de) Vorrichtung und verfahren zur behandlung von werkstücken, insbesondere fahrzeugkarossen
ATE407421T1 (de) Vorrichtung und verfahren für speicherung von spracherkennungsmodellen
DE69613293T2 (de) Vorrichtung zur Musteranpassung für Sprach- oder Mustererkennung
ATE342563T1 (de) Verfahren und vorrichtung zur einschränkung des suchumfangs in einem lexikon für spracherkennung
NO20051096D0 (no) System og fremgangsmate for bronnoverhaling med horisontaltre
DE60028219D1 (de) Verfahren zur Spracherkennung
ATE302636T1 (de) Verfahren und vorrichtung für atemluftproduktion
DE60032776D1 (de) Verfahren zur Spracherkennung

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties