ATE440360T1 - Verfahren und system zur echtzeit-spracherkennung - Google Patents
Verfahren und system zur echtzeit-spracherkennungInfo
- Publication number
- ATE440360T1 ATE440360T1 AT02801823T AT02801823T ATE440360T1 AT E440360 T1 ATE440360 T1 AT E440360T1 AT 02801823 T AT02801823 T AT 02801823T AT 02801823 T AT02801823 T AT 02801823T AT E440360 T1 ATE440360 T1 AT E440360T1
- Authority
- AT
- Austria
- Prior art keywords
- real
- voice recognition
- time voice
- processor
- processor units
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 2
- 238000000605 extraction Methods 0.000 abstract 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/34—Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Mathematical Physics (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Computing Systems (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Complex Calculations (AREA)
- Multi Processors (AREA)
- Image Processing (AREA)
- Mobile Radio Communication Systems (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002359544A CA2359544A1 (en) | 2001-10-22 | 2001-10-22 | Low-resource real-time speech recognition system using an oversampled filterbank |
PCT/CA2002/001578 WO2003036618A1 (en) | 2001-10-22 | 2002-10-22 | Method and system for real-time speech recognition |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE440360T1 true ATE440360T1 (de) | 2009-09-15 |
Family
ID=4170315
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT02801823T ATE440360T1 (de) | 2001-10-22 | 2002-10-22 | Verfahren und system zur echtzeit-spracherkennung |
Country Status (7)
Country | Link |
---|---|
US (1) | US7139707B2 (de) |
EP (1) | EP1449203B1 (de) |
AT (1) | ATE440360T1 (de) |
CA (1) | CA2359544A1 (de) |
DE (1) | DE60233426D1 (de) |
DK (1) | DK1449203T3 (de) |
WO (1) | WO2003036618A1 (de) |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7610199B2 (en) * | 2004-09-01 | 2009-10-27 | Sri International | Method and apparatus for obtaining complete speech signals for speech recognition applications |
WO2006075648A1 (ja) * | 2005-01-17 | 2006-07-20 | Nec Corporation | 音声認識システム、音声認識方法及び音声認識プログラム |
US7587441B2 (en) | 2005-06-29 | 2009-09-08 | L-3 Communications Integrated Systems L.P. | Systems and methods for weighted overlap and add processing |
US7249868B2 (en) * | 2005-07-07 | 2007-07-31 | Visteon Global Technologies, Inc. | Lamp housing with interior cooling by a thermoelectric device |
US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation |
US8380506B2 (en) * | 2006-01-27 | 2013-02-19 | Georgia Tech Research Corporation | Automatic pattern recognition using category dependent feature selection |
US8195462B2 (en) * | 2006-02-16 | 2012-06-05 | At&T Intellectual Property Ii, L.P. | System and method for providing large vocabulary speech processing based on fixed-point arithmetic |
US8010358B2 (en) | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization |
US7778831B2 (en) * | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
US8818802B2 (en) | 2008-10-10 | 2014-08-26 | Spansion Llc | Real-time data pattern analysis system and method of operation thereof |
WO2010042631A2 (en) * | 2008-10-10 | 2010-04-15 | Fastow Richard M | Real-time data pattern analysis system and method of operation thereof |
US8442833B2 (en) | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
US8788256B2 (en) | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition |
US8442829B2 (en) | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
CN102256201A (zh) * | 2010-05-19 | 2011-11-23 | 上海聪维声学技术有限公司 | 用于助听器的自动环境识别方法 |
US8924453B2 (en) * | 2011-12-19 | 2014-12-30 | Spansion Llc | Arithmetic logic unit architecture |
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis |
US9514739B2 (en) * | 2012-06-06 | 2016-12-06 | Cypress Semiconductor Corporation | Phoneme score accelerator |
US9224384B2 (en) * | 2012-06-06 | 2015-12-29 | Cypress Semiconductor Corporation | Histogram based pre-pruning scheme for active HMMS |
US9542933B2 (en) | 2013-03-08 | 2017-01-10 | Analog Devices Global | Microphone circuit assembly and system with speech recognition |
US9836450B2 (en) * | 2014-12-09 | 2017-12-05 | Sansa AI Inc. | Methods and systems for providing universal portability in machine learning |
US10540957B2 (en) * | 2014-12-15 | 2020-01-21 | Baidu Usa Llc | Systems and methods for speech transcription |
US10089989B2 (en) * | 2015-12-07 | 2018-10-02 | Semiconductor Components Industries, Llc | Method and apparatus for a low power voice trigger device |
CN109102799B (zh) * | 2018-08-17 | 2023-01-24 | 信阳师范学院 | 一种基于频域系数对数和的语音端点检测方法 |
CN108962249B (zh) * | 2018-08-21 | 2023-03-31 | 广州市保伦电子有限公司 | 一种基于mfcc语音特征的语音匹配方法及存储介质 |
CN110875034B (zh) * | 2018-09-03 | 2024-03-22 | 嘉楠明芯(北京)科技有限公司 | 用于语音识别的模板训练方法、语音识别方法及其系统 |
CN111354337A (zh) * | 2018-12-24 | 2020-06-30 | 上海新微技术研发中心有限公司 | 语音识别方法以及用户终端 |
US11158322B2 (en) * | 2019-09-06 | 2021-10-26 | Verbit Software Ltd. | Human resolution of repeated phrases in a hybrid transcription system |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5704004A (en) * | 1993-12-01 | 1997-12-30 | Industrial Technology Research Institute | Apparatus and method for normalizing and categorizing linear prediction code vectors using Bayesian categorization technique |
US6236731B1 (en) * | 1997-04-16 | 2001-05-22 | Dspfactory Ltd. | Filterbank structure and method for filtering and separating an information signal into different bands, particularly for audio signal in hearing aids |
EP0878790A1 (de) * | 1997-05-15 | 1998-11-18 | Hewlett-Packard Company | Sprachkodiersystem und Verfahren |
US6249761B1 (en) * | 1997-09-30 | 2001-06-19 | At&T Corp. | Assigning and processing states and arcs of a speech recognition model in parallel processors |
JP4197195B2 (ja) * | 1998-02-27 | 2008-12-17 | ヒューレット・パッカード・カンパニー | 音声情報の提供方法 |
EP1082719B1 (de) * | 1999-04-01 | 2013-07-03 | Koninklijke Philips Electronics N.V. | Mehrstufiger spracherkenner |
FI19992350A (fi) * | 1999-10-29 | 2001-04-30 | Nokia Mobile Phones Ltd | Parannettu puheentunnistus |
-
2001
- 2001-10-22 CA CA002359544A patent/CA2359544A1/en not_active Abandoned
-
2002
- 2002-10-22 DE DE60233426T patent/DE60233426D1/de not_active Expired - Lifetime
- 2002-10-22 US US10/277,454 patent/US7139707B2/en active Active
- 2002-10-22 AT AT02801823T patent/ATE440360T1/de not_active IP Right Cessation
- 2002-10-22 EP EP02801823A patent/EP1449203B1/de not_active Expired - Lifetime
- 2002-10-22 WO PCT/CA2002/001578 patent/WO2003036618A1/en not_active Application Discontinuation
- 2002-10-22 DK DK02801823T patent/DK1449203T3/da active
Also Published As
Publication number | Publication date |
---|---|
CA2359544A1 (en) | 2003-04-22 |
DK1449203T3 (da) | 2009-11-09 |
DE60233426D1 (de) | 2009-10-01 |
US7139707B2 (en) | 2006-11-21 |
US20030110033A1 (en) | 2003-06-12 |
EP1449203B1 (de) | 2009-08-19 |
EP1449203A1 (de) | 2004-08-25 |
WO2003036618A1 (en) | 2003-05-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE440360T1 (de) | Verfahren und system zur echtzeit-spracherkennung | |
DE60321256D1 (de) | Spracherkennungssystem, Spracherkennungsverfahren und Programmprodukt | |
DE69811921D1 (de) | Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung | |
ATE410768T1 (de) | System und verfahren zum betrieb eines spracherkennungssystems in einem fahrzeug | |
DE60219943D1 (de) | Verfahren zum komprimieren von wörterbuchdaten | |
DE60309822D1 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
ATE410447T1 (de) | Verfahren zur kollagenherstellung | |
DE60317025D1 (de) | Vorrichtung und Verfahren zur Gesichtserkennung | |
DE60318544D1 (de) | Sprachmodell für die Spracherkennung | |
ATE345526T1 (de) | Informationsverarbeitungsvorrichtung und - verfahren und programmprodukt | |
DE60221530D1 (de) | Verfahren und vorrichtung zum unterdrücken von tönen, die durch dem-algorithmen (cyclic dynamic element matching) verursacht werden | |
DE60124471D1 (de) | Vorrichtung zur Spracherkennung | |
WO2003067572A3 (en) | Speech recognition circuit using parallel processors | |
DE50104036D1 (de) | Spracherkennungssystem und Verfahren zum Betrieb eines solchen | |
SG140445A1 (en) | Method and apparatus for automatically recognizing audio data | |
ATE256329T1 (de) | Verfahren zur verringerung von datenbankanforderungen für ein spracherkennungssystem | |
ES2162946T3 (es) | Sistema de proceso de informacion. | |
ATE407421T1 (de) | Vorrichtung und verfahren für speicherung von spracherkennungsmodellen | |
DE69613293T2 (de) | Vorrichtung zur Musteranpassung für Sprach- oder Mustererkennung | |
ATE342563T1 (de) | Verfahren und vorrichtung zur einschränkung des suchumfangs in einem lexikon für spracherkennung | |
NO20051096D0 (no) | System og fremgangsmate for bronnoverhaling med horisontaltre | |
DE60028219D1 (de) | Verfahren zur Spracherkennung | |
DE60032776D1 (de) | Verfahren zur Spracherkennung | |
DE60301390D1 (de) | Verfahren und vorrichtung für atemluftproduktion | |
ATE422087T1 (de) | Verfahren und vorrichtung zur durchführung von spracherkennung über einen sprachkanal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |