ATE440360T1 - Verfahren und system zur echtzeit-spracherkennung - Google Patents
Verfahren und system zur echtzeit-spracherkennungInfo
- Publication number
- ATE440360T1 ATE440360T1 AT02801823T AT02801823T ATE440360T1 AT E440360 T1 ATE440360 T1 AT E440360T1 AT 02801823 T AT02801823 T AT 02801823T AT 02801823 T AT02801823 T AT 02801823T AT E440360 T1 ATE440360 T1 AT E440360T1
- Authority
- AT
- Austria
- Prior art keywords
- real
- voice recognition
- time voice
- processor
- processor units
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 2
- 238000000605 extraction Methods 0.000 abstract 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/34—Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Mathematical Physics (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Computing Systems (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Complex Calculations (AREA)
- Mobile Radio Communication Systems (AREA)
- Image Processing (AREA)
- Multi Processors (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CA002359544A CA2359544A1 (en) | 2001-10-22 | 2001-10-22 | Low-resource real-time speech recognition system using an oversampled filterbank |
| PCT/CA2002/001578 WO2003036618A1 (en) | 2001-10-22 | 2002-10-22 | Method and system for real-time speech recognition |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE440360T1 true ATE440360T1 (de) | 2009-09-15 |
Family
ID=4170315
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AT02801823T ATE440360T1 (de) | 2001-10-22 | 2002-10-22 | Verfahren und system zur echtzeit-spracherkennung |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US7139707B2 (de) |
| EP (1) | EP1449203B1 (de) |
| AT (1) | ATE440360T1 (de) |
| CA (1) | CA2359544A1 (de) |
| DE (1) | DE60233426D1 (de) |
| DK (1) | DK1449203T3 (de) |
| WO (1) | WO2003036618A1 (de) |
Families Citing this family (29)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7610199B2 (en) * | 2004-09-01 | 2009-10-27 | Sri International | Method and apparatus for obtaining complete speech signals for speech recognition applications |
| CN101120397B (zh) | 2005-01-17 | 2011-08-17 | 日本电气株式会社 | 语音识别系统、语音识别方法 |
| US7587441B2 (en) * | 2005-06-29 | 2009-09-08 | L-3 Communications Integrated Systems L.P. | Systems and methods for weighted overlap and add processing |
| US7249868B2 (en) * | 2005-07-07 | 2007-07-31 | Visteon Global Technologies, Inc. | Lamp housing with interior cooling by a thermoelectric device |
| US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation |
| US8380506B2 (en) * | 2006-01-27 | 2013-02-19 | Georgia Tech Research Corporation | Automatic pattern recognition using category dependent feature selection |
| US8195462B2 (en) * | 2006-02-16 | 2012-06-05 | At&T Intellectual Property Ii, L.P. | System and method for providing large vocabulary speech processing based on fixed-point arithmetic |
| US7778831B2 (en) * | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
| US8010358B2 (en) | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization |
| US8818802B2 (en) | 2008-10-10 | 2014-08-26 | Spansion Llc | Real-time data pattern analysis system and method of operation thereof |
| WO2010042631A2 (en) * | 2008-10-10 | 2010-04-15 | Fastow Richard M | Real-time data pattern analysis system and method of operation thereof |
| US8788256B2 (en) | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition |
| US8442833B2 (en) | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
| US8442829B2 (en) | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
| CN102256201A (zh) * | 2010-05-19 | 2011-11-23 | 上海聪维声学技术有限公司 | 用于助听器的自动环境识别方法 |
| US8924453B2 (en) * | 2011-12-19 | 2014-12-30 | Spansion Llc | Arithmetic logic unit architecture |
| US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis |
| US9514739B2 (en) * | 2012-06-06 | 2016-12-06 | Cypress Semiconductor Corporation | Phoneme score accelerator |
| US9224384B2 (en) * | 2012-06-06 | 2015-12-29 | Cypress Semiconductor Corporation | Histogram based pre-pruning scheme for active HMMS |
| US9542933B2 (en) | 2013-03-08 | 2017-01-10 | Analog Devices Global | Microphone circuit assembly and system with speech recognition |
| US9836450B2 (en) * | 2014-12-09 | 2017-12-05 | Sansa AI Inc. | Methods and systems for providing universal portability in machine learning |
| US10540957B2 (en) * | 2014-12-15 | 2020-01-21 | Baidu Usa Llc | Systems and methods for speech transcription |
| US10089989B2 (en) * | 2015-12-07 | 2018-10-02 | Semiconductor Components Industries, Llc | Method and apparatus for a low power voice trigger device |
| CN109102799B (zh) * | 2018-08-17 | 2023-01-24 | 信阳师范学院 | 一种基于频域系数对数和的语音端点检测方法 |
| CN108962249B (zh) * | 2018-08-21 | 2023-03-31 | 广州市保伦电子有限公司 | 一种基于mfcc语音特征的语音匹配方法及存储介质 |
| CN110875034B (zh) * | 2018-09-03 | 2024-03-22 | 嘉楠明芯(北京)科技有限公司 | 用于语音识别的模板训练方法、语音识别方法及其系统 |
| CN111354337A (zh) * | 2018-12-24 | 2020-06-30 | 上海新微技术研发中心有限公司 | 语音识别方法以及用户终端 |
| US10726834B1 (en) * | 2019-09-06 | 2020-07-28 | Verbit Software Ltd. | Human-based accent detection to assist rapid transcription with automatic speech recognition |
| CN117672201A (zh) * | 2023-12-05 | 2024-03-08 | 舜泰汽车有限公司 | 一种农机无人驾驶语音识别的控制系统 |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5704004A (en) * | 1993-12-01 | 1997-12-30 | Industrial Technology Research Institute | Apparatus and method for normalizing and categorizing linear prediction code vectors using Bayesian categorization technique |
| US6236731B1 (en) * | 1997-04-16 | 2001-05-22 | Dspfactory Ltd. | Filterbank structure and method for filtering and separating an information signal into different bands, particularly for audio signal in hearing aids |
| EP0878790A1 (de) * | 1997-05-15 | 1998-11-18 | Hewlett-Packard Company | Sprachkodiersystem und Verfahren |
| US6249761B1 (en) * | 1997-09-30 | 2001-06-19 | At&T Corp. | Assigning and processing states and arcs of a speech recognition model in parallel processors |
| JP4197195B2 (ja) * | 1998-02-27 | 2008-12-17 | ヒューレット・パッカード・カンパニー | 音声情報の提供方法 |
| EP1082719B1 (de) | 1999-04-01 | 2013-07-03 | Koninklijke Philips Electronics N.V. | Mehrstufiger spracherkenner |
| FI19992350L (fi) * | 1999-10-29 | 2001-04-30 | Nokia Mobile Phones Ltd | Parannettu puheentunnistus |
-
2001
- 2001-10-22 CA CA002359544A patent/CA2359544A1/en not_active Abandoned
-
2002
- 2002-10-22 US US10/277,454 patent/US7139707B2/en not_active Expired - Lifetime
- 2002-10-22 EP EP02801823A patent/EP1449203B1/de not_active Expired - Lifetime
- 2002-10-22 AT AT02801823T patent/ATE440360T1/de not_active IP Right Cessation
- 2002-10-22 DK DK02801823T patent/DK1449203T3/da active
- 2002-10-22 DE DE60233426T patent/DE60233426D1/de not_active Expired - Lifetime
- 2002-10-22 WO PCT/CA2002/001578 patent/WO2003036618A1/en not_active Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| DK1449203T3 (da) | 2009-11-09 |
| WO2003036618A1 (en) | 2003-05-01 |
| DE60233426D1 (de) | 2009-10-01 |
| EP1449203B1 (de) | 2009-08-19 |
| US7139707B2 (en) | 2006-11-21 |
| US20030110033A1 (en) | 2003-06-12 |
| CA2359544A1 (en) | 2003-04-22 |
| EP1449203A1 (de) | 2004-08-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ATE440360T1 (de) | Verfahren und system zur echtzeit-spracherkennung | |
| DE60222093D1 (de) | Verfahren, modul, vorrichtung und server zur spracherkennung | |
| ATE361523T1 (de) | Verfahren zum komprimieren von wörterbuchdaten | |
| DE60310785D1 (de) | Verfahren und Vorrichtung zur Übersetzung von gesprochener Sprache | |
| DE60309822D1 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
| EP1441328A4 (de) | Vorrichtung und verfahren zur spracherkennung | |
| FI20040872L (fi) | Menetelmä ja laitteisto monitasoiseksi hajautetuksi puheentunnistukseksi | |
| WO2005058018A3 (en) | System and method for plant identification | |
| ATE410447T1 (de) | Verfahren zur kollagenherstellung | |
| ATE233935T1 (de) | Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung | |
| DE60317025D1 (de) | Vorrichtung und Verfahren zur Gesichtserkennung | |
| ATE345526T1 (de) | Informationsverarbeitungsvorrichtung und - verfahren und programmprodukt | |
| WO2003067572A3 (en) | Speech recognition circuit using parallel processors | |
| DE502006007957D1 (de) | Reinigungsanlage | |
| DE60124471D1 (de) | Vorrichtung zur Spracherkennung | |
| SG140445A1 (en) | Method and apparatus for automatically recognizing audio data | |
| DE69630999T2 (de) | Verfahren zur verringerung von datenbankanforderungen für ein spracherkennungssystem | |
| WO2004046890A3 (en) | Method and system for processing sales process information, for sales process configuration, for sales process integration, and for modeling sales processes | |
| DE60228681D1 (de) | Vorrichtung und verfahren für speicherung von spracherkennungsmodellen | |
| DE60229315D1 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
| DE60028219D1 (de) | Verfahren zur Spracherkennung | |
| ATE302636T1 (de) | Verfahren und vorrichtung für atemluftproduktion | |
| DE60122893D1 (de) | Verfahren, vorrichtung und programm zur sprecherkennung | |
| ATE422087T1 (de) | Verfahren und vorrichtung zur durchführung von spracherkennung über einen sprachkanal | |
| NO20051096D0 (no) | System og fremgangsmate for bronnoverhaling med horisontaltre |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |