BR0206413A - Sistema e método para armazenamento eficiente de modelos de reconhecimento de voz - Google Patents
Sistema e método para armazenamento eficiente de modelos de reconhecimento de vozInfo
- Publication number
- BR0206413A BR0206413A BR0206413-8A BR0206413A BR0206413A BR 0206413 A BR0206413 A BR 0206413A BR 0206413 A BR0206413 A BR 0206413A BR 0206413 A BR0206413 A BR 0206413A
- Authority
- BR
- Brazil
- Prior art keywords
- models
- voice recognition
- law
- compress
- efficient storage
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 4
- 230000006835 compression Effects 0.000 abstract 3
- 238000007906 compression Methods 0.000 abstract 3
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Artificial Intelligence (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Telephonic Communication Services (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
- Supply And Installment Of Electrical Components (AREA)
Abstract
"SISTEMA E MéTODO PARA ARMAZENAMENTO EFICIENTE DE MODELOS DE RECONHECIMENTO DE VOZ". um método e sistema que aperfeiçoam o reconhecimento de voz por meio do aperfeiçoamento do armazenamento dos gabaritos de reconhecimento de voz (VR). O armazenamento aperfeiçoado significa que mais modelos de VR podem ser armazenados na memória. Mais modelos de VR podem ser armazenados na memória, mais robusto é o sistema de VR e portanto mais preciso será o sistema de VR. As técnicas de compressão com perda são utilizadas para comprimir modelos de VR. Em uma modalidade, a compressão por Lei-A e a expansão por Lei-A são utilizadas para comprimir e expandir os modelos de VR. Em outra modalidade, a compressão por Lei-Mu e a expansão por Lei-Mu são utilizadas para comprimir e expandir os modelos de VR. Os modelos de VR são comprimidos durante um processo de treinamento e são expandidos durante o reconhecimento de voz.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/760,076 US6681207B2 (en) | 2001-01-12 | 2001-01-12 | System and method for lossy compression of voice recognition models |
PCT/US2002/000890 WO2002059871A2 (en) | 2001-01-12 | 2002-01-10 | System and method for efficient storage of voice recognition models |
Publications (1)
Publication Number | Publication Date |
---|---|
BR0206413A true BR0206413A (pt) | 2004-06-22 |
Family
ID=25058019
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BR0206413-8A BR0206413A (pt) | 2001-01-12 | 2002-01-10 | Sistema e método para armazenamento eficiente de modelos de reconhecimento de voz |
Country Status (12)
Country | Link |
---|---|
US (2) | US6681207B2 (pt) |
EP (1) | EP1352389B1 (pt) |
JP (1) | JP2004523788A (pt) |
CN (1) | CN100527224C (pt) |
AT (1) | ATE407421T1 (pt) |
AU (1) | AU2002246992A1 (pt) |
BR (1) | BR0206413A (pt) |
CA (1) | CA2434562A1 (pt) |
DE (1) | DE60228681D1 (pt) |
IL (1) | IL156891A0 (pt) |
TW (1) | TW546632B (pt) |
WO (1) | WO2002059871A2 (pt) |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6681207B2 (en) * | 2001-01-12 | 2004-01-20 | Qualcomm Incorporated | System and method for lossy compression of voice recognition models |
US20030004720A1 (en) * | 2001-01-30 | 2003-01-02 | Harinath Garudadri | System and method for computing and transmitting parameters in a distributed voice recognition system |
US7941313B2 (en) * | 2001-05-17 | 2011-05-10 | Qualcomm Incorporated | System and method for transmitting speech activity information ahead of speech features in a distributed voice recognition system |
US7203643B2 (en) * | 2001-06-14 | 2007-04-10 | Qualcomm Incorporated | Method and apparatus for transmitting speech activity in distributed voice recognition systems |
US7379868B2 (en) * | 2002-07-18 | 2008-05-27 | Massachusetts Institute Of Technology | Method and apparatus for differential compression of speaker models |
US7333930B2 (en) * | 2003-03-14 | 2008-02-19 | Agere Systems Inc. | Tonal analysis for perceptual audio coding using a compressed spectral representation |
JP4531350B2 (ja) * | 2003-06-04 | 2010-08-25 | アルパイン株式会社 | 音声入力装置および音声認識処理システム |
US7558744B2 (en) * | 2004-01-23 | 2009-07-07 | Razumov Sergey N | Multimedia terminal for product ordering |
US7430328B2 (en) * | 2004-12-01 | 2008-09-30 | Honeywell International Inc. | Rice lossless compression module |
US20060136210A1 (en) * | 2004-12-16 | 2006-06-22 | Sony Corporation | System and method for tying variance vectors for speech recognition |
US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation |
US7778831B2 (en) * | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
US8010358B2 (en) * | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization |
US8108205B2 (en) * | 2006-12-01 | 2012-01-31 | Microsoft Corporation | Leveraging back-off grammars for authoring context-free grammars |
US20100106269A1 (en) * | 2008-09-26 | 2010-04-29 | Qualcomm Incorporated | Method and apparatus for signal processing using transform-domain log-companding |
FI20086260A (fi) * | 2008-12-31 | 2010-09-02 | Teknillinen Korkeakoulu | Menetelmä hahmon löytämiseksi ja tunnistamiseksi |
US8442829B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
US8442833B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
US8788256B2 (en) * | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition |
US8510103B2 (en) * | 2009-10-15 | 2013-08-13 | Paul Angott | System and method for voice recognition |
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis |
US9653070B2 (en) * | 2012-12-31 | 2017-05-16 | Intel Corporation | Flexible architecture for acoustic signal processing engine |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE69030561T2 (de) * | 1989-12-28 | 1997-10-09 | Sharp Kk | Spracherkennungseinrichtung |
JP2524472B2 (ja) * | 1992-09-21 | 1996-08-14 | インターナショナル・ビジネス・マシーンズ・コーポレイション | 電話回線利用の音声認識システムを訓練する方法 |
US5627939A (en) | 1993-09-03 | 1997-05-06 | Microsoft Corporation | Speech recognition system and method employing data compression |
CN1160450A (zh) | 1994-09-07 | 1997-09-24 | 摩托罗拉公司 | 从连续语音中识别讲话声音的系统及其应用方法 |
US6009387A (en) * | 1997-03-20 | 1999-12-28 | International Business Machines Corporation | System and method of compression/decompressing a speech signal by using split vector quantization and scalar quantization |
US6370504B1 (en) * | 1997-05-29 | 2002-04-09 | University Of Washington | Speech recognition on MPEG/Audio encoded files |
US6044346A (en) * | 1998-03-09 | 2000-03-28 | Lucent Technologies Inc. | System and method for operating a digital voice recognition processor with flash memory storage |
US6463413B1 (en) * | 1999-04-20 | 2002-10-08 | Matsushita Electrical Industrial Co., Ltd. | Speech recognition training for small hardware devices |
DE10043946C2 (de) * | 2000-09-06 | 2002-12-12 | Siemens Ag | Komprimieren von HMM-Prototypen |
US6694294B1 (en) * | 2000-10-31 | 2004-02-17 | Qualcomm Incorporated | System and method of mu-law or A-law compression of bark amplitudes for speech recognition |
US6681207B2 (en) * | 2001-01-12 | 2004-01-20 | Qualcomm Incorporated | System and method for lossy compression of voice recognition models |
-
2001
- 2001-01-12 US US09/760,076 patent/US6681207B2/en not_active Expired - Lifetime
-
2002
- 2002-01-10 DE DE60228681T patent/DE60228681D1/de not_active Expired - Lifetime
- 2002-01-10 WO PCT/US2002/000890 patent/WO2002059871A2/en active Application Filing
- 2002-01-10 AU AU2002246992A patent/AU2002246992A1/en not_active Abandoned
- 2002-01-10 CN CNB028048164A patent/CN100527224C/zh not_active Expired - Lifetime
- 2002-01-10 BR BR0206413-8A patent/BR0206413A/pt not_active IP Right Cessation
- 2002-01-10 IL IL15689102A patent/IL156891A0/xx unknown
- 2002-01-10 CA CA002434562A patent/CA2434562A1/en not_active Abandoned
- 2002-01-10 EP EP02714742A patent/EP1352389B1/en not_active Expired - Lifetime
- 2002-01-10 JP JP2002560118A patent/JP2004523788A/ja active Pending
- 2002-01-10 AT AT02714742T patent/ATE407421T1/de not_active IP Right Cessation
- 2002-01-11 TW TW091100301A patent/TW546632B/zh not_active IP Right Cessation
-
2003
- 2003-11-12 US US10/712,583 patent/US7136815B2/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
CN100527224C (zh) | 2009-08-12 |
EP1352389B1 (en) | 2008-09-03 |
JP2004523788A (ja) | 2004-08-05 |
US20020133345A1 (en) | 2002-09-19 |
TW546632B (en) | 2003-08-11 |
CN1582468A (zh) | 2005-02-16 |
IL156891A0 (en) | 2004-02-08 |
US6681207B2 (en) | 2004-01-20 |
US20040098258A1 (en) | 2004-05-20 |
WO2002059871A2 (en) | 2002-08-01 |
EP1352389A2 (en) | 2003-10-15 |
WO2002059871A3 (en) | 2003-03-13 |
CA2434562A1 (en) | 2002-08-01 |
AU2002246992A1 (en) | 2002-08-06 |
US7136815B2 (en) | 2006-11-14 |
ATE407421T1 (de) | 2008-09-15 |
DE60228681D1 (de) | 2008-10-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR0206413A (pt) | Sistema e método para armazenamento eficiente de modelos de reconhecimento de voz | |
PH12019501881A1 (en) | Method and apparatus for the efficient compression of genomic sequence reads | |
ATE410768T1 (de) | System und verfahren zum betrieb eines spracherkennungssystems in einem fahrzeug | |
TW357313B (en) | Methods and apparatus for handwriting recognition | |
EP0847041A3 (en) | Method and apparatus for speech recognition performing noise adaptation | |
WO2003014866A3 (en) | Method and system for dynamic learning through a regression-based library generation process | |
ATE426526T1 (de) | System und verfahren zur auswahl eines benutzersprachprofils fur eine vorrichtung in einem fahrzeug | |
AU2017408800A1 (en) | Method and system of mining information, electronic device and readable storable medium | |
DE60225170D1 (de) | Verfahren und vorrichtung zum dekodieren handschriftlicher zeichen | |
EP1416386A3 (en) | Method and apparatus for effectively enabling an out of sequence write process within a non-volatile memory system | |
EP2306345A3 (en) | Speech retrieval apparatus and speech retrieval method | |
WO2004070558A3 (en) | Method and apparatus to identify a work received by a processing system | |
WO2007138600A3 (en) | Method and system for transformation of logical data objects for storage | |
EP1447792A3 (en) | Method and apparatus for modeling a speech recognition system and for predicting word error rates from text | |
EP1551007A4 (en) | LANGUAGE MODEL CREATION / CREATION DEVICE, VOICE RECOGNITION DEVICE, LANGUAGE MODEL CREATION METHOD, AND VOICE RECOGNITION METHOD | |
WO2005017807A3 (en) | Apparatus and method for classifying multi-dimensional biological data | |
ATE407411T1 (de) | Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text | |
TW200639691A (en) | System, method and medium of automatic document classification | |
TW200700987A (en) | Method and apparatus for performing multi-programmable function with one-time programmable memories | |
EA200501534A1 (ru) | Способ получения экстракта из листьев плюща | |
DE60231422D1 (de) | Verfahren und Vorrichtung zum Starten von Leseoptimierungen in Speicher-Verbindungen | |
EP1187098A3 (de) | Komprimieren von HMM-Prototypen | |
DE602004001948D1 (de) | Verfahren zum Trennen von Platten, die miteinander geklebt sind und eine gestapelte Struktur bilden | |
ATE377241T1 (de) | Verfahren zur spracherkennung mit automatischen korrektur | |
ATE384295T1 (de) | Methode und verfahren für seitenband- leseantwortkopf in einem speicher- verbindungsnetzwerk |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
B08F | Application dismissed because of non-payment of annual fees [chapter 8.6 patent gazette] |
Free format text: REFERENTE A 10A ANUIDADE. |
|
B08K | Patent lapsed as no evidence of payment of the annual fee has been furnished to inpi [chapter 8.11 patent gazette] |
Free format text: REFERENTE AO DESPACHO 8.6 PUBLICADO NA RPI 2168 DE 24/07/2012. |