BR0206413A - Sistema e método para armazenamento eficiente de modelos de reconhecimento de voz - Google Patents

Sistema e método para armazenamento eficiente de modelos de reconhecimento de voz

Info

Publication number
BR0206413A
BR0206413A BR0206413-8A BR0206413A BR0206413A BR 0206413 A BR0206413 A BR 0206413A BR 0206413 A BR0206413 A BR 0206413A BR 0206413 A BR0206413 A BR 0206413A
Authority
BR
Brazil
Prior art keywords
models
voice recognition
law
compress
efficient storage
Prior art date
Application number
BR0206413-8A
Other languages
English (en)
Inventor
Harinath Garudadri
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of BR0206413A publication Critical patent/BR0206413A/pt

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
  • Supply And Installment Of Electrical Components (AREA)

Abstract

"SISTEMA E MéTODO PARA ARMAZENAMENTO EFICIENTE DE MODELOS DE RECONHECIMENTO DE VOZ". um método e sistema que aperfeiçoam o reconhecimento de voz por meio do aperfeiçoamento do armazenamento dos gabaritos de reconhecimento de voz (VR). O armazenamento aperfeiçoado significa que mais modelos de VR podem ser armazenados na memória. Mais modelos de VR podem ser armazenados na memória, mais robusto é o sistema de VR e portanto mais preciso será o sistema de VR. As técnicas de compressão com perda são utilizadas para comprimir modelos de VR. Em uma modalidade, a compressão por Lei-A e a expansão por Lei-A são utilizadas para comprimir e expandir os modelos de VR. Em outra modalidade, a compressão por Lei-Mu e a expansão por Lei-Mu são utilizadas para comprimir e expandir os modelos de VR. Os modelos de VR são comprimidos durante um processo de treinamento e são expandidos durante o reconhecimento de voz.
BR0206413-8A 2001-01-12 2002-01-10 Sistema e método para armazenamento eficiente de modelos de reconhecimento de voz BR0206413A (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/760,076 US6681207B2 (en) 2001-01-12 2001-01-12 System and method for lossy compression of voice recognition models
PCT/US2002/000890 WO2002059871A2 (en) 2001-01-12 2002-01-10 System and method for efficient storage of voice recognition models

Publications (1)

Publication Number Publication Date
BR0206413A true BR0206413A (pt) 2004-06-22

Family

ID=25058019

Family Applications (1)

Application Number Title Priority Date Filing Date
BR0206413-8A BR0206413A (pt) 2001-01-12 2002-01-10 Sistema e método para armazenamento eficiente de modelos de reconhecimento de voz

Country Status (12)

Country Link
US (2) US6681207B2 (pt)
EP (1) EP1352389B1 (pt)
JP (1) JP2004523788A (pt)
CN (1) CN100527224C (pt)
AT (1) ATE407421T1 (pt)
AU (1) AU2002246992A1 (pt)
BR (1) BR0206413A (pt)
CA (1) CA2434562A1 (pt)
DE (1) DE60228681D1 (pt)
IL (1) IL156891A0 (pt)
TW (1) TW546632B (pt)
WO (1) WO2002059871A2 (pt)

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6681207B2 (en) * 2001-01-12 2004-01-20 Qualcomm Incorporated System and method for lossy compression of voice recognition models
US20030004720A1 (en) * 2001-01-30 2003-01-02 Harinath Garudadri System and method for computing and transmitting parameters in a distributed voice recognition system
US7941313B2 (en) * 2001-05-17 2011-05-10 Qualcomm Incorporated System and method for transmitting speech activity information ahead of speech features in a distributed voice recognition system
US7203643B2 (en) * 2001-06-14 2007-04-10 Qualcomm Incorporated Method and apparatus for transmitting speech activity in distributed voice recognition systems
US7379868B2 (en) * 2002-07-18 2008-05-27 Massachusetts Institute Of Technology Method and apparatus for differential compression of speaker models
US7333930B2 (en) * 2003-03-14 2008-02-19 Agere Systems Inc. Tonal analysis for perceptual audio coding using a compressed spectral representation
JP4531350B2 (ja) * 2003-06-04 2010-08-25 アルパイン株式会社 音声入力装置および音声認識処理システム
US7558744B2 (en) * 2004-01-23 2009-07-07 Razumov Sergey N Multimedia terminal for product ordering
US7430328B2 (en) * 2004-12-01 2008-09-30 Honeywell International Inc. Rice lossless compression module
US20060136210A1 (en) * 2004-12-16 2006-06-22 Sony Corporation System and method for tying variance vectors for speech recognition
US7970613B2 (en) 2005-11-12 2011-06-28 Sony Computer Entertainment Inc. Method and system for Gaussian probability data bit reduction and computation
US7778831B2 (en) * 2006-02-21 2010-08-17 Sony Computer Entertainment Inc. Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
US8010358B2 (en) * 2006-02-21 2011-08-30 Sony Computer Entertainment Inc. Voice recognition with parallel gender and age normalization
US8108205B2 (en) * 2006-12-01 2012-01-31 Microsoft Corporation Leveraging back-off grammars for authoring context-free grammars
US20100106269A1 (en) * 2008-09-26 2010-04-29 Qualcomm Incorporated Method and apparatus for signal processing using transform-domain log-companding
FI20086260A (fi) * 2008-12-31 2010-09-02 Teknillinen Korkeakoulu Menetelmä hahmon löytämiseksi ja tunnistamiseksi
US8442829B2 (en) * 2009-02-17 2013-05-14 Sony Computer Entertainment Inc. Automatic computation streaming partition for voice recognition on multiple processors with limited memory
US8442833B2 (en) * 2009-02-17 2013-05-14 Sony Computer Entertainment Inc. Speech processing with source location estimation using signals from two or more microphones
US8788256B2 (en) * 2009-02-17 2014-07-22 Sony Computer Entertainment Inc. Multiple language voice recognition
US8510103B2 (en) * 2009-10-15 2013-08-13 Paul Angott System and method for voice recognition
US9153235B2 (en) 2012-04-09 2015-10-06 Sony Computer Entertainment Inc. Text dependent speaker recognition with long-term feature based on functional data analysis
US9653070B2 (en) * 2012-12-31 2017-05-16 Intel Corporation Flexible architecture for acoustic signal processing engine

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69030561T2 (de) * 1989-12-28 1997-10-09 Sharp Kk Spracherkennungseinrichtung
JP2524472B2 (ja) * 1992-09-21 1996-08-14 インターナショナル・ビジネス・マシーンズ・コーポレイション 電話回線利用の音声認識システムを訓練する方法
US5627939A (en) 1993-09-03 1997-05-06 Microsoft Corporation Speech recognition system and method employing data compression
CN1160450A (zh) 1994-09-07 1997-09-24 摩托罗拉公司 从连续语音中识别讲话声音的系统及其应用方法
US6009387A (en) * 1997-03-20 1999-12-28 International Business Machines Corporation System and method of compression/decompressing a speech signal by using split vector quantization and scalar quantization
US6370504B1 (en) * 1997-05-29 2002-04-09 University Of Washington Speech recognition on MPEG/Audio encoded files
US6044346A (en) * 1998-03-09 2000-03-28 Lucent Technologies Inc. System and method for operating a digital voice recognition processor with flash memory storage
US6463413B1 (en) * 1999-04-20 2002-10-08 Matsushita Electrical Industrial Co., Ltd. Speech recognition training for small hardware devices
DE10043946C2 (de) * 2000-09-06 2002-12-12 Siemens Ag Komprimieren von HMM-Prototypen
US6694294B1 (en) * 2000-10-31 2004-02-17 Qualcomm Incorporated System and method of mu-law or A-law compression of bark amplitudes for speech recognition
US6681207B2 (en) * 2001-01-12 2004-01-20 Qualcomm Incorporated System and method for lossy compression of voice recognition models

Also Published As

Publication number Publication date
CN100527224C (zh) 2009-08-12
EP1352389B1 (en) 2008-09-03
JP2004523788A (ja) 2004-08-05
US20020133345A1 (en) 2002-09-19
TW546632B (en) 2003-08-11
CN1582468A (zh) 2005-02-16
IL156891A0 (en) 2004-02-08
US6681207B2 (en) 2004-01-20
US20040098258A1 (en) 2004-05-20
WO2002059871A2 (en) 2002-08-01
EP1352389A2 (en) 2003-10-15
WO2002059871A3 (en) 2003-03-13
CA2434562A1 (en) 2002-08-01
AU2002246992A1 (en) 2002-08-06
US7136815B2 (en) 2006-11-14
ATE407421T1 (de) 2008-09-15
DE60228681D1 (de) 2008-10-16

Similar Documents

Publication Publication Date Title
BR0206413A (pt) Sistema e método para armazenamento eficiente de modelos de reconhecimento de voz
PH12019501881A1 (en) Method and apparatus for the efficient compression of genomic sequence reads
ATE410768T1 (de) System und verfahren zum betrieb eines spracherkennungssystems in einem fahrzeug
TW357313B (en) Methods and apparatus for handwriting recognition
EP0847041A3 (en) Method and apparatus for speech recognition performing noise adaptation
WO2003014866A3 (en) Method and system for dynamic learning through a regression-based library generation process
ATE426526T1 (de) System und verfahren zur auswahl eines benutzersprachprofils fur eine vorrichtung in einem fahrzeug
AU2017408800A1 (en) Method and system of mining information, electronic device and readable storable medium
DE60225170D1 (de) Verfahren und vorrichtung zum dekodieren handschriftlicher zeichen
EP1416386A3 (en) Method and apparatus for effectively enabling an out of sequence write process within a non-volatile memory system
EP2306345A3 (en) Speech retrieval apparatus and speech retrieval method
WO2004070558A3 (en) Method and apparatus to identify a work received by a processing system
WO2007138600A3 (en) Method and system for transformation of logical data objects for storage
EP1447792A3 (en) Method and apparatus for modeling a speech recognition system and for predicting word error rates from text
EP1551007A4 (en) LANGUAGE MODEL CREATION / CREATION DEVICE, VOICE RECOGNITION DEVICE, LANGUAGE MODEL CREATION METHOD, AND VOICE RECOGNITION METHOD
WO2005017807A3 (en) Apparatus and method for classifying multi-dimensional biological data
ATE407411T1 (de) Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text
TW200639691A (en) System, method and medium of automatic document classification
TW200700987A (en) Method and apparatus for performing multi-programmable function with one-time programmable memories
EA200501534A1 (ru) Способ получения экстракта из листьев плюща
DE60231422D1 (de) Verfahren und Vorrichtung zum Starten von Leseoptimierungen in Speicher-Verbindungen
EP1187098A3 (de) Komprimieren von HMM-Prototypen
DE602004001948D1 (de) Verfahren zum Trennen von Platten, die miteinander geklebt sind und eine gestapelte Struktur bilden
ATE377241T1 (de) Verfahren zur spracherkennung mit automatischen korrektur
ATE384295T1 (de) Methode und verfahren für seitenband- leseantwortkopf in einem speicher- verbindungsnetzwerk

Legal Events

Date Code Title Description
B08F Application dismissed because of non-payment of annual fees [chapter 8.6 patent gazette]

Free format text: REFERENTE A 10A ANUIDADE.

B08K Patent lapsed as no evidence of payment of the annual fee has been furnished to inpi [chapter 8.11 patent gazette]

Free format text: REFERENTE AO DESPACHO 8.6 PUBLICADO NA RPI 2168 DE 24/07/2012.