CN1551101B - 压缩声音模型的自适应 - Google Patents
压缩声音模型的自适应 Download PDFInfo
- Publication number
- CN1551101B CN1551101B CN2004100435508A CN200410043550A CN1551101B CN 1551101 B CN1551101 B CN 1551101B CN 2004100435508 A CN2004100435508 A CN 2004100435508A CN 200410043550 A CN200410043550 A CN 200410043550A CN 1551101 B CN1551101 B CN 1551101B
- Authority
- CN
- China
- Prior art keywords
- code
- code word
- code book
- subspace
- conversion
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- F—MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
- F24—HEATING; RANGES; VENTILATING
- F24F—AIR-CONDITIONING; AIR-HUMIDIFICATION; VENTILATION; USE OF AIR CURRENTS FOR SCREENING
- F24F1/00—Room units for air-conditioning, e.g. separate or self-contained units or units receiving primary air from a central station
- F24F1/06—Separate outdoor units, e.g. outdoor unit to be linked to a separate room comprising a compressor and a heat exchanger
- F24F1/56—Casing or covers of separate outdoor units, e.g. fan guards
-
- F—MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
- F24—HEATING; RANGES; VENTILATING
- F24F—AIR-CONDITIONING; AIR-HUMIDIFICATION; VENTILATION; USE OF AIR CURRENTS FOR SCREENING
- F24F13/00—Details common to, or for air-conditioning, air-humidification, ventilation or use of air currents for screening
- F24F13/20—Casings or covers
- F24F2013/207—Casings or covers with control knobs; Mounting controlling members or control units therein
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/285—Memory allocation or algorithm optimisation to reduce hardware requirements
Abstract
Description
Claims (19)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/438,498 | 2003-05-15 | ||
US10/438,498 US7499857B2 (en) | 2003-05-15 | 2003-05-15 | Adaptation of compressed acoustic models |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1551101A CN1551101A (zh) | 2004-12-01 |
CN1551101B true CN1551101B (zh) | 2012-04-11 |
Family
ID=33029806
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2004100435508A Expired - Fee Related CN1551101B (zh) | 2003-05-15 | 2004-05-17 | 压缩声音模型的自适应 |
Country Status (6)
Country | Link |
---|---|
US (1) | US7499857B2 (zh) |
EP (1) | EP1477966B1 (zh) |
JP (1) | JP2004341532A (zh) |
KR (1) | KR101036712B1 (zh) |
CN (1) | CN1551101B (zh) |
AT (1) | ATE531032T1 (zh) |
Families Citing this family (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE102004045979A1 (de) * | 2004-09-22 | 2006-03-30 | Siemens Ag | Verfahren zur Sprecheradaption für ein Hidden-Markov-Modell basiertes Spracherkennungssystem |
US20060136210A1 (en) * | 2004-12-16 | 2006-06-22 | Sony Corporation | System and method for tying variance vectors for speech recognition |
WO2006091811A2 (en) * | 2005-02-24 | 2006-08-31 | Braxton Ernest E | Apparatus and method for non-invasive measurement of intracranial pressure |
US7729909B2 (en) * | 2005-03-04 | 2010-06-01 | Panasonic Corporation | Block-diagonal covariance joint subspace tying and model compensation for noise robust automatic speech recognition |
US20070088552A1 (en) * | 2005-10-17 | 2007-04-19 | Nokia Corporation | Method and a device for speech recognition |
US7778831B2 (en) * | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
JP2009536805A (ja) * | 2006-05-09 | 2009-10-15 | インターデイジタル テクノロジー コーポレーション | ユニバーサル地上無線アクセスのための可変フィードバック |
US8239195B2 (en) * | 2008-09-23 | 2012-08-07 | Microsoft Corporation | Adapting a compressed model for use in speech recognition |
US8145483B2 (en) * | 2009-08-05 | 2012-03-27 | Tze Fen Li | Speech recognition method for all languages without using samples |
US20120116764A1 (en) * | 2010-11-09 | 2012-05-10 | Tze Fen Li | Speech recognition method on sentences in all languages |
US9367612B1 (en) * | 2011-11-18 | 2016-06-14 | Google Inc. | Correlation-based method for representing long-timescale structure in time-series data |
US8543398B1 (en) | 2012-02-29 | 2013-09-24 | Google Inc. | Training an automatic speech recognition system using compressed word frequencies |
US8374865B1 (en) | 2012-04-26 | 2013-02-12 | Google Inc. | Sampling training data for an automatic speech recognition system based on a benchmark classification distribution |
US8805684B1 (en) * | 2012-05-31 | 2014-08-12 | Google Inc. | Distributed speaker adaptation |
US8571859B1 (en) | 2012-05-31 | 2013-10-29 | Google Inc. | Multi-stage speaker adaptation |
US8554559B1 (en) | 2012-07-13 | 2013-10-08 | Google Inc. | Localized speech recognition with offload |
US9123333B2 (en) | 2012-09-12 | 2015-09-01 | Google Inc. | Minimum bayesian risk methods for automatic speech recognition |
US9093069B2 (en) | 2012-11-05 | 2015-07-28 | Nuance Communications, Inc. | Privacy-sensitive speech model creation via aggregation of multiple user models |
US9378729B1 (en) * | 2013-03-12 | 2016-06-28 | Amazon Technologies, Inc. | Maximum likelihood channel normalization |
US9858922B2 (en) | 2014-06-23 | 2018-01-02 | Google Inc. | Caching speech recognition scores |
US9953646B2 (en) | 2014-09-02 | 2018-04-24 | Belleau Technologies | Method and system for dynamic speech recognition and tracking of prewritten script |
US9299347B1 (en) | 2014-10-22 | 2016-03-29 | Google Inc. | Speech recognition using associative mapping |
US10511695B2 (en) * | 2015-06-23 | 2019-12-17 | Georgia Tech Research Corporation | Packet-level clustering for memory-assisted compression of network traffic |
US9786270B2 (en) | 2015-07-09 | 2017-10-10 | Google Inc. | Generating acoustic models |
KR102492318B1 (ko) | 2015-09-18 | 2023-01-26 | 삼성전자주식회사 | 모델 학습 방법 및 장치, 및 데이터 인식 방법 |
US10229672B1 (en) | 2015-12-31 | 2019-03-12 | Google Llc | Training acoustic models using connectionist temporal classification |
US20180018973A1 (en) | 2016-07-15 | 2018-01-18 | Google Inc. | Speaker verification |
US10593346B2 (en) * | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
US10706840B2 (en) | 2017-08-18 | 2020-07-07 | Google Llc | Encoder-decoder models for sequence to sequence mapping |
CA3099933A1 (en) * | 2018-05-18 | 2019-11-21 | Greeneden U.S. Holdings Ii, Llc | System and method for a multiclass approach for confidence modeling in automatic speech recognition systems |
US11620263B2 (en) * | 2020-12-17 | 2023-04-04 | EMC IP Holding Company LLC | Data compression using dictionaries |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5535305A (en) * | 1992-12-31 | 1996-07-09 | Apple Computer, Inc. | Sub-partitioned vector quantization of probability density functions |
JPH08116972A (ja) * | 1994-10-28 | 1996-05-14 | Kyowa Hakko Kogyo Co Ltd | ヒト26sプロテアソーム構成成分蛋白質 |
US5864810A (en) * | 1995-01-20 | 1999-01-26 | Sri International | Method and apparatus for speech recognition adapted to an individual speaker |
US5806029A (en) * | 1995-09-15 | 1998-09-08 | At&T Corp | Signal conditioned minimum error rate training for continuous speech recognition |
US5897616A (en) * | 1997-06-11 | 1999-04-27 | International Business Machines Corporation | Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases |
DE19912405A1 (de) * | 1999-03-19 | 2000-09-21 | Philips Corp Intellectual Pty | Bestimmung einer Regressionsklassen-Baumstruktur für Spracherkenner |
US6442519B1 (en) * | 1999-11-10 | 2002-08-27 | International Business Machines Corp. | Speaker model adaptation via network of similar users |
US6571208B1 (en) * | 1999-11-29 | 2003-05-27 | Matsushita Electric Industrial Co., Ltd. | Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training |
US7571097B2 (en) * | 2003-03-13 | 2009-08-04 | Microsoft Corporation | Method for training of subspace coded gaussian models |
-
2003
- 2003-05-15 US US10/438,498 patent/US7499857B2/en not_active Expired - Fee Related
-
2004
- 2004-04-23 AT AT04009734T patent/ATE531032T1/de not_active IP Right Cessation
- 2004-04-23 EP EP04009734A patent/EP1477966B1/en not_active Expired - Lifetime
- 2004-05-14 JP JP2004145307A patent/JP2004341532A/ja active Pending
- 2004-05-14 KR KR1020040034101A patent/KR101036712B1/ko not_active IP Right Cessation
- 2004-05-17 CN CN2004100435508A patent/CN1551101B/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN1551101A (zh) | 2004-12-01 |
KR101036712B1 (ko) | 2011-05-24 |
US7499857B2 (en) | 2009-03-03 |
ATE531032T1 (de) | 2011-11-15 |
JP2004341532A (ja) | 2004-12-02 |
US20040230424A1 (en) | 2004-11-18 |
KR20040098589A (ko) | 2004-11-20 |
EP1477966B1 (en) | 2011-10-26 |
EP1477966A2 (en) | 2004-11-17 |
EP1477966A3 (en) | 2005-01-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1551101B (zh) | 压缩声音模型的自适应 | |
US5627939A (en) | Speech recognition system and method employing data compression | |
US6256607B1 (en) | Method and apparatus for automatic recognition using features encoded with product-space vector quantization | |
US7254529B2 (en) | Method and apparatus for distribution-based language model adaptation | |
JP4913204B2 (ja) | 音声認識システムのための動的にコンフィギュレーション可能な音響モデル | |
CN100580771C (zh) | 用于子空间编码高斯模型的训练的方法 | |
Digalakis et al. | Quantization of cepstral parameters for speech recognition over the world wide web | |
US6064958A (en) | Pattern recognition scheme using probabilistic models based on mixtures distribution of discrete distribution | |
KR100924399B1 (ko) | 음성 인식 장치 및 음성 인식 방법 | |
US7813926B2 (en) | Training system for a speech recognition application | |
JP3696231B2 (ja) | 言語モデル生成蓄積装置、音声認識装置、言語モデル生成方法および音声認識方法 | |
CN1645478B (zh) | 用于音调语言的分段音调建模 | |
US20100131262A1 (en) | Speech Recognition Based on a Multilingual Acoustic Model | |
JPH0555040B2 (zh) | ||
Deena et al. | Recurrent neural network language model adaptation for multi-genre broadcast speech recognition and alignment | |
Padmanabhan et al. | Large-vocabulary speech recognition algorithms | |
US7454341B1 (en) | Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system | |
EP2867890B1 (en) | Meta-data inputs to front end processing for automatic speech recognition | |
EP2107554B1 (en) | Generation of multilingual codebooks for speech recognition | |
JPH10149192A (ja) | パターン認識方法、装置およびその記憶媒体 | |
Stadermann et al. | Flexible feature extraction and HMM design for a hybrid distributed speech recognition system in noisy environments | |
Rtischev et al. | Speaker adaptation via VQ prototype modification | |
Stadermann et al. | Comparison of standard and hybrid modeling techniques for distributed speech recognition | |
Gu et al. | Combined parameter training and reduction in tied-mixture HMM design | |
JPH11328400A (ja) | パターン認識方法およびパターン認識装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: MICROSOFT TECHNOLOGY LICENSING LLC Free format text: FORMER OWNER: MICROSOFT CORP. Effective date: 20150504 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20150504 Address after: Washington State Patentee after: Micro soft technique license Co., Ltd Address before: Washington State Patentee before: Microsoft Corp. |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20120411 Termination date: 20190517 |
|
CF01 | Termination of patent right due to non-payment of annual fee |