PL399698A1 - Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy - Google Patents
Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowyInfo
- Publication number
- PL399698A1 PL399698A1 PL399698A PL39969812A PL399698A1 PL 399698 A1 PL399698 A1 PL 399698A1 PL 399698 A PL399698 A PL 399698A PL 39969812 A PL39969812 A PL 39969812A PL 399698 A1 PL399698 A1 PL 399698A1
- Authority
- PL
- Poland
- Prior art keywords
- acoustic model
- complexity
- discrete acoustic
- selecting
- recognition system
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 3
- 239000013598 vector Substances 0.000 abstract 2
- 238000010606 normalization Methods 0.000 abstract 1
- 238000013518 transcription Methods 0.000 abstract 1
- 230000035897 transcription Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0631—Creating reference templates; Clustering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/085—Methods for reducing search complexity, pruning
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Machine Translation (AREA)
Abstract
Wynalazek dotyczy sposobu doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy, obejmujacym dyskretny model akustyczny, slownik wymowy i opcjonalnie model jezyka badz gramatyke, gdzie przy zadanej bazie danych mowy, obejmujacej wiele par, skladajacych sie z nagrania mowy zwanego przebiegiem czasowym sygnalu mowy i transkrypcji ortograficznej przebiegu czasowego, konstruuje sie modele akustyczne, poprzez: konwersje zapisu ortograficznego na fonetyczny, parametryzacje przebiegów czasowych poprzez obliczanie wektorów cech i normalizacje ciagów wektorów cech i charakteryzuje sie tym, ze zlozonosc Pl dyskretnego modelu akustycznego ustawia sie wedlug procedury, przy zalozonym wspólczynniku generalizacji N.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PL399698A PL399698A1 (pl) | 2012-06-27 | 2012-06-27 | Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy |
US13/567,963 US20140006021A1 (en) | 2012-06-27 | 2012-08-06 | Method for adjusting discrete model complexity in an automatic speech recognition system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PL399698A PL399698A1 (pl) | 2012-06-27 | 2012-06-27 | Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy |
Publications (1)
Publication Number | Publication Date |
---|---|
PL399698A1 true PL399698A1 (pl) | 2014-01-07 |
Family
ID=49779004
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PL399698A PL399698A1 (pl) | 2012-06-27 | 2012-06-27 | Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy |
Country Status (2)
Country | Link |
---|---|
US (1) | US20140006021A1 (pl) |
PL (1) | PL399698A1 (pl) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113300890B (zh) * | 2021-05-24 | 2022-06-14 | 同济大学 | 一种网络化机器学习系统的自适应通信方法 |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5535305A (en) * | 1992-12-31 | 1996-07-09 | Apple Computer, Inc. | Sub-partitioned vector quantization of probability density functions |
US5794197A (en) * | 1994-01-21 | 1998-08-11 | Micrsoft Corporation | Senone tree representation and evaluation |
JP2690027B2 (ja) * | 1994-10-05 | 1997-12-10 | 株式会社エイ・ティ・アール音声翻訳通信研究所 | パターン認識方法及び装置 |
US5806030A (en) * | 1996-05-06 | 1998-09-08 | Matsushita Electric Ind Co Ltd | Low complexity, high accuracy clustering method for speech recognizer |
US6895375B2 (en) * | 2001-10-04 | 2005-05-17 | At&T Corp. | System for bandwidth extension of Narrow-band speech |
US20040006470A1 (en) * | 2002-07-03 | 2004-01-08 | Pioneer Corporation | Word-spotting apparatus, word-spotting method, and word-spotting program |
US8214213B1 (en) * | 2006-04-27 | 2012-07-03 | At&T Intellectual Property Ii, L.P. | Speech recognition based on pronunciation modeling |
US7617103B2 (en) * | 2006-08-25 | 2009-11-10 | Microsoft Corporation | Incrementally regulated discriminative margins in MCE training for speech recognition |
US8423364B2 (en) * | 2007-02-20 | 2013-04-16 | Microsoft Corporation | Generic framework for large-margin MCE training in speech recognition |
US8200797B2 (en) * | 2007-11-16 | 2012-06-12 | Nec Laboratories America, Inc. | Systems and methods for automatic profiling of network event sequences |
RU2409897C1 (ru) * | 2009-05-18 | 2011-01-20 | Самсунг Электроникс Ко., Лтд | Кодер, передающее устройство, система передачи и способ кодирования информационных объектов |
KR20120045582A (ko) * | 2010-10-29 | 2012-05-09 | 한국전자통신연구원 | 음향 모델 생성 장치 및 방법 |
-
2012
- 2012-06-27 PL PL399698A patent/PL399698A1/pl unknown
- 2012-08-06 US US13/567,963 patent/US20140006021A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
US20140006021A1 (en) | 2014-01-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP4235647A3 (en) | Determining dialog states for language models | |
WO2015009586A3 (en) | Performing an operation relative to tabular data based upon voice input | |
WO2014197334A3 (en) | System and method for user-specified pronunciation of words for speech synthesis and recognition | |
SG11201912053XA (en) | Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface | |
WO2009025356A1 (ja) | 音声認識装置および音声認識方法 | |
WO2015057907A3 (en) | System and method for learning alternate pronunciations for speech recognition | |
GB2552623A (en) | Systems and methods for automated evaluation of human speech | |
GB2551917A (en) | Privacy-preserving training corpus selection | |
EP4312147A3 (en) | Scalable dynamic class language modeling | |
MX2017001121A (es) | Reconocimiento del habla en base a acustica y a dominio para vehiculos. | |
WO2014005142A3 (en) | Modeling l1-specific phonological errors | |
ATE457510T1 (de) | Spracherkennungssystem mit riesigem vokabular | |
TW200638337A (en) | Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system | |
PE20141910A1 (es) | Sistemas y metodos para el aprendizaje de idiomas | |
EP4318463A3 (en) | Multi-modal input on an electronic device | |
WO2014145960A3 (en) | Method and system for generating advanced feature discrimination vectors for use in speech recognition | |
WO2016139670A8 (en) | System and method for generating accurate speech transcription from natural speech audio signals | |
EP4235649A3 (en) | Language model biasing | |
GB2486038B (en) | Speech-to-text conversion | |
WO2007034478A3 (en) | System and method for correcting speech | |
WO2018118492A3 (en) | Linguistic modeling using sets of base phonetics | |
WO2020117639A3 (en) | Text independent speaker recognition | |
MX2015014413A (es) | Simulacion de respuesta al impulso acustico. | |
PL399698A1 (pl) | Sposób doboru zlozonosci dyskretnego modelu akustycznego w systemie automatycznego rozpoznawania mowy | |
WO2016029045A3 (en) | Lexical dialect analysis system |