PL399698A1 - Method for selecting the complexity of a discrete acoustic model in an automatic speech recognition system - Google Patents

Method for selecting the complexity of a discrete acoustic model in an automatic speech recognition system

Info

Publication number
PL399698A1
Authority
PL
Poland
Prior art keywords
acoustic model
complexity
discrete acoustic
selecting
recognition system
Application number
PL399698A
Inventor
Marcin Kuropatwinski
Original Assignee
Voice Lab Spólka Z Ograniczona Odpowiedzialnoscia
Priority date
2012-06-27
Filing date
2012-06-27
Publication date
2014-01-07
Application filed by Voice Lab Spólka Z Ograniczona Odpowiedzialnoscia
Priority to PL399698A
Priority to US13/567,963 (US20140006021A1)
Publication of PL399698A1

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/06: Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063: Training
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/06: Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063: Training
    • G10L2015/0631: Creating reference templates; Clustering
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/08: Speech classification or search
    • G10L2015/085: Methods for reducing search complexity, pruning

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Machine Translation (AREA)

Abstract

The invention relates to a method for selecting the complexity of a discrete acoustic model in an automatic speech recognition system that comprises a discrete acoustic model, a pronunciation dictionary and, optionally, a language model or a grammar. Given a speech database comprising many pairs, each consisting of a speech recording (called the speech signal waveform) and an orthographic transcription of that waveform, the acoustic models are constructed by: converting the orthographic transcription into a phonetic one, parameterizing the waveforms by computing feature vectors, and normalizing the sequences of feature vectors. The method is characterized in that the complexity Pl of the discrete acoustic model is set according to a procedure, for an assumed generalization coefficient N.
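The abstract names normalization of feature-vector sequences as one step of model construction but does not say which normalization is used. As an illustration only, the sketch below assumes per-recording mean and variance normalization, a common choice in speech front ends; the function name `normalize_features` and the sample data are hypothetical and do not come from the patent.

```python
import math


def normalize_features(frames):
    """Per-recording mean and variance normalization (an assumed technique).

    frames: non-empty list of equal-length lists of floats, one per frame.
    Returns a new list in which every feature dimension has zero mean and,
    unless that dimension is constant, unit variance.
    """
    n = len(frames)
    dim = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    stds = []
    for d in range(dim):
        var = sum((f[d] - means[d]) ** 2 for f in frames) / n
        # Constant dimensions get a divisor of 1.0 to avoid division by zero.
        stds.append(math.sqrt(var) if var > 0 else 1.0)
    return [[(f[d] - means[d]) / stds[d] for d in range(dim)] for f in frames]


# Hypothetical 2-dimensional feature vectors for a three-frame recording.
frames = [[1.0, 10.0], [2.0, 10.0], [3.0, 10.0]]
normalized = normalize_features(frames)
```

Normalizing each recording separately removes constant offsets such as channel or microphone effects before the vectors are quantized for a discrete model; whether the patent uses this or another scheme is not stated in the abstract.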

Priority Applications (2)

Application Number Priority Date Filing Date Title
PL399698A PL399698A1 (pl) 2012-06-27 2012-06-27 Method for selecting the complexity of a discrete acoustic model in an automatic speech recognition system
US13/567,963 US20140006021A1 (en) 2012-06-27 2012-08-06 Method for adjusting discrete model complexity in an automatic speech recognition system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PL399698A PL399698A1 (pl) 2012-06-27 2012-06-27 Method for selecting the complexity of a discrete acoustic model in an automatic speech recognition system

Publications (1)

Publication Number Publication Date
PL399698A1 2014-01-07

Family

ID=49779004

Family Applications (1)

Application Number Priority Date Filing Date Title
PL399698A PL399698A1 (pl) 2012-06-27 2012-06-27 Method for selecting the complexity of a discrete acoustic model in an automatic speech recognition system

Country Status (2)

Country Link
US (1) US20140006021A1 (pl)
PL (1) PL399698A1 (pl)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113300890B (zh) * 2021-05-24 2022-06-14 同济大学 Adaptive communication method for a networked machine learning system

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5535305A (en) * 1992-12-31 1996-07-09 Apple Computer, Inc. Sub-partitioned vector quantization of probability density functions
US5794197A (en) * 1994-01-21 1998-08-11 Microsoft Corporation Senone tree representation and evaluation
JP2690027B2 (ja) * 1994-10-05 1997-12-10 株式会社エイ・ティ・アール音声翻訳通信研究所 Pattern recognition method and apparatus
US5806030A (en) * 1996-05-06 1998-09-08 Matsushita Electric Ind Co Ltd Low complexity, high accuracy clustering method for speech recognizer
US6895375B2 (en) * 2001-10-04 2005-05-17 At&T Corp. System for bandwidth extension of Narrow-band speech
US20040006470A1 (en) * 2002-07-03 2004-01-08 Pioneer Corporation Word-spotting apparatus, word-spotting method, and word-spotting program
US8214213B1 (en) * 2006-04-27 2012-07-03 At&T Intellectual Property Ii, L.P. Speech recognition based on pronunciation modeling
US7617103B2 (en) * 2006-08-25 2009-11-10 Microsoft Corporation Incrementally regulated discriminative margins in MCE training for speech recognition
US8423364B2 (en) * 2007-02-20 2013-04-16 Microsoft Corporation Generic framework for large-margin MCE training in speech recognition
US8200797B2 (en) * 2007-11-16 2012-06-12 Nec Laboratories America, Inc. Systems and methods for automatic profiling of network event sequences
RU2409897C1 (ru) * 2009-05-18 2011-01-20 Самсунг Электроникс Ко., Лтд Encoder, transmitting device, transmission system and method for encoding information objects
KR20120045582A (ko) * 2010-10-29 2012-05-09 한국전자통신연구원 Apparatus and method for generating an acoustic model

Also Published As

Publication number Publication date
US20140006021A1 (en) 2014-01-02

Similar Documents

Publication Publication Date Title
EP4235647A3 (en) Determining dialog states for language models
WO2015009586A3 (en) Performing an operation relative to tabular data based upon voice input
WO2014197334A3 (en) System and method for user-specified pronunciation of words for speech synthesis and recognition
SG11201912053XA (en) Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface
WO2009025356A1 (ja) Speech recognition device and speech recognition method
WO2015057907A3 (en) System and method for learning alternate pronunciations for speech recognition
GB2552623A (en) Systems and methods for automated evaluation of human speech
GB2551917A (en) Privacy-preserving training corpus selection
EP4312147A3 (en) Scalable dynamic class language modeling
MX2017001121A (es) Acoustic- and domain-based speech recognition for vehicles.
WO2014005142A3 (en) Modeling l1-specific phonological errors
ATE457510T1 (de) Speech recognition system with a huge vocabulary
TW200638337A (en) Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
PE20141910A1 (es) Systems and methods for language learning
EP4318463A3 (en) Multi-modal input on an electronic device
WO2014145960A3 (en) Method and system for generating advanced feature discrimination vectors for use in speech recognition
WO2016139670A8 (en) System and method for generating accurate speech transcription from natural speech audio signals
EP4235649A3 (en) Language model biasing
GB2486038B (en) Speech-to-text conversion
WO2007034478A3 (en) System and method for correcting speech
WO2018118492A3 (en) Linguistic modeling using sets of base phonetics
WO2020117639A3 (en) Text independent speaker recognition
MX2015014413A (es) Acoustic impulse response simulation.
PL399698A1 (pl) Method for selecting the complexity of a discrete acoustic model in an automatic speech recognition system
WO2016029045A3 (en) Lexical dialect analysis system