ATE445896T1 - Method of speech recognition using variational inference with switching state space models - Google Patents

Method of speech recognition using variational inference with switching state space models

Info

Publication number
ATE445896T1
Authority
AT
Austria
Prior art keywords
state space
speech recognition
sequence
recognition method
changing state
Prior art date
Application number
AT04007985T
Other languages
English (en)
Inventor
Hagai Attias
Leo Jingyu Lee
Li Deng
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2003-04-01
Filing date
2004-04-01
Publication date
2009-10-15
Application filed by Microsoft Corp
Application granted
Publication of ATE445896T1

Classifications

    • B PERFORMING OPERATIONS; TRANSPORTING
    • B01 PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01D SEPARATION
    • B01D21/00 Separation of suspended solid particles from liquids by sedimentation
    • B01D21/24 Feed or discharge mechanisms for settling tanks
    • B01D21/245 Discharge mechanisms for the sediments
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • B PERFORMING OPERATIONS; TRANSPORTING
    • B01 PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01D SEPARATION
    • B01D21/00 Separation of suspended solid particles from liquids by sedimentation
    • B01D21/24 Feed or discharge mechanisms for settling tanks
    • B01D21/2433 Discharge mechanisms for floating particles
    • C CHEMISTRY; METALLURGY
    • C02 TREATMENT OF WATER, WASTE WATER, SEWAGE, OR SLUDGE
    • C02F TREATMENT OF WATER, WASTE WATER, SEWAGE, OR SLUDGE
    • C02F1/00 Treatment of water, waste water, or sewage
    • C02F1/40 Devices for separating or removing fatty or oily substances or similar floating material

Landscapes

  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Hydrology & Water Resources (AREA)
  • Environmental & Geological Engineering (AREA)
  • Water Supply & Treatment (AREA)
  • Organic Chemistry (AREA)
  • Complex Calculations (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Image Analysis (AREA)
AT04007985T 2003-04-01 2004-04-01 Method of speech recognition using variational inference with switching state space models ATE445896T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/405,166 US6931374B2 (en) 2003-04-01 2003-04-01 Method of speech recognition using variational inference with switching state space models

Publications (1)

Publication Number Publication Date
ATE445896T1 (de) 2009-10-15

Family

ID=32850610

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04007985T ATE445896T1 (de) 2003-04-01 2004-04-01 Method of speech recognition using variational inference with switching state space models

Country Status (7)

Country Link
US (2) US6931374B2 (de)
EP (1) EP1465154B1 (de)
JP (1) JP2004310098A (de)
KR (1) KR20040088368A (de)
CN (1) CN1534597A (de)
AT (1) ATE445896T1 (de)
DE (1) DE602004023555D1 (de)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6931374B2 (en) * 2003-04-01 2005-08-16 Microsoft Corporation Method of speech recognition using variational inference with switching state space models
US7424423B2 (en) * 2003-04-01 2008-09-09 Microsoft Corporation Method and apparatus for formant tracking using a residual model
US7277850B1 (en) 2003-04-02 2007-10-02 At&T Corp. System and method of word graph matrix decomposition
US7643989B2 (en) * 2003-08-29 2010-01-05 Microsoft Corporation Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal restraint
US7475011B2 (en) * 2004-08-25 2009-01-06 Microsoft Corporation Greedy algorithm for identifying values for vocal tract resonance vectors
US8938390B2 (en) 2007-01-23 2015-01-20 Lena Foundation System and method for expressive language and developmental disorder assessment
US8078465B2 (en) * 2007-01-23 2011-12-13 Lena Foundation System and method for detection and analysis of speech
US10223934B2 (en) 2004-09-16 2019-03-05 Lena Foundation Systems and methods for expressive language, developmental disorder, and emotion assessment, and contextual feedback
US9240188B2 (en) 2004-09-16 2016-01-19 Lena Foundation System and method for expressive language, developmental disorder, and emotion assessment
US9355651B2 (en) 2004-09-16 2016-05-31 Lena Foundation System and method for expressive language, developmental disorder, and emotion assessment
US7899761B2 (en) * 2005-04-25 2011-03-01 GM Global Technology Operations LLC System and method for signal prediction
US8010356B2 (en) * 2006-02-17 2011-08-30 Microsoft Corporation Parameter learning in a hidden trajectory model
US7877256B2 (en) * 2006-02-17 2011-01-25 Microsoft Corporation Time synchronous decoding for long-span hidden trajectory model
US8234116B2 (en) * 2006-08-22 2012-07-31 Microsoft Corporation Calculating cost measures between HMM acoustic models
US7805308B2 (en) * 2007-01-19 2010-09-28 Microsoft Corporation Hidden trajectory modeling with differential cepstra for speech recognition
EP2126901B1 (de) 2007-01-23 2015-07-01 Infoture, Inc. System for speech analysis
US20080256613A1 (en) 2007-03-13 2008-10-16 Grover Noel J Voice print identification portal
CN102486922B (zh) * 2010-12-03 2014-12-03 株式会社理光 Speaker recognition method, apparatus, and system
EP2608351A1 (de) * 2011-12-20 2013-06-26 ABB Research Ltd. Handling of resonances in a power transmission system
EP2736042A1 (de) 2012-11-23 2014-05-28 Samsung Electronics Co., Ltd Apparatus and method for constructing a multilingual acoustic model, and computer-readable recording medium storing a program for executing the method
US9953646B2 (en) 2014-09-02 2018-04-24 Belleau Technologies Method and system for dynamic speech recognition and tracking of prewritten script
CN107680584B (zh) * 2017-09-29 2020-08-25 百度在线网络技术(北京)有限公司 Method and apparatus for segmenting audio
US10529357B2 (en) 2017-12-07 2020-01-07 Lena Foundation Systems and methods for automatic determination of infant cry and discrimination of cry from fussiness

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5317673A (en) * 1992-06-22 1994-05-31 Sri International Method and apparatus for context-dependent estimation of multiple probability distributions of phonetic classes with multilayer perceptrons in a speech recognition system
JP3114468B2 (ja) * 1993-11-25 2000-12-04 松下電器産業株式会社 Speech recognition method
US5799272A (en) * 1996-07-01 1998-08-25 Ess Technology, Inc. Switched multiple sequence excitation model for low bit rate speech compression
JPH10111862A (ja) * 1996-08-13 1998-04-28 Fujitsu Ltd Time series analysis apparatus and method based on a recurrent neural network
US5924066A (en) * 1997-09-26 1999-07-13 U S West, Inc. System and method for classifying a speech signal
TW413795B (en) * 1999-02-26 2000-12-01 Cyberlink Corp An image processing method of 3-D head motion with three face feature points
US6678658B1 (en) * 1999-07-09 2004-01-13 The Regents Of The University Of California Speech processing using conditional observable maximum likelihood continuity mapping
US6591146B1 (en) * 1999-09-16 2003-07-08 Hewlett-Packard Development Company L.C. Method for learning switching linear dynamic system models from data
US6993462B1 (en) * 1999-09-16 2006-01-31 Hewlett-Packard Development Company, L.P. Method for motion synthesis and interpolation using switching linear dynamic system models
JP2001126056A (ja) * 1999-10-26 2001-05-11 Mitsubishi Electric Inf Technol Center America Inc Method for modeling a system operating in multiple modes, and apparatus for modeling a dynamic system operating in various modes
GB2363557A (en) * 2000-06-16 2001-12-19 At & T Lab Cambridge Ltd Method of extracting a signal from a contaminated signal
JP2002251198A (ja) * 2000-12-19 2002-09-06 Atr Onsei Gengo Tsushin Kenkyusho:Kk Speech recognition system
US6928407B2 (en) * 2002-03-29 2005-08-09 International Business Machines Corporation System and method for the automatic discovery of salient segments in speech transcripts
US6931374B2 (en) * 2003-04-01 2005-08-16 Microsoft Corporation Method of speech recognition using variational inference with switching state space models

Also Published As

Publication number Publication date
EP1465154B1 (de) 2009-10-14
EP1465154A3 (de) 2007-06-06
US20050119887A1 (en) 2005-06-02
KR20040088368A (ko) 2004-10-16
US6931374B2 (en) 2005-08-16
JP2004310098A (ja) 2004-11-04
DE602004023555D1 (de) 2009-11-26
CN1534597A (zh) 2004-10-06
US7487087B2 (en) 2009-02-03
EP1465154A2 (de) 2004-10-06
US20040199386A1 (en) 2004-10-07

Similar Documents

Publication Publication Date Title
ATE445896T1 (de) Method of speech recognition using variational inference with switching state space models
CN101069230B (zh) Predicting tone pattern information for textual information used in a communication system
CA2270326C (en) A method of and a device for speech recognition employing neural network and markov model recognition techniques
US20180137109A1 (en) Methodology for automatic multilingual speech recognition
EP1748421A3 (de) Speech input processing with emotion-based model response generation
WO2004090866A3 (en) Phonetically based speech recognition system and method
DE50209455D1 (de) Method for training or adapting a speech recognizer
KR102363324B1 (ko) Method for determining the silent portion of a mel-spectrogram, and speech synthesis system
TW201011735A (en) Method and system for generating dialogue managers with diversified dialogue acts
ATE394773T1 (de) Method of speech recognition using time-dependent interpolation and hidden dynamic value classes
EP1378885A3 (de) Word recognition apparatus, method, and program
CN108536668A (zh) Wake-word evaluation method and apparatus, storage medium, and electronic device
CN105206264B (zh) Speech synthesis method and apparatus
Flores et al. Performance comparison of natural language understanding engines in the educational domain
Young Hey Cyba: The inner workings of a virtual personal assistant
KR100321463B1 (ko) Method for selectively assigning a penalty to a probability associated with a speech recognition system
JP2002221989A5 (de)
WO2007095413A2 (en) Method and apparatus for detecting affects in speech
Fuchs et al. Learning an Artificial F0-Contour for ALT Speech.
KR102463570B1 (ko) Method for composing mel-spectrogram batches through silent-interval detection, and speech synthesis system
Choi et al. Short-utterance embedding enhancement method based on time series forecasting technique for text-independent speaker verification
Bouziane et al. Towards an objective comparison of feature extraction techniques for automatic speaker recognition systems
Martinez et al. Emotion recognition in non-structured utterances for human-robot interaction
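KR20220072593A (ko) Method for generating speech data using a silent mel-spectrogram, and speech synthesis system
JP6287754B2 (ja) Response generation apparatus, response generation method, and response generation program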

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties