CN1534597A - 利用具有转换状态空间模型的变化推理的语音识别方法 - Google Patents

利用具有转换状态空间模型的变化推理的语音识别方法 Download PDF

Info

Publication number
CN1534597A
CN1534597A CNA2004100326977A CN200410032697A CN1534597A CN 1534597 A CN1534597 A CN 1534597A CN A2004100326977 A CNA2004100326977 A CN A2004100326977A CN 200410032697 A CN200410032697 A CN 200410032697A CN 1534597 A CN1534597 A CN 1534597A
Authority
CN
China
Prior art keywords
voice unit
probability
parameter
frame
prime
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2004100326977A
Other languages
English (en)
Chinese (zh)
Inventor
H
H·埃笛亚斯
L·J·李
邓立
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of CN1534597A publication Critical patent/CN1534597A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01DSEPARATION
    • B01D21/00Separation of suspended solid particles from liquids by sedimentation
    • B01D21/24Feed or discharge mechanisms for settling tanks
    • B01D21/245Discharge mechanisms for the sediments
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01DSEPARATION
    • B01D21/00Separation of suspended solid particles from liquids by sedimentation
    • B01D21/24Feed or discharge mechanisms for settling tanks
    • B01D21/2433Discharge mechanisms for floating particles
    • CCHEMISTRY; METALLURGY
    • C02TREATMENT OF WATER, WASTE WATER, SEWAGE, OR SLUDGE
    • C02FTREATMENT OF WATER, WASTE WATER, SEWAGE, OR SLUDGE
    • C02F1/00Treatment of water, waste water, or sewage
    • C02F1/40Devices for separating or removing fatty or oily substances or similar floating material

Landscapes

  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Hydrology & Water Resources (AREA)
  • Environmental & Geological Engineering (AREA)
  • Water Supply & Treatment (AREA)
  • Organic Chemistry (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Complex Calculations (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Image Analysis (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
  • Mobile Radio Communication Systems (AREA)
CNA2004100326977A 2003-04-01 2004-03-31 利用具有转换状态空间模型的变化推理的语音识别方法 Pending CN1534597A (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/405,166 2003-04-01
US10/405,166 US6931374B2 (en) 2003-04-01 2003-04-01 Method of speech recognition using variational inference with switching state space models

Publications (1)

Publication Number Publication Date
CN1534597A true CN1534597A (zh) 2004-10-06

Family

ID=32850610

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2004100326977A Pending CN1534597A (zh) 2003-04-01 2004-03-31 利用具有转换状态空间模型的变化推理的语音识别方法

Country Status (7)

Country Link
US (2) US6931374B2 (de)
EP (1) EP1465154B1 (de)
JP (1) JP2004310098A (de)
KR (1) KR20040088368A (de)
CN (1) CN1534597A (de)
AT (1) ATE445896T1 (de)
DE (1) DE602004023555D1 (de)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102486922A (zh) * 2010-12-03 2012-06-06 株式会社理光 说话人识别方法、装置和系统
CN107680584A (zh) * 2017-09-29 2018-02-09 百度在线网络技术(北京)有限公司 用于切分音频的方法和装置

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7424423B2 (en) * 2003-04-01 2008-09-09 Microsoft Corporation Method and apparatus for formant tracking using a residual model
US6931374B2 (en) * 2003-04-01 2005-08-16 Microsoft Corporation Method of speech recognition using variational inference with switching state space models
US7277850B1 (en) * 2003-04-02 2007-10-02 At&T Corp. System and method of word graph matrix decomposition
US7643989B2 (en) * 2003-08-29 2010-01-05 Microsoft Corporation Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal restraint
US7475011B2 (en) * 2004-08-25 2009-01-06 Microsoft Corporation Greedy algorithm for identifying values for vocal tract resonance vectors
US8938390B2 (en) 2007-01-23 2015-01-20 Lena Foundation System and method for expressive language and developmental disorder assessment
US10223934B2 (en) 2004-09-16 2019-03-05 Lena Foundation Systems and methods for expressive language, developmental disorder, and emotion assessment, and contextual feedback
US9355651B2 (en) 2004-09-16 2016-05-31 Lena Foundation System and method for expressive language, developmental disorder, and emotion assessment
US9240188B2 (en) 2004-09-16 2016-01-19 Lena Foundation System and method for expressive language, developmental disorder, and emotion assessment
US8078465B2 (en) * 2007-01-23 2011-12-13 Lena Foundation System and method for detection and analysis of speech
US7899761B2 (en) * 2005-04-25 2011-03-01 GM Global Technology Operations LLC System and method for signal prediction
US7877256B2 (en) * 2006-02-17 2011-01-25 Microsoft Corporation Time synchronous decoding for long-span hidden trajectory model
US8010356B2 (en) 2006-02-17 2011-08-30 Microsoft Corporation Parameter learning in a hidden trajectory model
US8234116B2 (en) * 2006-08-22 2012-07-31 Microsoft Corporation Calculating cost measures between HMM acoustic models
US7805308B2 (en) * 2007-01-19 2010-09-28 Microsoft Corporation Hidden trajectory modeling with differential cepstra for speech recognition
CA2676380C (en) 2007-01-23 2015-11-24 Infoture, Inc. System and method for detection and analysis of speech
US20080256613A1 (en) 2007-03-13 2008-10-16 Grover Noel J Voice print identification portal
EP2608351A1 (de) * 2011-12-20 2013-06-26 ABB Research Ltd. Handhabung von Resonanzen in einem Leistungsübertragungssystem
EP2736042A1 (de) 2012-11-23 2014-05-28 Samsung Electronics Co., Ltd Vorrichtung und Verfahren zur Erstellung eines mehrsprachigen akustischen Modells und computerlesbares Aufzeichnungsmedium für Speicherprogramm zur Ausführung des Verfahrens
US9953646B2 (en) 2014-09-02 2018-04-24 Belleau Technologies Method and system for dynamic speech recognition and tracking of prewritten script
WO2019113477A1 (en) 2017-12-07 2019-06-13 Lena Foundation Systems and methods for automatic determination of infant cry and discrimination of cry from fussiness

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5317673A (en) * 1992-06-22 1994-05-31 Sri International Method and apparatus for context-dependent estimation of multiple probability distributions of phonetic classes with multilayer perceptrons in a speech recognition system
JP3114468B2 (ja) * 1993-11-25 2000-12-04 松下電器産業株式会社 音声認識方法
US5799272A (en) * 1996-07-01 1998-08-25 Ess Technology, Inc. Switched multiple sequence excitation model for low bit rate speech compression
JPH10111862A (ja) * 1996-08-13 1998-04-28 Fujitsu Ltd 再帰型ニューラルネットワークに基づく時系列解析装置および方法
US5924066A (en) * 1997-09-26 1999-07-13 U S West, Inc. System and method for classifying a speech signal
TW413795B (en) * 1999-02-26 2000-12-01 Cyberlink Corp An image processing method of 3-D head motion with three face feature points
US6678658B1 (en) * 1999-07-09 2004-01-13 The Regents Of The University Of California Speech processing using conditional observable maximum likelihood continuity mapping
US6993462B1 (en) * 1999-09-16 2006-01-31 Hewlett-Packard Development Company, L.P. Method for motion synthesis and interpolation using switching linear dynamic system models
US6591146B1 (en) * 1999-09-16 2003-07-08 Hewlett-Packard Development Company L.C. Method for learning switching linear dynamic system models from data
JP2001126056A (ja) * 1999-10-26 2001-05-11 Mitsubishi Electric Inf Technol Center America Inc 複数の形態で動作するシステムをモデリングするための方法および多様な形態で動作する動的システムをモデリングするための装置
GB2363557A (en) * 2000-06-16 2001-12-19 At & T Lab Cambridge Ltd Method of extracting a signal from a contaminated signal
JP2002251198A (ja) * 2000-12-19 2002-09-06 Atr Onsei Gengo Tsushin Kenkyusho:Kk 音声認識システム
US6928407B2 (en) * 2002-03-29 2005-08-09 International Business Machines Corporation System and method for the automatic discovery of salient segments in speech transcripts
US6931374B2 (en) * 2003-04-01 2005-08-16 Microsoft Corporation Method of speech recognition using variational inference with switching state space models

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102486922A (zh) * 2010-12-03 2012-06-06 株式会社理光 说话人识别方法、装置和系统
CN102486922B (zh) * 2010-12-03 2014-12-03 株式会社理光 说话人识别方法、装置和系统
CN107680584A (zh) * 2017-09-29 2018-02-09 百度在线网络技术(北京)有限公司 用于切分音频的方法和装置
CN107680584B (zh) * 2017-09-29 2020-08-25 百度在线网络技术(北京)有限公司 用于切分音频的方法和装置

Also Published As

Publication number Publication date
US6931374B2 (en) 2005-08-16
US20050119887A1 (en) 2005-06-02
EP1465154B1 (de) 2009-10-14
KR20040088368A (ko) 2004-10-16
JP2004310098A (ja) 2004-11-04
US20040199386A1 (en) 2004-10-07
DE602004023555D1 (de) 2009-11-26
EP1465154A3 (de) 2007-06-06
US7487087B2 (en) 2009-02-03
ATE445896T1 (de) 2009-10-15
EP1465154A2 (de) 2004-10-06

Similar Documents

Publication Publication Date Title
CN1534597A (zh) 利用具有转换状态空间模型的变化推理的语音识别方法
CN1296886C (zh) 语音识别系统和方法
Sudhakara et al. An Improved Goodness of Pronunciation (GoP) Measure for Pronunciation Evaluation with DNN-HMM System Considering HMM Transition Probabilities.
US8280733B2 (en) Automatic speech recognition learning using categorization and selective incorporation of user-initiated corrections
CN1178202C (zh) 用于执行说话者适应或规范化的方法
US6959276B2 (en) Including the category of environmental noise when processing speech signals
JP5072206B2 (ja) 音声分類および音声認識のための隠れ条件付確率場モデル
CN1653520A (zh) 确定和降噪相关联的不确定性的方法
CN1157712C (zh) 语音识别方法和装置
CN1169116C (zh) 语音识别装置和识别方法
KR101237799B1 (ko) 문맥 종속형 음성 인식기의 환경적 변화들에 대한 강인성을 향상하는 방법
CN1622200A (zh) 多传感语音增强方法和装置
CN1129485A (zh) 信号分析装置
CN1645476A (zh) 使用切换状态空间模型的多模变分推导的语音识别方法
JP4515054B2 (ja) 音声認識の方法および音声信号を復号化する方法
CN1667700A (zh) 使用发音图表来改进新字的发音学习
CN1462366A (zh) 说话人声音的后台学习
CN1750120A (zh) 索引设备和索引方法
CN1238058A (zh) 语音处理系统
CN1534598A (zh) 采用增量贝叶斯学习进行噪声估计的方法
CN1521729A (zh) 使用隐轨迹和隐马尔可夫模型进行语音识别的方法
CN1692405A (zh) 语音处理设备、语言处理方法、存储介质及程序
US20070129946A1 (en) High quality speech reconstruction for a dialog method and system
CN112185340B (zh) 语音合成方法、语音合成装置、存储介质与电子设备
JP2009237336A (ja) 音声認識装置及び音声認識プログラム

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
AD01 Patent right deemed abandoned

Effective date of abandoning: 20041006

C20 Patent right or utility model deemed to be abandoned or is abandoned