ATE445896T1 - Spracherkennungsverfahren das variationsinferenz mit veränderlichen zustandsraummodellen benuzt - Google Patents
Spracherkennungsverfahren das variationsinferenz mit veränderlichen zustandsraummodellen benuztInfo
- Publication number
- ATE445896T1 ATE445896T1 AT04007985T AT04007985T ATE445896T1 AT E445896 T1 ATE445896 T1 AT E445896T1 AT 04007985 T AT04007985 T AT 04007985T AT 04007985 T AT04007985 T AT 04007985T AT E445896 T1 ATE445896 T1 AT E445896T1
- Authority
- AT
- Austria
- Prior art keywords
- state space
- speech recognition
- sequence
- recognition method
- changing state
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 2
- 239000000203 mixture Substances 0.000 abstract 1
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01D—SEPARATION
- B01D21/00—Separation of suspended solid particles from liquids by sedimentation
- B01D21/24—Feed or discharge mechanisms for settling tanks
- B01D21/245—Discharge mechanisms for the sediments
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01D—SEPARATION
- B01D21/00—Separation of suspended solid particles from liquids by sedimentation
- B01D21/24—Feed or discharge mechanisms for settling tanks
- B01D21/2433—Discharge mechanisms for floating particles
-
- C—CHEMISTRY; METALLURGY
- C02—TREATMENT OF WATER, WASTE WATER, SEWAGE, OR SLUDGE
- C02F—TREATMENT OF WATER, WASTE WATER, SEWAGE, OR SLUDGE
- C02F1/00—Treatment of water, waste water, or sewage
- C02F1/40—Devices for separating or removing fatty or oily substances or similar floating material
Landscapes
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Probability & Statistics with Applications (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Analytical Chemistry (AREA)
- Hydrology & Water Resources (AREA)
- Environmental & Geological Engineering (AREA)
- Water Supply & Treatment (AREA)
- Organic Chemistry (AREA)
- Complex Calculations (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
- Mobile Radio Communication Systems (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Image Analysis (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/405,166 US6931374B2 (en) | 2003-04-01 | 2003-04-01 | Method of speech recognition using variational inference with switching state space models |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE445896T1 true ATE445896T1 (de) | 2009-10-15 |
Family
ID=32850610
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AT04007985T ATE445896T1 (de) | 2003-04-01 | 2004-04-01 | Spracherkennungsverfahren das variationsinferenz mit veränderlichen zustandsraummodellen benuzt |
Country Status (7)
| Country | Link |
|---|---|
| US (2) | US6931374B2 (de) |
| EP (1) | EP1465154B1 (de) |
| JP (1) | JP2004310098A (de) |
| KR (1) | KR20040088368A (de) |
| CN (1) | CN1534597A (de) |
| AT (1) | ATE445896T1 (de) |
| DE (1) | DE602004023555D1 (de) |
Families Citing this family (23)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6931374B2 (en) * | 2003-04-01 | 2005-08-16 | Microsoft Corporation | Method of speech recognition using variational inference with switching state space models |
| US7424423B2 (en) * | 2003-04-01 | 2008-09-09 | Microsoft Corporation | Method and apparatus for formant tracking using a residual model |
| US7277850B1 (en) | 2003-04-02 | 2007-10-02 | At&T Corp. | System and method of word graph matrix decomposition |
| US7643989B2 (en) * | 2003-08-29 | 2010-01-05 | Microsoft Corporation | Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal restraint |
| US7475011B2 (en) * | 2004-08-25 | 2009-01-06 | Microsoft Corporation | Greedy algorithm for identifying values for vocal tract resonance vectors |
| US8938390B2 (en) | 2007-01-23 | 2015-01-20 | Lena Foundation | System and method for expressive language and developmental disorder assessment |
| US8078465B2 (en) * | 2007-01-23 | 2011-12-13 | Lena Foundation | System and method for detection and analysis of speech |
| US10223934B2 (en) | 2004-09-16 | 2019-03-05 | Lena Foundation | Systems and methods for expressive language, developmental disorder, and emotion assessment, and contextual feedback |
| US9240188B2 (en) | 2004-09-16 | 2016-01-19 | Lena Foundation | System and method for expressive language, developmental disorder, and emotion assessment |
| US9355651B2 (en) | 2004-09-16 | 2016-05-31 | Lena Foundation | System and method for expressive language, developmental disorder, and emotion assessment |
| US7899761B2 (en) * | 2005-04-25 | 2011-03-01 | GM Global Technology Operations LLC | System and method for signal prediction |
| US8010356B2 (en) * | 2006-02-17 | 2011-08-30 | Microsoft Corporation | Parameter learning in a hidden trajectory model |
| US7877256B2 (en) * | 2006-02-17 | 2011-01-25 | Microsoft Corporation | Time synchronous decoding for long-span hidden trajectory model |
| US8234116B2 (en) * | 2006-08-22 | 2012-07-31 | Microsoft Corporation | Calculating cost measures between HMM acoustic models |
| US7805308B2 (en) * | 2007-01-19 | 2010-09-28 | Microsoft Corporation | Hidden trajectory modeling with differential cepstra for speech recognition |
| EP2126901B1 (de) | 2007-01-23 | 2015-07-01 | Infoture, Inc. | System zur sprachanalyse |
| US20080256613A1 (en) | 2007-03-13 | 2008-10-16 | Grover Noel J | Voice print identification portal |
| CN102486922B (zh) * | 2010-12-03 | 2014-12-03 | 株式会社理光 | 说话人识别方法、装置和系统 |
| EP2608351A1 (de) * | 2011-12-20 | 2013-06-26 | ABB Research Ltd. | Handhabung von Resonanzen in einem Leistungsübertragungssystem |
| EP2736042A1 (de) | 2012-11-23 | 2014-05-28 | Samsung Electronics Co., Ltd | Vorrichtung und Verfahren zur Erstellung eines mehrsprachigen akustischen Modells und computerlesbares Aufzeichnungsmedium für Speicherprogramm zur Ausführung des Verfahrens |
| US9953646B2 (en) | 2014-09-02 | 2018-04-24 | Belleau Technologies | Method and system for dynamic speech recognition and tracking of prewritten script |
| CN107680584B (zh) * | 2017-09-29 | 2020-08-25 | 百度在线网络技术(北京)有限公司 | 用于切分音频的方法和装置 |
| US10529357B2 (en) | 2017-12-07 | 2020-01-07 | Lena Foundation | Systems and methods for automatic determination of infant cry and discrimination of cry from fussiness |
Family Cites Families (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5317673A (en) * | 1992-06-22 | 1994-05-31 | Sri International | Method and apparatus for context-dependent estimation of multiple probability distributions of phonetic classes with multilayer perceptrons in a speech recognition system |
| JP3114468B2 (ja) * | 1993-11-25 | 2000-12-04 | 松下電器産業株式会社 | 音声認識方法 |
| US5799272A (en) * | 1996-07-01 | 1998-08-25 | Ess Technology, Inc. | Switched multiple sequence excitation model for low bit rate speech compression |
| JPH10111862A (ja) * | 1996-08-13 | 1998-04-28 | Fujitsu Ltd | 再帰型ニューラルネットワークに基づく時系列解析装置および方法 |
| US5924066A (en) * | 1997-09-26 | 1999-07-13 | U S West, Inc. | System and method for classifying a speech signal |
| TW413795B (en) * | 1999-02-26 | 2000-12-01 | Cyberlink Corp | An image processing method of 3-D head motion with three face feature points |
| US6678658B1 (en) * | 1999-07-09 | 2004-01-13 | The Regents Of The University Of California | Speech processing using conditional observable maximum likelihood continuity mapping |
| US6591146B1 (en) * | 1999-09-16 | 2003-07-08 | Hewlett-Packard Development Company L.C. | Method for learning switching linear dynamic system models from data |
| US6993462B1 (en) * | 1999-09-16 | 2006-01-31 | Hewlett-Packard Development Company, L.P. | Method for motion synthesis and interpolation using switching linear dynamic system models |
| JP2001126056A (ja) * | 1999-10-26 | 2001-05-11 | Mitsubishi Electric Inf Technol Center America Inc | 複数の形態で動作するシステムをモデリングするための方法および多様な形態で動作する動的システムをモデリングするための装置 |
| GB2363557A (en) * | 2000-06-16 | 2001-12-19 | At & T Lab Cambridge Ltd | Method of extracting a signal from a contaminated signal |
| JP2002251198A (ja) * | 2000-12-19 | 2002-09-06 | Atr Onsei Gengo Tsushin Kenkyusho:Kk | 音声認識システム |
| US6928407B2 (en) * | 2002-03-29 | 2005-08-09 | International Business Machines Corporation | System and method for the automatic discovery of salient segments in speech transcripts |
| US6931374B2 (en) * | 2003-04-01 | 2005-08-16 | Microsoft Corporation | Method of speech recognition using variational inference with switching state space models |
-
2003
- 2003-04-01 US US10/405,166 patent/US6931374B2/en not_active Expired - Fee Related
-
2004
- 2004-03-31 CN CNA2004100326977A patent/CN1534597A/zh active Pending
- 2004-03-31 KR KR1020040022168A patent/KR20040088368A/ko not_active Ceased
- 2004-04-01 EP EP04007985A patent/EP1465154B1/de not_active Expired - Lifetime
- 2004-04-01 AT AT04007985T patent/ATE445896T1/de not_active IP Right Cessation
- 2004-04-01 DE DE602004023555T patent/DE602004023555D1/de not_active Expired - Lifetime
- 2004-04-01 JP JP2004109419A patent/JP2004310098A/ja active Pending
- 2004-11-09 US US10/984,609 patent/US7487087B2/en not_active Expired - Fee Related
Also Published As
| Publication number | Publication date |
|---|---|
| EP1465154B1 (de) | 2009-10-14 |
| EP1465154A3 (de) | 2007-06-06 |
| US20050119887A1 (en) | 2005-06-02 |
| KR20040088368A (ko) | 2004-10-16 |
| US6931374B2 (en) | 2005-08-16 |
| JP2004310098A (ja) | 2004-11-04 |
| DE602004023555D1 (de) | 2009-11-26 |
| CN1534597A (zh) | 2004-10-06 |
| US7487087B2 (en) | 2009-02-03 |
| EP1465154A2 (de) | 2004-10-06 |
| US20040199386A1 (en) | 2004-10-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ATE445896T1 (de) | Spracherkennungsverfahren das variationsinferenz mit veränderlichen zustandsraummodellen benuzt | |
| CN101069230B (zh) | 预测通信系统中使用的文本信息的音调模式信息 | |
| CA2270326C (en) | A method of and a device for speech recognition employing neural network and markov model recognition techniques | |
| US20180137109A1 (en) | Methodology for automatic multilingual speech recognition | |
| EP1748421A3 (de) | Spracheingabeverarbeitung mit einer emotions-basierten Modell Antwort Generation | |
| WO2004090866A3 (en) | Phonetically based speech recognition system and method | |
| DE50209455D1 (de) | Verfahren zum Training oder zur Adaption eines Spracherkenners | |
| KR102363324B1 (ko) | 멜-스펙트로그램의 무음 부분을 결정하는 방법 및 음성 합성 시스템 | |
| TW201011735A (en) | Method and system for generating dialogue managers with diversified dialogue acts | |
| ATE394773T1 (de) | Verfahren zur spracherkennung mit zeitabhängiger interpolation und verborgenen dynamischen wertklassen | |
| EP1378885A3 (de) | Worterkennungsvorrichtung, -methode und -programm | |
| CN108536668A (zh) | 唤醒词评估方法及装置、存储介质、电子设备 | |
| CN105206264B (zh) | 语音合成方法和装置 | |
| Flores et al. | Performance comparison of natural language understanding engines in the educational domain | |
| Young | Hey Cyba: The inner workings of a virtual personal assistant | |
| KR100321463B1 (ko) | 음성 인식 시스템과 연관된 확률에 불이익을 선택적으로지정하는 방법 | |
| JP2002221989A5 (de) | ||
| WO2007095413A2 (en) | Method and apparatus for detecting affects in speech | |
| Fuchs et al. | Learning an Artificial F0-Contour for ALT Speech. | |
| KR102463570B1 (ko) | 무음 구간 검출을 통한 멜 스펙트로그램의 배치 구성 방법 및 음성 합성 시스템 | |
| Choi et al. | Short-utterance embedding enhancement method based on time series forecasting technique for text-independent speaker verification | |
| Bouziane et al. | Towards an objective comparison of feature extraction techniques for automatic speaker recognition systems | |
| Martinez et al. | Emotion recognition in non-structured utterances for human-robot interaction | |
| KR20220072593A (ko) | 무음 멜-스펙트로그램을 이용하여 음성 데이터를 생성하는 방법 및 음성 합성 시스템 | |
| JP6287754B2 (ja) | 応答生成装置、応答生成方法及び応答生成プログラム |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |