PL2959475T3 - A speech recognition system and a method of using dynamic Bayesian network models
- Publication number
- PL2959475T3
- Authority
- PL
- Poland
- Prior art keywords
- speech recognition
- recognition system
- bayesian network
- network models
- dynamic bayesian
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/24—Speech recognition using non-acoustical features
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Probability & Statistics with Applications (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Machine Translation (AREA)
- Telephonic Communication Services (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PL403724A PL403724A1 (pl) | 2013-05-01 | 2013-05-01 | A speech recognition system and a method of using dynamic models and Bayesian networks |
| PCT/EP2013/063330 WO2014177232A1 (en) | 2013-05-01 | 2013-06-26 | A speech recognition system and a method of using dynamic bayesian network models |
| EP13731759.0A EP2959475B1 (en) | 2013-05-01 | 2013-06-26 | A speech recognition system and a method of using dynamic bayesian network models |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| PL2959475T3 (pl) | 2018-04-30 |
Family
ID=48699782
Family Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PL403724A PL403724A1 (pl) | 2013-05-01 | 2013-05-01 | A speech recognition system and a method of using dynamic models and Bayesian networks |
| PL13731759T PL2959475T3 (pl) | 2013-05-01 | 2013-06-26 | A speech recognition system and a method of using dynamic Bayesian network models |
Family Applications Before (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PL403724A PL403724A1 (pl) | 2013-05-01 | 2013-05-01 | A speech recognition system and a method of using dynamic models and Bayesian networks |
Country Status (9)
| Country | Link |
|---|---|
| US (1) | US9552811B2 (pl) |
| EP (1) | EP2959475B1 (pl) |
| JP (1) | JP2016517047A (pl) |
| CN (1) | CN104541324B (pl) |
| AU (1) | AU2013388411A1 (pl) |
| CA (1) | CA2875727A1 (pl) |
| IN (1) | IN2014DN10400A (pl) |
| PL (2) | PL403724A1 (pl) |
| WO (1) | WO2014177232A1 (pl) |
Families Citing this family (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2017532082A (ja) | 2014-08-22 | 2017-11-02 | SRI International | System for speech-based assessment of a patient's state of mind |
| US10706873B2 (en) * | 2015-09-18 | 2020-07-07 | Sri International | Real-time speaker state analytics platform |
| US9792907B2 (en) | 2015-11-24 | 2017-10-17 | Intel IP Corporation | Low resource key phrase detection for wake on voice |
| CN105654944B (zh) * | 2015-12-30 | 2019-11-01 | 中国科学院自动化研究所 | Environmental sound recognition method and device combining short-term and long-term feature modeling |
| US9972313B2 (en) * | 2016-03-01 | 2018-05-15 | Intel Corporation | Intermediate scoring and rejection loopback for improved key phrase detection |
| US10043521B2 (en) | 2016-07-01 | 2018-08-07 | Intel IP Corporation | User defined key phrase detection by user dependent sequence modeling |
| CN106297828B (zh) * | 2016-08-12 | 2020-03-24 | 苏州驰声信息科技有限公司 | Deep-learning-based mispronunciation detection method and device |
| US10083689B2 (en) * | 2016-12-23 | 2018-09-25 | Intel Corporation | Linear scoring for low power wake on voice |
| WO2018209608A1 (en) * | 2017-05-17 | 2018-11-22 | Beijing Didi Infinity Technology And Development Co., Ltd. | Method and system for robust language identification |
| US10902738B2 (en) * | 2017-08-03 | 2021-01-26 | Microsoft Technology Licensing, Llc | Neural models for key phrase detection and question generation |
| CN107729381B (zh) * | 2017-09-15 | 2020-05-08 | 广州嘉影软件有限公司 | Interactive multimedia resource aggregation method and system based on multi-dimensional feature recognition |
| US10714122B2 (en) | 2018-06-06 | 2020-07-14 | Intel Corporation | Speech classification of audio for wake on voice |
| US10650807B2 (en) | 2018-09-18 | 2020-05-12 | Intel Corporation | Method and system of neural network keyphrase detection |
| US11127394B2 (en) | 2019-03-29 | 2021-09-21 | Intel Corporation | Method and system of high accuracy keyphrase detection for low resource devices |
| CN110838306B (zh) * | 2019-11-12 | 2022-05-13 | 广州视源电子科技股份有限公司 | Speech signal detection method, computer storage medium and related device |
| US11640713B2 (en) * | 2020-07-29 | 2023-05-02 | Optima Sports Systems S.L. | Computing system and a computer-implemented method for sensing gameplay events and augmentation of video feed with overlay |
| CN114612810B (zh) * | 2020-11-23 | 2023-04-07 | 山东大卫国际建筑设计有限公司 | Dynamic adaptive abnormal posture recognition method and device |
| CN115718536B (zh) * | 2023-01-09 | 2023-04-18 | 苏州浪潮智能科技有限公司 | Frequency adjustment method and apparatus, electronic device and readable storage medium |
Family Cites Families (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6256046B1 (en) | 1997-04-18 | 2001-07-03 | Compaq Computer Corporation | Method and apparatus for visual sensing of humans for active public interfaces |
| US6292776B1 (en) * | 1999-03-12 | 2001-09-18 | Lucent Technologies Inc. | Hierarchial subband linear predictive cepstral features for HMM-based speech recognition |
| US6542866B1 (en) * | 1999-09-22 | 2003-04-01 | Microsoft Corporation | Speech recognition method and apparatus utilizing multiple feature streams |
| US7346510B2 (en) * | 2002-03-19 | 2008-03-18 | Microsoft Corporation | Method of speech recognition using variables representing dynamic aspects of speech |
| US20030212552A1 (en) * | 2002-05-09 | 2003-11-13 | Liang Lu Hong | Face recognition procedure useful for audiovisual speech recognition |
| WO2004027685A2 (en) * | 2002-09-19 | 2004-04-01 | The Penn State Research Foundation | Prosody based audio/visual co-analysis for co-verbal gesture recognition |
| US7203368B2 (en) | 2003-01-06 | 2007-04-10 | Intel Corporation | Embedded bayesian network for pattern recognition |
| US7454342B2 (en) * | 2003-03-19 | 2008-11-18 | Intel Corporation | Coupled hidden Markov model (CHMM) for continuous audiovisual speech recognition |
| US7454336B2 (en) * | 2003-06-20 | 2008-11-18 | Microsoft Corporation | Variational inference and learning for segmental switching state space models of hidden speech dynamics |
| JP4479191B2 (ja) * | 2003-08-25 | 2010-06-09 | カシオ計算機株式会社 | Speech recognition device, speech recognition method, and speech recognition processing program |
| US20050228673A1 (en) * | 2004-03-30 | 2005-10-13 | Nefian Ara V | Techniques for separating and evaluating audio and video source data |
| JP4843987B2 (ja) * | 2005-04-05 | 2011-12-21 | ソニー株式会社 | Information processing device, information processing method, and program |
| US8200648B2 (en) * | 2006-08-07 | 2012-06-12 | Yeda Research & Development Co. Ltd. At The Weizmann Institute Of Science | Data similarity and importance using local and global evidence scores |
| US9589380B2 (en) | 2007-02-27 | 2017-03-07 | International Business Machines Corporation | Avatar-based unsolicited advertisements in a virtual universe |
| US8972253B2 (en) * | 2010-09-15 | 2015-03-03 | Microsoft Technology Licensing, Llc | Deep belief network for large vocabulary continuous speech recognition |
| US9183843B2 (en) * | 2011-01-07 | 2015-11-10 | Nuance Communications, Inc. | Configurable speech recognition system using multiple recognizers |
-
2013
- 2013-05-01: PL application PL403724A, published as PL403724A1 (pl); status: unknown
- 2013-06-26: AU application AU2013388411A, published as AU2013388411A1 (en); status: not active, abandoned
- 2013-06-26: CN application CN201380031695.3A, published as CN104541324B (zh); status: not active, expired (fee related)
- 2013-06-26: CA application CA2875727A, published as CA2875727A1 (en); status: not active, abandoned
- 2013-06-26: WO application PCT/EP2013/063330, published as WO2014177232A1 (en); status: not active, ceased
- 2013-06-26: US application US14/408,964, published as US9552811B2 (en); status: not active, expired (fee related)
- 2013-06-26: PL application PL13731759T, published as PL2959475T3 (pl); status: unknown
- 2013-06-26: IN application IN10400DEN2014, published as IN2014DN10400A (en); status: unknown
- 2013-06-26: JP application JP2016510953A, published as JP2016517047A (ja); status: active, pending
- 2013-06-26: EP application EP13731759.0A, published as EP2959475B1 (en); status: not active, not in force
Also Published As
| Publication number | Publication date |
|---|---|
| IN2014DN10400A (pl) | 2015-08-14 |
| JP2016517047A (ja) | 2016-06-09 |
| PL403724A1 (pl) | 2014-11-10 |
| CN104541324B (zh) | 2019-09-13 |
| CA2875727A1 (en) | 2014-11-06 |
| CN104541324A (zh) | 2015-04-22 |
| US9552811B2 (en) | 2017-01-24 |
| EP2959475A1 (en) | 2015-12-30 |
| WO2014177232A1 (en) | 2014-11-06 |
| AU2013388411A1 (en) | 2015-01-22 |
| EP2959475B1 (en) | 2017-02-08 |
| US20160111086A1 (en) | 2016-04-21 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| PL2959475T3 (pl) | A speech recognition system and a method of using dynamic Bayesian network models | |
| GB2517503B (en) | A speech processing system and method | |
| TWI560697B (en) | Method for building acoustic model, speech recognition method and electronic apparatus | |
| SG11201505403SA (en) | Method and system for recognizing speech commands | |
| SG11201505405TA (en) | Method and system for automatic speech recognition | |
| GB201315142D0 (en) | Audio-Visual Dialogue System and Method | |
| EP2973414A4 (en) | System and method for generation of a room model | |
| SG11201505402RA (en) | Method and system for automatic speech recognition | |
| EP3193328A4 (en) | Method and device for performing voice recognition using grammar model | |
| GB2518512B (en) | Speech recognition system | |
| IL245320A0 (en) | System and methods for facial representation | |
| EP3349125A4 (en) | Language model generation device, language model generation method and program therefor, voice recognition device, and voice recognition method and program therefor | |
| GB201403945D0 (en) | System and method of using a signed guid | |
| EP3077928A4 (en) | Systems and methods of modeling object networks | |
| GB201402736D0 (en) | Method of training a neural network | |
| EP3079120A4 (en) | System for guiding flow of people and method for guiding flow of people | |
| EP3124432A4 (en) | Hydrogen generation system and hydrogen generation method | |
| EP3053014A4 (en) | Method of recognizing multi-gaze and apparatus therefor | |
| PT3022891T (pt) | Sistema de rede telefónica e método | |
| IL252238A0 (en) | A system and method for providing and executing a specific language complex for cloud services infrastructure | |
| EP3211637A4 (en) | Speech synthesis device and method | |
| GB201322377D0 (en) | Method and apparatus for automatic speech recognition | |
| TWI560635B (en) | System and method for rating and selecting models | |
| GB201412239D0 (en) | Method and apparatus to provide surroundings awareness using sound recognition | |
| SG11201510254VA (en) | Method and system for human motion recognition |