IN2014DN10400A - - Google Patents

Download PDF

Info

Publication number
IN2014DN10400A
IN2014DN10400A IN10400DEN2014A IN2014DN10400A IN 2014DN10400 A IN2014DN10400 A IN 2014DN10400A IN 10400DEN2014 A IN10400DEN2014 A IN 10400DEN2014A IN 2014DN10400 A IN2014DN10400 A IN 2014DN10400A
Authority
IN
India
Prior art keywords
signal
analysis module
probabilities
analyzing
basis
Prior art date
Application number
Other languages
English (en)
Inventor
Bartosz ZIÓLKO
Tomasz Jadczyk
Original Assignee
Akademia Górniczo Hutnicza Im Stanislawa Staszica W Krakowie
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Akademia Górniczo Hutnicza Im Stanislawa Staszica W Krakowie filed Critical Akademia Górniczo Hutnicza Im Stanislawa Staszica W Krakowie
Publication of IN2014DN10400A publication Critical patent/IN2014DN10400A/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/16Speech classification or search using artificial neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/24Speech recognition using non-acoustical features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Probability & Statistics with Applications (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Machine Translation (AREA)
  • Telephonic Communication Services (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
IN10400DEN2014 2013-05-01 2013-06-26 IN2014DN10400A (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
PL403724A PL403724A1 (pl) 2013-05-01 2013-05-01 System rozpoznawania mowy i sposób wykorzystania dynamicznych modeli i sieci Bayesa
PCT/EP2013/063330 WO2014177232A1 (fr) 2013-05-01 2013-06-26 Système de reconnaissance de la parole et procédé d'utilisation de modèles de réseau de bayes dynamique

Publications (1)

Publication Number Publication Date
IN2014DN10400A true IN2014DN10400A (fr) 2015-08-14

Family

ID=48699782

Family Applications (1)

Application Number Title Priority Date Filing Date
IN10400DEN2014 IN2014DN10400A (fr) 2013-05-01 2013-06-26

Country Status (9)

Country Link
US (1) US9552811B2 (fr)
EP (1) EP2959475B1 (fr)
JP (1) JP2016517047A (fr)
CN (1) CN104541324B (fr)
AU (1) AU2013388411A1 (fr)
CA (1) CA2875727A1 (fr)
IN (1) IN2014DN10400A (fr)
PL (2) PL403724A1 (fr)
WO (1) WO2014177232A1 (fr)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2017532082A (ja) 2014-08-22 2017-11-02 エスアールアイ インターナショナルSRI International 患者の精神状態のスピーチベース評価のためのシステム
US10706873B2 (en) * 2015-09-18 2020-07-07 Sri International Real-time speaker state analytics platform
US9792907B2 (en) 2015-11-24 2017-10-17 Intel IP Corporation Low resource key phrase detection for wake on voice
CN105654944B (zh) * 2015-12-30 2019-11-01 中国科学院自动化研究所 一种融合了短时与长时特征建模的环境声识别方法及装置
US9972313B2 (en) * 2016-03-01 2018-05-15 Intel Corporation Intermediate scoring and rejection loopback for improved key phrase detection
US10043521B2 (en) 2016-07-01 2018-08-07 Intel IP Corporation User defined key phrase detection by user dependent sequence modeling
CN106297828B (zh) * 2016-08-12 2020-03-24 苏州驰声信息科技有限公司 一种基于深度学习的误发音检测的检测方法和装置
US10083689B2 (en) * 2016-12-23 2018-09-25 Intel Corporation Linear scoring for low power wake on voice
WO2018209608A1 (fr) * 2017-05-17 2018-11-22 Beijing Didi Infinity Technology And Development Co., Ltd. Procédé et système permettant l'identification fiable d'une langue
US10902738B2 (en) * 2017-08-03 2021-01-26 Microsoft Technology Licensing, Llc Neural models for key phrase detection and question generation
CN107729381B (zh) * 2017-09-15 2020-05-08 广州嘉影软件有限公司 基于多维特征识别的交互多媒体资源聚合方法及系统
US10714122B2 (en) 2018-06-06 2020-07-14 Intel Corporation Speech classification of audio for wake on voice
US10650807B2 (en) 2018-09-18 2020-05-12 Intel Corporation Method and system of neural network keyphrase detection
US11127394B2 (en) 2019-03-29 2021-09-21 Intel Corporation Method and system of high accuracy keyphrase detection for low resource devices
CN110838306B (zh) * 2019-11-12 2022-05-13 广州视源电子科技股份有限公司 语音信号检测方法、计算机存储介质及相关设备
US20220036087A1 (en) * 2020-07-29 2022-02-03 Optima Sports Systems S.L. Computing system and a computer-implemented method for sensing events from geospatial data
CN114612810B (zh) * 2020-11-23 2023-04-07 山东大卫国际建筑设计有限公司 一种动态自适应异常姿态识别方法及装置
CN115718536B (zh) * 2023-01-09 2023-04-18 苏州浪潮智能科技有限公司 一种调频方法、装置、电子设备及可读存储介质

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6256046B1 (en) 1997-04-18 2001-07-03 Compaq Computer Corporation Method and apparatus for visual sensing of humans for active public interfaces
US6292776B1 (en) * 1999-03-12 2001-09-18 Lucent Technologies Inc. Hierarchial subband linear predictive cepstral features for HMM-based speech recognition
US6542866B1 (en) * 1999-09-22 2003-04-01 Microsoft Corporation Speech recognition method and apparatus utilizing multiple feature streams
US7346510B2 (en) * 2002-03-19 2008-03-18 Microsoft Corporation Method of speech recognition using variables representing dynamic aspects of speech
US20030212552A1 (en) * 2002-05-09 2003-11-13 Liang Lu Hong Face recognition procedure useful for audiovisual speech recognition
AU2003275134A1 (en) * 2002-09-19 2004-04-08 The Penn State Research Foundation Prosody based audio/visual co-analysis for co-verbal gesture recognition
US7203368B2 (en) 2003-01-06 2007-04-10 Intel Corporation Embedded bayesian network for pattern recognition
US7454342B2 (en) * 2003-03-19 2008-11-18 Intel Corporation Coupled hidden Markov model (CHMM) for continuous audiovisual speech recognition
US7454336B2 (en) * 2003-06-20 2008-11-18 Microsoft Corporation Variational inference and learning for segmental switching state space models of hidden speech dynamics
JP4479191B2 (ja) * 2003-08-25 2010-06-09 カシオ計算機株式会社 音声認識装置、音声認識方法及び音声認識処理プログラム
US20050228673A1 (en) * 2004-03-30 2005-10-13 Nefian Ara V Techniques for separating and evaluating audio and video source data
JP4843987B2 (ja) * 2005-04-05 2011-12-21 ソニー株式会社 情報処理装置、情報処理方法、およびプログラム
EP2049983A2 (fr) * 2006-08-07 2009-04-22 Yeda Research And Development Co. Ltd. Importance et similarite de donnees utilisant des notes d'indication locales et mondiales
US9589380B2 (en) 2007-02-27 2017-03-07 International Business Machines Corporation Avatar-based unsolicited advertisements in a virtual universe
US8972253B2 (en) * 2010-09-15 2015-03-03 Microsoft Technology Licensing, Llc Deep belief network for large vocabulary continuous speech recognition
US9183843B2 (en) * 2011-01-07 2015-11-10 Nuance Communications, Inc. Configurable speech recognition system using multiple recognizers

Also Published As

Publication number Publication date
PL2959475T3 (pl) 2018-04-30
CN104541324A (zh) 2015-04-22
CA2875727A1 (fr) 2014-11-06
JP2016517047A (ja) 2016-06-09
EP2959475B1 (fr) 2017-02-08
CN104541324B (zh) 2019-09-13
US20160111086A1 (en) 2016-04-21
WO2014177232A1 (fr) 2014-11-06
AU2013388411A1 (en) 2015-01-22
EP2959475A1 (fr) 2015-12-30
US9552811B2 (en) 2017-01-24
PL403724A1 (pl) 2014-11-10

Similar Documents

Publication Publication Date Title
IN2014DN10400A (fr)
MX367096B (es) Discriminacion de expresiones ambiguas para mejorar la experiencia del usuario.
WO2018038385A3 (fr) Procédé de reconnaissance vocale et dispositif électronique destiné à sa mise en œuvre
WO2013009578A3 (fr) Systèmes et procédés de traitement d'instruction de paroles
WO2014140977A9 (fr) Amélioration de la reconnaissance d'entités dans les systèmes de traitement du langage naturel (nlp)
KR20180084576A (ko) 행동-인식 연결 학습 기반 의도 이해 장치, 방법 및 그 방법을 수행하기 위한 기록 매체
SG11201802373WA (en) Method and device for processing question clustering in automatic question and answering system
WO2014074698A3 (fr) Nlu/nlp distribué
EP4219071A3 (fr) Procédés et appareil pour compenser des données d'impression pour une mauvaise attention et/ou une non couverture par un propriétaire de base de données
MX2016004667A (es) Metodo y dispositivo para construir una plantilla, metodo y dispositivo para identificar informacion.
WO2015200110A3 (fr) Techniques pour une traduction automatique de langage d'un texte à partir d'une image sur la base d'informations contextuelles non textuelles de l'image
RU2014112242A (ru) Метод анализа тональности текстовых данных
GB2523973A (en) Audio analysis system and method using audio segment characterisation
WO2012100066A3 (fr) Analyse de sentiment
EP2851808A3 (fr) Processeur de langage naturel hybride
IN2014MU00919A (fr)
WO2012064408A3 (fr) Procédé pour la reconnaissance du ton/de l'intonation à l'aide d'indicateurs de l'attention d'un auditoire
MX2016005489A (es) Metodo y aparato para determinar similitud y terminal.
GB2501633A (en) A voice based system and method for data input
AU2014205024A8 (en) Methods and apparatus for identifying concepts corresponding to input information
MY185366A (en) Audio information processing method and device
WO2014105359A3 (fr) Guide d'inspection vocale
NZ700273A (en) Negative example (anti-word) based performance improvement for speech recognition
MY194297A (en) A method and device for providing search engine label
UA113173C2 (xx) Система та спосіб розпізнавання контенту програми мовлення