ATE349750T1 - Verfahren zur beschleunigung der durchführung von spracherkennung mit neuralen netzwerken, sowie entsprechende vorrichtung - Google Patents

Verfahren zur beschleunigung der durchführung von spracherkennung mit neuralen netzwerken, sowie entsprechende vorrichtung

Info

Publication number
ATE349750T1
ATE349750T1 AT03704595T AT03704595T ATE349750T1 AT E349750 T1 ATE349750 T1 AT E349750T1 AT 03704595 T AT03704595 T AT 03704595T AT 03704595 T AT03704595 T AT 03704595T AT E349750 T1 ATE349750 T1 AT E349750T1
Authority
AT
Austria
Prior art keywords
units
acoustic
neural network
subset
output level
Prior art date
Application number
AT03704595T
Other languages
English (en)
Inventor
Dario Albesano
Roberto Gemello
Original Assignee
Loquendo Spa
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Loquendo Spa filed Critical Loquendo Spa
Application granted granted Critical
Publication of ATE349750T1 publication Critical patent/ATE349750T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/16Speech classification or search using artificial neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Image Analysis (AREA)
  • Telephonic Communication Services (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
AT03704595T 2002-02-28 2003-02-12 Verfahren zur beschleunigung der durchführung von spracherkennung mit neuralen netzwerken, sowie entsprechende vorrichtung ATE349750T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
IT2002TO000170A ITTO20020170A1 (it) 2002-02-28 2002-02-28 Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale.

Publications (1)

Publication Number Publication Date
ATE349750T1 true ATE349750T1 (de) 2007-01-15

Family

ID=27638874

Family Applications (1)

Application Number Title Priority Date Filing Date
AT03704595T ATE349750T1 (de) 2002-02-28 2003-02-12 Verfahren zur beschleunigung der durchführung von spracherkennung mit neuralen netzwerken, sowie entsprechende vorrichtung

Country Status (11)

Country Link
US (1) US7827031B2 (de)
EP (1) EP1479069B1 (de)
JP (1) JP4275537B2 (de)
AT (1) ATE349750T1 (de)
AU (1) AU2003206883A1 (de)
CA (1) CA2477525C (de)
DE (1) DE60310687T2 (de)
ES (1) ES2281622T3 (de)
IT (1) ITTO20020170A1 (de)
PT (1) PT1479069E (de)
WO (1) WO2003073416A1 (de)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7912713B2 (en) 2004-12-28 2011-03-22 Loquendo S.P.A. Automatic speech recognition system and method using weighted confidence measure
US9721563B2 (en) 2012-06-08 2017-08-01 Apple Inc. Name recognition system
EP3057493B1 (de) * 2013-10-20 2020-06-24 Massachusetts Institute Of Technology Verwendung einer korrelationsstruktur von sprachdynamik zum nachweis von neurologischen veränderungen
US9627532B2 (en) * 2014-06-18 2017-04-18 Nuance Communications, Inc. Methods and apparatus for training an artificial neural network for use in speech recognition
KR101844932B1 (ko) 2014-09-16 2018-04-03 한국전자통신연구원 신호처리 알고리즘이 통합된 심층 신경망 기반의 음성인식 장치 및 이의 학습방법
US9668121B2 (en) 2014-09-30 2017-05-30 Apple Inc. Social reminders
US9812128B2 (en) * 2014-10-09 2017-11-07 Google Inc. Device leadership negotiation among voice interface devices
US10567477B2 (en) 2015-03-08 2020-02-18 Apple Inc. Virtual assistant continuity
US9578173B2 (en) 2015-06-05 2017-02-21 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US10366158B2 (en) * 2015-09-29 2019-07-30 Apple Inc. Efficient word encoding for recurrent neural network language models
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant
US10062378B1 (en) 2017-02-24 2018-08-28 International Business Machines Corporation Sound identification utilizing periodic indications
DK201770439A1 (en) 2017-05-11 2018-12-13 Apple Inc. Offline personal assistant
DK179496B1 (en) 2017-05-12 2019-01-15 Apple Inc. USER-SPECIFIC Acoustic Models
DK179745B1 (en) 2017-05-12 2019-05-01 Apple Inc. SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT
DK201770432A1 (en) 2017-05-15 2018-12-21 Apple Inc. Hierarchical belief states for digital assistants
DK201770431A1 (en) 2017-05-15 2018-12-20 Apple Inc. Optimizing dialogue policy decisions for digital assistants using implicit feedback
DK179560B1 (en) 2017-05-16 2019-02-18 Apple Inc. FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES
KR102606825B1 (ko) 2017-09-13 2023-11-27 삼성전자주식회사 뉴럴 네트워크 모델을 변형하는 뉴럴 네트워크 시스템, 이를 포함하는 어플리케이션 프로세서 및 뉴럴 네트워크 시스템의 동작방법
EP3564949A1 (de) * 2018-04-23 2019-11-06 Spotify AB Aktivierungsauslöserverarbeitung

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5822742A (en) * 1989-05-17 1998-10-13 The United States Of America As Represented By The Secretary Of Health & Human Services Dynamically stable associative learning neural network system
US5461696A (en) * 1992-10-28 1995-10-24 Motorola, Inc. Decision directed adaptive neural network
IT1270919B (it) 1993-05-05 1997-05-16 Cselt Centro Studi Lab Telecom Sistema per il riconoscimento di parole isolate indipendente dal parlatore mediante reti neurali
US5621859A (en) * 1994-01-19 1997-04-15 Bbn Corporation Single tree method for grammar directed, very large vocabulary speech recognizer
US5745649A (en) * 1994-07-07 1998-04-28 Nynex Science & Technology Corporation Automated speech recognition using a plurality of different multilayer perception structures to model a plurality of distinct phoneme categories
US5638487A (en) * 1994-12-30 1997-06-10 Purespeech, Inc. Automatic speech recognition
IT1280816B1 (it) * 1995-03-22 1998-02-11 Cselt Centro Studi Lab Telecom Metodo per velocizzare l'esecuzione di reti neurali per il trattamento di segnali correlati.
US5749066A (en) * 1995-04-24 1998-05-05 Ericsson Messaging Systems Inc. Method and apparatus for developing a neural network for phoneme recognition
US6151592A (en) * 1995-06-07 2000-11-21 Seiko Epson Corporation Recognition apparatus using neural network, and learning method therefor
US5960391A (en) * 1995-12-13 1999-09-28 Denso Corporation Signal extraction system, system and method for speech restoration, learning method for neural network model, constructing method of neural network model, and signal processing system
US6665639B2 (en) * 1996-12-06 2003-12-16 Sensory, Inc. Speech recognition in consumer electronic products
GB9802836D0 (en) * 1998-02-10 1998-04-08 Canon Kk Pattern matching method and apparatus
US6208963B1 (en) * 1998-06-24 2001-03-27 Tony R. Martinez Method and apparatus for signal classification using a multilayer network
IT1310154B1 (it) * 1999-09-02 2002-02-11 Cselt Centro Studi Lab Telecom Procedimento per realizzare un riconoscitore vocale, relativoriconoscitore e procedimento per il riconoscimento della voce

Also Published As

Publication number Publication date
DE60310687T2 (de) 2007-11-08
EP1479069B1 (de) 2006-12-27
CA2477525A1 (en) 2003-09-04
JP2005519314A (ja) 2005-06-30
PT1479069E (pt) 2007-04-30
EP1479069A1 (de) 2004-11-24
CA2477525C (en) 2012-01-24
WO2003073416A8 (en) 2005-03-24
ITTO20020170A0 (it) 2002-02-28
DE60310687D1 (de) 2007-02-08
JP4275537B2 (ja) 2009-06-10
ITTO20020170A1 (it) 2003-08-28
AU2003206883A1 (en) 2003-09-09
US7827031B2 (en) 2010-11-02
WO2003073416A1 (en) 2003-09-04
ES2281622T3 (es) 2007-10-01
US20050171766A1 (en) 2005-08-04

Similar Documents

Publication Publication Date Title
ATE349750T1 (de) Verfahren zur beschleunigung der durchführung von spracherkennung mit neuralen netzwerken, sowie entsprechende vorrichtung
CN103295575B (zh) 一种语音识别方法和客户端
DE112021001064T5 (de) Vorrichtungsgerichtete Äußerungserkennung
CN111862942B (zh) 普通话和四川话的混合语音识别模型的训练方法及系统
CN108172218A (zh) 一种语音建模方法及装置
DE69818231D1 (de) Verfahren zum diskriminativen training von spracherkennungsmodellen
EP0955628A3 (de) Verfahren und Vorrichtung zur Spracherkennung mittels sowohl eines neuralen Netzwerks als auch verborgener Markov-Modelle
ATE403213T1 (de) System und verfahren zur automatischen spracherkennung
CN101645269A (zh) 一种语种识别系统及方法
CN106653056A (zh) 基于lstm循环神经网络的基频提取模型及训练方法
CN104751227A (zh) 深度神经网络的构建方法及系统
CN113129927B (zh) 语音情绪识别方法、装置、设备及存储介质
EP1280137A1 (de) Verfahren zur Sprecheridentifikation
ATE353156T1 (de) Verfolgen von vokaltraktresonanzen unter verwendung einer zielgeführten einschränkung
ATE441918T1 (de) Sprachdialogverfahren und -system
CN112712823A (zh) 拖音的检测方法、装置、设备及存储介质
CN103226951B (zh) 基于模型顺序自适应技术的说话人确认系统创建方法
CN105023574A (zh) 一种实现合成语音增强的方法及系统
CN112309372A (zh) 基于语调的意图识别方法、装置、设备及存储介质
CN106887226A (zh) 一种基于人工智能识别的语音识别算法
Li et al. Speaker embedding extraction with multi-feature integration structure
CN110266894A (zh) 一种自动忙音检测的通话方法及系统
Lashkari et al. NMF-based cepstral features for speech emotion recognition
CN1825430A (zh) 可调适韵律的语音合成方法、装置及其对话系统
CN114155883B (zh) 基于进阶式的语音深度神经网络训练方法、装置

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties