ITTO20020170A1 - Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale. - Google Patents

Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale.

Info

Publication number
ITTO20020170A1
ITTO20020170A1 IT2002TO000170A ITTO20020170A ITTO20020170A1 IT TO20020170 A1 ITTO20020170 A1 IT TO20020170A1 IT 2002TO000170 A IT2002TO000170 A IT 2002TO000170A IT TO20020170 A ITTO20020170 A IT TO20020170A IT TO20020170 A1 ITTO20020170 A1 IT TO20020170A1
Authority
IT
Italy
Prior art keywords
voice recognition
units
acoustic
neural network
subset
Prior art date
Application number
IT2002TO000170A
Other languages
English (en)
Inventor
Dario Albesano
Roberto Gemello
Original Assignee
Loquendo Spa
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Loquendo Spa filed Critical Loquendo Spa
Priority to IT2002TO000170A priority Critical patent/ITTO20020170A1/it
Publication of ITTO20020170A0 publication Critical patent/ITTO20020170A0/it
Priority to AT03704595T priority patent/ATE349750T1/de
Priority to JP2003572026A priority patent/JP4275537B2/ja
Priority to US10/504,491 priority patent/US7827031B2/en
Priority to ES03704595T priority patent/ES2281622T3/es
Priority to DE60310687T priority patent/DE60310687T2/de
Priority to PT03704595T priority patent/PT1479069E/pt
Priority to EP03704595A priority patent/EP1479069B1/en
Priority to AU2003206883A priority patent/AU2003206883A1/en
Priority to PCT/EP2003/001361 priority patent/WO2003073416A1/en
Priority to CA2477525A priority patent/CA2477525C/en
Publication of ITTO20020170A1 publication Critical patent/ITTO20020170A1/it

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/16Speech classification or search using artificial neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Image Analysis (AREA)
  • Telephonic Communication Services (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
IT2002TO000170A 2002-02-28 2002-02-28 Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale. ITTO20020170A1 (it)

Priority Applications (11)

Application Number Priority Date Filing Date Title
IT2002TO000170A ITTO20020170A1 (it) 2002-02-28 2002-02-28 Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale.
CA2477525A CA2477525C (en) 2002-02-28 2003-02-12 Method for accelerating the execution of speech recognition neural networks and the related speech recognition device
ES03704595T ES2281622T3 (es) 2002-02-28 2003-02-12 Metodo para acelerar la ejecucion de redes neuronales de reconocimiento de la voz y el mecanismo de la voz relacionado.
JP2003572026A JP4275537B2 (ja) 2002-02-28 2003-02-12 音声認識ニューラルネットワークの実行を加速するための方法及び関連の音声認識装置
US10/504,491 US7827031B2 (en) 2002-02-28 2003-02-12 Method for accelerating the execution of speech recognition neural networks and the related speech recognition device
AT03704595T ATE349750T1 (de) 2002-02-28 2003-02-12 Verfahren zur beschleunigung der durchführung von spracherkennung mit neuralen netzwerken, sowie entsprechende vorrichtung
DE60310687T DE60310687T2 (de) 2002-02-28 2003-02-12 Verfahren zur beschleunigung der durchführung von spracherkennung mit neuralen netzwerken, sowie entsprechende vorrichtung
PT03704595T PT1479069E (pt) 2002-02-28 2003-02-12 Processo para acelerar a execução de redes neurais de reconhecimento da fala e o dispositivo de reconhecimento da fala associado
EP03704595A EP1479069B1 (en) 2002-02-28 2003-02-12 Method for accelerating the execution of speech recognition neural networks and the related speech recognition device
AU2003206883A AU2003206883A1 (en) 2002-02-28 2003-02-12 Method for accelerating the execution of speech recognition neural networks and the related speech recognition device
PCT/EP2003/001361 WO2003073416A1 (en) 2002-02-28 2003-02-12 Method for accelerating the execution of speech recognition neural networks and the related speech recognition device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
IT2002TO000170A ITTO20020170A1 (it) 2002-02-28 2002-02-28 Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale.

Publications (2)

Publication Number Publication Date
ITTO20020170A0 ITTO20020170A0 (it) 2002-02-28
ITTO20020170A1 true ITTO20020170A1 (it) 2003-08-28

Family

ID=27638874

Family Applications (1)

Application Number Title Priority Date Filing Date
IT2002TO000170A ITTO20020170A1 (it) 2002-02-28 2002-02-28 Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale.

Country Status (11)

Country Link
US (1) US7827031B2 (it)
EP (1) EP1479069B1 (it)
JP (1) JP4275537B2 (it)
AT (1) ATE349750T1 (it)
AU (1) AU2003206883A1 (it)
CA (1) CA2477525C (it)
DE (1) DE60310687T2 (it)
ES (1) ES2281622T3 (it)
IT (1) ITTO20020170A1 (it)
PT (1) PT1479069E (it)
WO (1) WO2003073416A1 (it)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1831870B1 (en) 2004-12-28 2008-07-30 Loquendo S.p.A. Automatic speech recognition system and method
US9721563B2 (en) 2012-06-08 2017-08-01 Apple Inc. Name recognition system
US10561361B2 (en) * 2013-10-20 2020-02-18 Massachusetts Institute Of Technology Using correlation structure of speech dynamics to detect neurological changes
US9627532B2 (en) * 2014-06-18 2017-04-18 Nuance Communications, Inc. Methods and apparatus for training an artificial neural network for use in speech recognition
KR101844932B1 (ko) 2014-09-16 2018-04-03 한국전자통신연구원 신호처리 알고리즘이 통합된 심층 신경망 기반의 음성인식 장치 및 이의 학습방법
US9668121B2 (en) 2014-09-30 2017-05-30 Apple Inc. Social reminders
US9812128B2 (en) * 2014-10-09 2017-11-07 Google Inc. Device leadership negotiation among voice interface devices
US10567477B2 (en) 2015-03-08 2020-02-18 Apple Inc. Virtual assistant continuity
US9578173B2 (en) 2015-06-05 2017-02-21 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US10366158B2 (en) * 2015-09-29 2019-07-30 Apple Inc. Efficient word encoding for recurrent neural network language models
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant
US10062378B1 (en) 2017-02-24 2018-08-28 International Business Machines Corporation Sound identification utilizing periodic indications
DK201770439A1 (en) 2017-05-11 2018-12-13 Apple Inc. Offline personal assistant
DK179745B1 (en) 2017-05-12 2019-05-01 Apple Inc. SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT
DK179496B1 (en) 2017-05-12 2019-01-15 Apple Inc. USER-SPECIFIC Acoustic Models
DK201770431A1 (en) 2017-05-15 2018-12-20 Apple Inc. Optimizing dialogue policy decisions for digital assistants using implicit feedback
DK201770432A1 (en) 2017-05-15 2018-12-21 Apple Inc. Hierarchical belief states for digital assistants
DK179560B1 (en) 2017-05-16 2019-02-18 Apple Inc. FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES
KR102606825B1 (ko) 2017-09-13 2023-11-27 삼성전자주식회사 뉴럴 네트워크 모델을 변형하는 뉴럴 네트워크 시스템, 이를 포함하는 어플리케이션 프로세서 및 뉴럴 네트워크 시스템의 동작방법
EP3564949A1 (en) * 2018-04-23 2019-11-06 Spotify AB Activation trigger processing

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5822742A (en) * 1989-05-17 1998-10-13 The United States Of America As Represented By The Secretary Of Health & Human Services Dynamically stable associative learning neural network system
US5461696A (en) * 1992-10-28 1995-10-24 Motorola, Inc. Decision directed adaptive neural network
IT1270919B (it) 1993-05-05 1997-05-16 Cselt Centro Studi Lab Telecom Sistema per il riconoscimento di parole isolate indipendente dal parlatore mediante reti neurali
US5621859A (en) * 1994-01-19 1997-04-15 Bbn Corporation Single tree method for grammar directed, very large vocabulary speech recognizer
US5745649A (en) * 1994-07-07 1998-04-28 Nynex Science & Technology Corporation Automated speech recognition using a plurality of different multilayer perception structures to model a plurality of distinct phoneme categories
US5638487A (en) * 1994-12-30 1997-06-10 Purespeech, Inc. Automatic speech recognition
IT1280816B1 (it) * 1995-03-22 1998-02-11 Cselt Centro Studi Lab Telecom Metodo per velocizzare l'esecuzione di reti neurali per il trattamento di segnali correlati.
US5749066A (en) * 1995-04-24 1998-05-05 Ericsson Messaging Systems Inc. Method and apparatus for developing a neural network for phoneme recognition
US6151592A (en) * 1995-06-07 2000-11-21 Seiko Epson Corporation Recognition apparatus using neural network, and learning method therefor
US5960391A (en) * 1995-12-13 1999-09-28 Denso Corporation Signal extraction system, system and method for speech restoration, learning method for neural network model, constructing method of neural network model, and signal processing system
US6665639B2 (en) * 1996-12-06 2003-12-16 Sensory, Inc. Speech recognition in consumer electronic products
GB9802836D0 (en) * 1998-02-10 1998-04-08 Canon Kk Pattern matching method and apparatus
US6208963B1 (en) * 1998-06-24 2001-03-27 Tony R. Martinez Method and apparatus for signal classification using a multilayer network
IT1310154B1 (it) * 1999-09-02 2002-02-11 Cselt Centro Studi Lab Telecom Procedimento per realizzare un riconoscitore vocale, relativoriconoscitore e procedimento per il riconoscimento della voce

Also Published As

Publication number Publication date
ATE349750T1 (de) 2007-01-15
EP1479069A1 (en) 2004-11-24
DE60310687T2 (de) 2007-11-08
WO2003073416A8 (en) 2005-03-24
US20050171766A1 (en) 2005-08-04
ES2281622T3 (es) 2007-10-01
US7827031B2 (en) 2010-11-02
AU2003206883A1 (en) 2003-09-09
DE60310687D1 (de) 2007-02-08
CA2477525A1 (en) 2003-09-04
EP1479069B1 (en) 2006-12-27
WO2003073416A1 (en) 2003-09-04
JP2005519314A (ja) 2005-06-30
PT1479069E (pt) 2007-04-30
JP4275537B2 (ja) 2009-06-10
CA2477525C (en) 2012-01-24
ITTO20020170A0 (it) 2002-02-28

Similar Documents

Publication Publication Date Title
ITTO20020170A1 (it) Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale.
CN107680597B (zh) 语音识别方法、装置、设备以及计算机可读存储介质
CN102509547B (zh) 基于矢量量化的声纹识别方法及系统
US20220013106A1 (en) Multi-speaker neural text-to-speech synthesis
US20050065789A1 (en) System and method with automated speech recognition engines
Chen et al. Pronunciation and silence probability modeling for ASR.
CN105118501A (zh) 语音识别的方法及系统
EP1280137B1 (en) Method for speaker identification
Bauer et al. HMM-based artificial bandwidth extension supported by neural networks
US7072750B2 (en) Method and apparatus for rejection of speech recognition results in accordance with confidence level
CN102938252A (zh) 结合韵律和发音学特征的汉语声调识别系统及方法
CN106297769B (zh) 一种应用于语种识别的鉴别性特征提取方法
Sharma et al. Speech and language recognition using MFCC and DELTA-MFCC
ATE441918T1 (de) Sprachdialogverfahren und -system
KR20040038419A (ko) 음성을 이용한 감정인식 시스템 및 감정인식 방법
KR102042344B1 (ko) 음성 유사도 판단 장치 및 음성 유사도 판단 방법
Wang et al. An experimental analysis on integrating multi-stream spectro-temporal, cepstral and pitch information for mandarin speech recognition
Nguyen et al. Vietnamese voice recognition for home automation using MFCC and DTW techniques
Wang et al. Generating TTS Based Adversarial Samples for Training Wake-Up Word Detection Systems Against Confusing Words.
Lilley et al. Unsupervised training of a DNN-based formant tracker
Sailaja et al. Text independent speaker identification with finite multivariate generalized gaussian mixture model and hierarchical clustering algorithm
CN109378004A (zh) 一种音素比对的方法、装置、设备及计算机可读存储介质
KR100596558B1 (ko) 화자인식시스템 성능 향상을 위한 특징벡터 변환방법
ATE398324T1 (de) Spracherkennung durch kontextuelle modellierung der spracheinheiten
Nofal et al. Arabic/English automatic spoken language identification