DE60036522D1 - Verziehung der Frequenzen für Spracherkennung - Google Patents

Verziehung der Frequenzen für Spracherkennung

Info

Publication number
DE60036522D1
DE60036522D1 DE60036522T DE60036522T DE60036522D1 DE 60036522 D1 DE60036522 D1 DE 60036522D1 DE 60036522 T DE60036522 T DE 60036522T DE 60036522 T DE60036522 T DE 60036522T DE 60036522 D1 DE60036522 D1 DE 60036522D1
Authority
DE
Germany
Prior art keywords
distortion
frequencies
speech recognition
speech
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60036522T
Other languages
English (en)
Other versions
DE60036522T2 (de
Inventor
Tadashi Emori
Koichi Shinoda
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Publication of DE60036522D1 publication Critical patent/DE60036522D1/de
Application granted granted Critical
Publication of DE60036522T2 publication Critical patent/DE60036522T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/12Speech classification or search using dynamic programming techniques, e.g. dynamic time warping [DTW]

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Processing (AREA)
DE60036522T 1999-10-26 2000-10-26 Verziehung der Frequenzen für Spracherkennung Expired - Lifetime DE60036522T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP30468599 1999-10-26
JP30468599A JP3632529B2 (ja) 1999-10-26 1999-10-26 音声認識装置及び方法ならびに記録媒体

Publications (2)

Publication Number Publication Date
DE60036522D1 true DE60036522D1 (de) 2007-11-08
DE60036522T2 DE60036522T2 (de) 2008-06-26

Family

ID=17935997

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60036522T Expired - Lifetime DE60036522T2 (de) 1999-10-26 2000-10-26 Verziehung der Frequenzen für Spracherkennung

Country Status (4)

Country Link
US (1) US6934681B1 (de)
EP (1) EP1096475B1 (de)
JP (1) JP3632529B2 (de)
DE (1) DE60036522T2 (de)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1312656C (zh) * 2002-09-24 2007-04-25 松下电器产业株式会社 说话人标准化方法及用该方法的语音识别装置
US20050010413A1 (en) * 2003-05-23 2005-01-13 Norsworthy Jon Byron Voice emulation and synthesis process
JP4194433B2 (ja) * 2003-07-07 2008-12-10 キヤノン株式会社 尤度算出装置および方法
US7567903B1 (en) 2005-01-12 2009-07-28 At&T Intellectual Property Ii, L.P. Low latency real-time vocal tract length normalization
US7970613B2 (en) 2005-11-12 2011-06-28 Sony Computer Entertainment Inc. Method and system for Gaussian probability data bit reduction and computation
US7778831B2 (en) * 2006-02-21 2010-08-17 Sony Computer Entertainment Inc. Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
US8010358B2 (en) * 2006-02-21 2011-08-30 Sony Computer Entertainment Inc. Voice recognition with parallel gender and age normalization
WO2009041402A1 (ja) * 2007-09-25 2009-04-02 Nec Corporation 周波数軸伸縮係数推定装置とシステム方法並びにプログラム
US8788256B2 (en) * 2009-02-17 2014-07-22 Sony Computer Entertainment Inc. Multiple language voice recognition
US8442833B2 (en) * 2009-02-17 2013-05-14 Sony Computer Entertainment Inc. Speech processing with source location estimation using signals from two or more microphones
US8442829B2 (en) * 2009-02-17 2013-05-14 Sony Computer Entertainment Inc. Automatic computation streaming partition for voice recognition on multiple processors with limited memory
JP5961950B2 (ja) * 2010-09-15 2016-08-03 ヤマハ株式会社 音声処理装置
US9153235B2 (en) 2012-04-09 2015-10-06 Sony Computer Entertainment Inc. Text dependent speaker recognition with long-term feature based on functional data analysis
US9865266B2 (en) * 2013-02-25 2018-01-09 Nuance Communications, Inc. Method and apparatus for automated speaker parameters adaptation in a deployed speaker verification system
CN109192193B (zh) * 2018-08-14 2020-05-05 四川虹美智能科技有限公司 一种语音识别产品测试方法和测试装置

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5268990A (en) * 1991-01-31 1993-12-07 Sri International Method for recognizing speech using linguistically-motivated hidden Markov models
US5390278A (en) * 1991-10-08 1995-02-14 Bell Canada Phoneme based speech recognition
JPH06214596A (ja) 1993-01-14 1994-08-05 Ricoh Co Ltd 音声認識装置および話者適応化方法
US5664059A (en) * 1993-04-29 1997-09-02 Panasonic Technologies, Inc. Self-learning speaker adaptation based on spectral variation source decomposition
US5737490A (en) * 1993-09-30 1998-04-07 Apple Computer, Inc. Method and apparatus for constructing continuous parameter fenonic hidden markov models by replacing phonetic models with continous fenonic models
US5625749A (en) * 1994-08-22 1997-04-29 Massachusetts Institute Of Technology Segment-based apparatus and method for speech recognition by analyzing multiple speech unit frames and modeling both temporal and spatial correlation
US5625747A (en) * 1994-09-21 1997-04-29 Lucent Technologies Inc. Speaker verification, speech recognition and channel normalization through dynamic time/frequency warping
US5742928A (en) * 1994-10-28 1998-04-21 Mitsubishi Denki Kabushiki Kaisha Apparatus and method for speech recognition in the presence of unnatural speech effects
US5864809A (en) * 1994-10-28 1999-01-26 Mitsubishi Denki Kabushiki Kaisha Modification of sub-phoneme speech spectral models for lombard speech recognition
US5930753A (en) 1997-03-20 1999-07-27 At&T Corp Combining frequency warping and spectral shaping in HMM based speech recognition
JPH118839A (ja) 1997-06-19 1999-01-12 Matsushita Electric Ind Co Ltd 映像信号変換装置
JP2986792B2 (ja) * 1998-03-16 1999-12-06 株式会社エイ・ティ・アール音声翻訳通信研究所 話者正規化処理装置及び音声認識装置

Also Published As

Publication number Publication date
JP2001125588A (ja) 2001-05-11
DE60036522T2 (de) 2008-06-26
EP1096475A2 (de) 2001-05-02
EP1096475A3 (de) 2001-09-12
EP1096475B1 (de) 2007-09-26
US6934681B1 (en) 2005-08-23
JP3632529B2 (ja) 2005-03-23

Similar Documents

Publication Publication Date Title
GB9910448D0 (en) Cancellation of non-stationary interfering signals for speech recognition
DE60036522D1 (de) Verziehung der Frequenzen für Spracherkennung
FI19992351A (fi) Puheentunnistus
DE69901606D1 (de) Breitbandsprachsynthese von schmalbandigen sprachsignalen
DE60019229D1 (de) Normalisierung der Grundfrequenz zur Spracherkennung
DE69720436D1 (de) Rauschunterdrückung für hochfrequenzsignale
DE69829235D1 (de) Registrierung für die Spracherkennung
GB2327173B (en) Voice recognition of telephone conversations
DE69506727D1 (de) Geräuscharme verstärker für mikrofon
FR2781636B1 (fr) Adaptateur pour ecouteur-microphone
DE50208467D1 (de) Schaltungsanordnung zur Verbesserung der Verständlichkeit von Sprache enthaltenden Audiosignalen
EP1204965A4 (de) System und verfahren zur messung von sprachverzerrungen aus signalmustern von fernsprechstimmen
DE60042820D1 (de) Druckgussformbeschichtung für steigendes- und unterdruckgiessen
DE60217448D1 (de) Prozessor für Audiosignale
DE60002649D1 (de) Lautsprecher für mobile Telefon
DE60014031D1 (de) Sprachererkennung durch korrelierung von spektrogrammen
DE69920714D1 (de) Spracherkennung
DE60130532D1 (de) Akustisch gedämpfer Deckel für Kraftwagen
DE60031838D1 (de) Gruppenantenne für mehrere frequenzen
DE50008703D1 (de) Spracherkennungsverfahren und -einrichtung
ID25924A (id) Earphone tanpa suara bising dari gelombang listrik searah untuk melindungi kehilangan hantaran pendengaran
IT1310154B1 (it) Procedimento per realizzare un riconoscitore vocale, relativoriconoscitore e procedimento per il riconoscimento della voce
DE60034836D1 (de) Herstellungsverfahren für akustische Oberflächenwellenanordnungen
DE69719260D1 (de) Breitbandiger Spektralquantisierer für Sprache
DE69939151D1 (de) Sprecheradaption für verwechselbare Wörter

Legal Events

Date Code Title Description
8364 No opposition during term of opposition