DE60036522D1 - Frequency warping for speech recognition - Google Patents
Frequency warping for speech recognition
- Publication number
- DE60036522D1
- Authority
- DE
- Germany
- Prior art keywords
- distortion
- frequencies
- speech recognition
- speech
- recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/12—Speech classification or search using dynamic programming techniques, e.g. dynamic time warping [DTW]
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Image Processing (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP30468599 | 1999-10-26 | ||
JP30468599A JP3632529B2 (ja) | 1999-10-26 | 1999-10-26 | Speech recognition apparatus and method, and recording medium |
Publications (2)
Publication Number | Publication Date |
---|---|
DE60036522D1 (de) | 2007-11-08 |
DE60036522T2 (de) | 2008-06-26 |
Family
ID=17935997
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE60036522T (Expired - Lifetime) DE60036522T2 (de) | 1999-10-26 | 2000-10-26 | Frequency warping for speech recognition |
Country Status (4)
Country | Link |
---|---|
US (1) | US6934681B1 (de) |
EP (1) | EP1096475B1 (de) |
JP (1) | JP3632529B2 (de) |
DE (1) | DE60036522T2 (de) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1312656C (zh) * | 2002-09-24 | 2007-04-25 | Matsushita Electric Industrial Co., Ltd. | Speaker normalization method and speech recognition apparatus using the method |
US20050010413A1 (en) * | 2003-05-23 | 2005-01-13 | Norsworthy Jon Byron | Voice emulation and synthesis process |
JP4194433B2 (ja) * | 2003-07-07 | 2008-12-10 | Canon Inc. | Likelihood calculation apparatus and method |
US7567903B1 (en) | 2005-01-12 | 2009-07-28 | At&T Intellectual Property Ii, L.P. | Low latency real-time vocal tract length normalization |
US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation |
US7778831B2 (en) * | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
US8010358B2 (en) * | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization |
WO2009041402A1 (ja) * | 2007-09-25 | 2009-04-02 | NEC Corporation | Frequency-axis warping coefficient estimation device, system, method, and program |
US8788256B2 (en) * | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition |
US8442833B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
US8442829B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
JP5961950B2 (ja) * | 2010-09-15 | 2016-08-03 | Yamaha Corporation | Speech processing apparatus |
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis |
US9865266B2 (en) * | 2013-02-25 | 2018-01-09 | Nuance Communications, Inc. | Method and apparatus for automated speaker parameters adaptation in a deployed speaker verification system |
CN109192193B (zh) * | 2018-08-14 | 2020-05-05 | Sichuan Hongmei Intelligent Technology Co., Ltd. | Test method and test device for speech recognition products |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5268990A (en) * | 1991-01-31 | 1993-12-07 | Sri International | Method for recognizing speech using linguistically-motivated hidden Markov models |
US5390278A (en) * | 1991-10-08 | 1995-02-14 | Bell Canada | Phoneme based speech recognition |
JPH06214596A (ja) | 1993-01-14 | 1994-08-05 | Ricoh Co Ltd | Speech recognition apparatus and speaker adaptation method |
US5664059A (en) * | 1993-04-29 | 1997-09-02 | Panasonic Technologies, Inc. | Self-learning speaker adaptation based on spectral variation source decomposition |
US5737490A (en) * | 1993-09-30 | 1998-04-07 | Apple Computer, Inc. | Method and apparatus for constructing continuous parameter fenonic hidden markov models by replacing phonetic models with continous fenonic models |
US5625749A (en) * | 1994-08-22 | 1997-04-29 | Massachusetts Institute Of Technology | Segment-based apparatus and method for speech recognition by analyzing multiple speech unit frames and modeling both temporal and spatial correlation |
US5625747A (en) * | 1994-09-21 | 1997-04-29 | Lucent Technologies Inc. | Speaker verification, speech recognition and channel normalization through dynamic time/frequency warping |
US5742928A (en) * | 1994-10-28 | 1998-04-21 | Mitsubishi Denki Kabushiki Kaisha | Apparatus and method for speech recognition in the presence of unnatural speech effects |
US5864809A (en) * | 1994-10-28 | 1999-01-26 | Mitsubishi Denki Kabushiki Kaisha | Modification of sub-phoneme speech spectral models for lombard speech recognition |
US5930753A (en) | 1997-03-20 | 1999-07-27 | At&T Corp | Combining frequency warping and spectral shaping in HMM based speech recognition |
JPH118839A (ja) | 1997-06-19 | 1999-01-12 | Matsushita Electric Ind Co Ltd | Video signal conversion device |
JP2986792B2 (ja) * | 1998-03-16 | 1999-12-06 | ATR Interpreting Telecommunications Research Laboratories | Speaker normalization processing apparatus and speech recognition apparatus |
1999
- 1999-10-26: JP application JP30468599A, patent JP3632529B2 (ja), not active, Expired - Fee Related
2000
- 2000-10-25: US application US09/695,067, patent US6934681B1 (en), not active, Expired - Lifetime
- 2000-10-26: EP application EP00123247A, patent EP1096475B1 (de), not active, Expired - Lifetime
- 2000-10-26: DE application DE60036522T, patent DE60036522T2 (de), not active, Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
JP2001125588A (ja) | 2001-05-11 |
DE60036522T2 (de) | 2008-06-26 |
EP1096475A2 (de) | 2001-05-02 |
EP1096475A3 (de) | 2001-09-12 |
EP1096475B1 (de) | 2007-09-26 |
US6934681B1 (en) | 2005-08-23 |
JP3632529B2 (ja) | 2005-03-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB9910448D0 (en) | | Cancellation of non-stationary interfering signals for speech recognition |
DE60036522D1 (de) | | Frequency warping for speech recognition |
FI19992351A (fi) | | Speech recognition |
DE69901606D1 (de) | | Wideband speech synthesis from narrowband speech signals |
DE60019229D1 (de) | | Normalization of the fundamental frequency for speech recognition |
DE69720436D1 (de) | | Noise suppression for high-frequency signals |
DE69829235D1 (de) | | Registration for speech recognition |
GB2327173B (en) | | Voice recognition of telephone conversations |
DE69506727D1 (de) | | Low-noise amplifier for microphone |
FR2781636B1 (fr) | | Adapter for earphone-microphone |
DE50208467D1 (de) | | Circuit arrangement for improving the intelligibility of audio signals containing speech |
EP1204965A4 (de) | | System and method for measuring speech distortion from signal patterns of telephone voices |
DE60042820D1 (de) | | Die-casting mould coating for rising and vacuum casting |
DE60217448D1 (de) | | Processor for audio signals |
DE60002649D1 (de) | | Loudspeaker for mobile telephone |
DE60014031D1 (de) | | Speaker recognition by correlation of spectrograms |
DE69920714D1 (de) | | Speech recognition |
DE60130532D1 (de) | | Acoustically damped cover for motor vehicles |
DE60031838D1 (de) | | Array antenna for multiple frequencies |
DE50008703D1 (de) | | Speech recognition method and device |
ID25924A (id) | | Earphone without noise from direct-current electrical waves, for protection against conductive hearing loss |
IT1310154B1 (it) | | Method for producing a speech recognizer, related recognizer, and method for speech recognition |
DE60034836D1 (de) | | Manufacturing method for surface acoustic wave devices |
DE69719260D1 (de) | | Wideband spectral quantizer for speech |
DE69939151D1 (de) | | Speaker adaptation for confusable words |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | 8364 | No opposition during term of opposition | |