EP0847041A3 - Verfahren und Vorrichtung zur Spracherkennung mit Rauschadaptierung - Google Patents
Verfahren und Vorrichtung zur Spracherkennung mit Rauschadaptierung Download PDFInfo
- Publication number
- EP0847041A3 EP0847041A3 EP97309678A EP97309678A EP0847041A3 EP 0847041 A3 EP0847041 A3 EP 0847041A3 EP 97309678 A EP97309678 A EP 97309678A EP 97309678 A EP97309678 A EP 97309678A EP 0847041 A3 EP0847041 A3 EP 0847041A3
- Authority
- EP
- European Patent Office
- Prior art keywords
- noise
- model
- speech
- distribution
- pmc
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000006978 adaptation Effects 0.000 title 1
- 238000009826 distribution Methods 0.000 abstract 8
- 239000002131 composite material Substances 0.000 abstract 5
- 238000004519 manufacturing process Methods 0.000 abstract 3
- 238000006243 chemical reaction Methods 0.000 abstract 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP33629196 | 1996-12-03 | ||
JP8336291A JPH10161692A (ja) | 1996-12-03 | 1996-12-03 | 音声認識装置及び音声認識方法 |
JP336291/96 | 1996-12-03 |
Publications (3)
Publication Number | Publication Date |
---|---|
EP0847041A2 EP0847041A2 (de) | 1998-06-10 |
EP0847041A3 true EP0847041A3 (de) | 1999-02-03 |
EP0847041B1 EP0847041B1 (de) | 2003-09-24 |
Family
ID=18297591
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP97309678A Expired - Lifetime EP0847041B1 (de) | 1996-12-03 | 1997-12-02 | Verfahren und Vorrichtung zur Spracherkennung mit Rauschadaptierung |
Country Status (4)
Country | Link |
---|---|
US (1) | US5956679A (de) |
EP (1) | EP0847041B1 (de) |
JP (1) | JPH10161692A (de) |
DE (1) | DE69725106T2 (de) |
Families Citing this family (43)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH10254486A (ja) | 1997-03-13 | 1998-09-25 | Canon Inc | 音声認識装置および方法 |
JP2000047696A (ja) | 1998-07-29 | 2000-02-18 | Canon Inc | 情報処理方法及び装置、その記憶媒体 |
JP3969908B2 (ja) | 1999-09-14 | 2007-09-05 | キヤノン株式会社 | 音声入力端末器、音声認識装置、音声通信システム及び音声通信方法 |
JP4632384B2 (ja) * | 2000-03-31 | 2011-02-16 | キヤノン株式会社 | 音声情報処理装置及びその方法と記憶媒体 |
JP2001282278A (ja) * | 2000-03-31 | 2001-10-12 | Canon Inc | 音声情報処理装置及びその方法と記憶媒体 |
US7039588B2 (en) * | 2000-03-31 | 2006-05-02 | Canon Kabushiki Kaisha | Synthesis unit selection apparatus and method, and storage medium |
JP3728172B2 (ja) | 2000-03-31 | 2005-12-21 | キヤノン株式会社 | 音声合成方法および装置 |
JP3814459B2 (ja) | 2000-03-31 | 2006-08-30 | キヤノン株式会社 | 音声認識方法及び装置と記憶媒体 |
JP3728177B2 (ja) | 2000-05-24 | 2005-12-21 | キヤノン株式会社 | 音声処理システム、装置、方法及び記憶媒体 |
JP2002091478A (ja) * | 2000-09-18 | 2002-03-27 | Pioneer Electronic Corp | 音声認識システム |
JP4297602B2 (ja) * | 2000-09-18 | 2009-07-15 | パイオニア株式会社 | 音声認識システム |
JP3774698B2 (ja) * | 2000-10-11 | 2006-05-17 | キヤノン株式会社 | 情報処理装置、情報処理方法及び記憶媒体 |
US7451085B2 (en) * | 2000-10-13 | 2008-11-11 | At&T Intellectual Property Ii, L.P. | System and method for providing a compensated speech recognition model for speech recognition |
JP2002236494A (ja) * | 2001-02-09 | 2002-08-23 | Denso Corp | 音声区間判別装置、音声認識装置、プログラム及び記録媒体 |
JP2002268681A (ja) * | 2001-03-08 | 2002-09-20 | Canon Inc | 音声認識システム及び方法及び該システムに用いる情報処理装置とその方法 |
US7319954B2 (en) * | 2001-03-14 | 2008-01-15 | International Business Machines Corporation | Multi-channel codebook dependent compensation |
US6985858B2 (en) * | 2001-03-20 | 2006-01-10 | Microsoft Corporation | Method and apparatus for removing noise from feature vectors |
US20030033143A1 (en) * | 2001-08-13 | 2003-02-13 | Hagai Aronowitz | Decreasing noise sensitivity in speech processing under adverse conditions |
US7120580B2 (en) * | 2001-08-15 | 2006-10-10 | Sri International | Method and apparatus for recognizing speech in a noisy environment |
US6998068B2 (en) * | 2003-08-15 | 2006-02-14 | 3M Innovative Properties Company | Acene-thiophene semiconductors |
US6950796B2 (en) * | 2001-11-05 | 2005-09-27 | Motorola, Inc. | Speech recognition by dynamical noise model adaptation |
JP3542578B2 (ja) * | 2001-11-22 | 2004-07-14 | キヤノン株式会社 | 音声認識装置及びその方法、プログラム |
US7209881B2 (en) | 2001-12-20 | 2007-04-24 | Matsushita Electric Industrial Co., Ltd. | Preparing acoustic models by sufficient statistics and noise-superimposed speech data |
JP4061094B2 (ja) * | 2002-03-15 | 2008-03-12 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 音声認識装置、その音声認識方法及びプログラム |
JP3885002B2 (ja) * | 2002-06-28 | 2007-02-21 | キヤノン株式会社 | 情報処理装置およびその方法 |
JP4109063B2 (ja) * | 2002-09-18 | 2008-06-25 | パイオニア株式会社 | 音声認識装置及び音声認識方法 |
US20040064314A1 (en) * | 2002-09-27 | 2004-04-01 | Aubert Nicolas De Saint | Methods and apparatus for speech end-point detection |
JP4217495B2 (ja) * | 2003-01-29 | 2009-02-04 | キヤノン株式会社 | 音声認識辞書作成方法、音声認識辞書作成装置及びプログラム、記録媒体 |
JP4357867B2 (ja) * | 2003-04-25 | 2009-11-04 | パイオニア株式会社 | 音声認識装置、音声認識方法、並びに、音声認識プログラムおよびそれを記録した記録媒体 |
JP3836815B2 (ja) * | 2003-05-21 | 2006-10-25 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 音声認識装置、音声認識方法、該音声認識方法をコンピュータに対して実行させるためのコンピュータ実行可能なプログラムおよび記憶媒体 |
US7109519B2 (en) * | 2003-07-15 | 2006-09-19 | 3M Innovative Properties Company | Bis(2-acenyl)acetylene semiconductors |
US20070124143A1 (en) * | 2003-10-08 | 2007-05-31 | Koninkijkle Phillips Electronics, N.V. | Adaptation of environment mismatch for speech recognition systems |
JP2005249816A (ja) * | 2004-03-01 | 2005-09-15 | Internatl Business Mach Corp <Ibm> | 信号強調装置、方法及びプログラム、並びに音声認識装置、方法及びプログラム |
DE102004012209A1 (de) * | 2004-03-12 | 2005-10-06 | Siemens Ag | Durch einen Benutzer steuerbare oder durch externe Parameter beeinflussbare Geräuschreduktion |
JP4587160B2 (ja) * | 2004-03-26 | 2010-11-24 | キヤノン株式会社 | 信号処理装置および方法 |
JP4340686B2 (ja) * | 2004-03-31 | 2009-10-07 | パイオニア株式会社 | 音声認識装置及び音声認識方法 |
JP4510517B2 (ja) * | 2004-05-26 | 2010-07-28 | 日本電信電話株式会社 | 音響モデル雑音適応化方法およびこの方法を実施する装置 |
JP5992133B2 (ja) * | 2004-10-01 | 2016-09-14 | メルク パテント ゲーエムベーハー | 有機半導体を含む電子デバイス |
JP4822829B2 (ja) * | 2005-12-14 | 2011-11-24 | キヤノン株式会社 | 音声認識装置および方法 |
JP5286667B2 (ja) * | 2006-02-22 | 2013-09-11 | コニカミノルタ株式会社 | 映像表示装置、及び映像表示方法 |
JP4245617B2 (ja) * | 2006-04-06 | 2009-03-25 | 株式会社東芝 | 特徴量補正装置、特徴量補正方法および特徴量補正プログラム |
US8615393B2 (en) * | 2006-11-15 | 2013-12-24 | Microsoft Corporation | Noise suppressor for speech recognition |
CN101887725A (zh) * | 2010-04-30 | 2010-11-17 | 中国科学院声学研究所 | 一种基于音素混淆网络的音素后验概率计算方法 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3397372B2 (ja) * | 1993-06-16 | 2003-04-14 | キヤノン株式会社 | 音声認識方法及び装置 |
JP3581401B2 (ja) * | 1994-10-07 | 2004-10-27 | キヤノン株式会社 | 音声認識方法 |
JP3453456B2 (ja) * | 1995-06-19 | 2003-10-06 | キヤノン株式会社 | 状態共有モデルの設計方法及び装置ならびにその状態共有モデルを用いた音声認識方法および装置 |
-
1996
- 1996-12-03 JP JP8336291A patent/JPH10161692A/ja active Pending
-
1997
- 1997-12-02 DE DE69725106T patent/DE69725106T2/de not_active Expired - Fee Related
- 1997-12-02 EP EP97309678A patent/EP0847041B1/de not_active Expired - Lifetime
- 1997-12-02 US US08/982,385 patent/US5956679A/en not_active Expired - Fee Related
Non-Patent Citations (4)
Title |
---|
GALES M J F ET AL: "A FAST AND FLEXIBLE IMPLEMENTATION OF PARALLEL MODEL COMBINATION", PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), DETROIT, MAY 9 - 12, 1995 SPEECH, vol. 1, 9 May 1995 (1995-05-09), INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS, pages 133 - 136, XP000657948 * |
GALES M J F ET AL: "CEPSTRAL PARAMETER COMPENSATION FOR HMM RECOGNITION IN NOISE", SPEECH COMMUNICATION, vol. 12, no. 3, 1 July 1993 (1993-07-01), pages 231 - 239, XP000393642 * |
GALES M J F ET AL: "ROBUST SPEECH RECOGNITION IN ADDITIVE AND CONVOLUTIONAL NOISE USINGPARALLEL MODEL COMBINATION", COMPUTER SPEECH AND LANGUAGE, vol. 9, no. 4, October 1995 (1995-10-01), pages 289 - 307, XP000640904 * |
YAMAMOTO H ET AL: "Independent calculation of power parameters on PMC method", 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING CONFERENCE PROCEEDINGS (CAT. NO.96CH35903), 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING CONFERENCE PROCEEDINGS, ATLANTA, GA, USA, 7-10 M, ISBN 0-7803-3192-3, 1996, New York, NY, USA, IEEE, USA, pages 41 - 44 vol. 1, XP002086455 * |
Also Published As
Publication number | Publication date |
---|---|
JPH10161692A (ja) | 1998-06-19 |
DE69725106D1 (de) | 2003-10-30 |
EP0847041B1 (de) | 2003-09-24 |
DE69725106T2 (de) | 2004-04-29 |
US5956679A (en) | 1999-09-21 |
EP0847041A2 (de) | 1998-06-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0847041A3 (de) | Verfahren und Vorrichtung zur Spracherkennung mit Rauschadaptierung | |
EP0736857A3 (de) | Verfahren und Vorrichtung zur Spracherkennung und Übersetzungssystem | |
CA2247006C (en) | Speech processing | |
EP1204091A3 (de) | System und Verfahren zur Mustererkennung im sehr hochdimensionalen Raum | |
EP0535146B1 (de) | System zur verarbeitung kontinuierlicher sprache | |
CN100401375C (zh) | 语音处理系统及方法 | |
CN1196104C (zh) | 语音处理 | |
EP0865030A3 (de) | Vorrichtung zur Berechnung einer a posteriori Wahrscheinlichkeit eines Phonemsymbols und Spracherkennungsvorrichtung | |
EP1083542A3 (de) | Verfahren und Vorrichtung zur Sprachdetektion | |
EP0109190A1 (de) | Einsilbenerkennungseinrichtung | |
EP1612719A3 (de) | Methode, Vorrichtung, System, Aufnahmemittel und Computerprogram zur Situationerkennung mittels optischer Information | |
EP0844583A3 (de) | Verfahren und Gerät zur Zeichenerkennung | |
WO2000042563A3 (en) | Signature recognition system and method | |
EP1022723A3 (de) | Unüberwachte Anpassung eines Spracherkenners mittels zuverlässiger Informationen aus mehrfachen Rechenhypothesen | |
TW357313B (en) | Methods and apparatus for handwriting recognition | |
EP0867857A3 (de) | Registrierung für die Spracherkennung | |
EP0326927A3 (de) | Verfahren und Vorrichtung zur Datenbankverarbeitung | |
EP1197950A3 (de) | Hierarchisierte Wörterbücher für die Spracherkennung | |
EP0387602A3 (de) | Verfahren und Einrichtung zur automatischen Bestimmung von phonologischen Regeln für ein System zur Erkennung kontinuierlicher Sprache | |
EP0831456A3 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
EP0903728A3 (de) | Blockalgorithmus für die Mustererkennung | |
EP1126436A3 (de) | Spracherkennung aus multimodalen Eingabe | |
EP0732685A3 (de) | Einrichtung zur Erkennung kontinuierlich gesprochener Sprache | |
WO2001031633A3 (en) | Speech recognition | |
EP0984385A3 (de) | Erkennungsverarbeitung für eine zweidimensionale Kodierung |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): DE FR GB IT |
|
AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
17P | Request for examination filed |
Effective date: 19990616 |
|
AKX | Designation fees paid |
Free format text: DE FR GB IT |
|
17Q | First examination report despatched |
Effective date: 20020123 |
|
GRAH | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOS IGRA |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: 7G 10L 15/20 B Ipc: 7G 10L 15/14 A |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): DE FR GB IT |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRE;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED.SCRIBED TIME-LIMIT Effective date: 20030924 |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REF | Corresponds to: |
Ref document number: 69725106 Country of ref document: DE Date of ref document: 20031030 Kind code of ref document: P |
|
ET | Fr: translation filed | ||
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20040625 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20060224 Year of fee payment: 9 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20061218 Year of fee payment: 10 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20070703 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20061218 Year of fee payment: 10 |
|
GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20071202 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: ST Effective date: 20081020 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20071202 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20071231 |