DE69523219T2 - Anpassungsfähiges Lernverfahren zur Mustererkennung - Google Patents

Anpassungsfähiges Lernverfahren zur Mustererkennung

Info

Publication number
DE69523219T2
DE69523219T2 DE69523219T DE69523219T DE69523219T2 DE 69523219 T2 DE69523219 T2 DE 69523219T2 DE 69523219 T DE69523219 T DE 69523219T DE 69523219 T DE69523219 T DE 69523219T DE 69523219 T2 DE69523219 T2 DE 69523219T2
Authority
DE
Germany
Prior art keywords
pattern recognition
learning process
adaptable
adaptable learning
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69523219T
Other languages
English (en)
Other versions
DE69523219D1 (de
Inventor
Junichi Takahashi
Shigeki Sagayama
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nippon Telegraph and Telephone Corp
Original Assignee
Nippon Telegraph and Telephone Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from JP6156238A external-priority patent/JPH0822296A/ja
Priority claimed from JP6226505A external-priority patent/JPH0895592A/ja
Application filed by Nippon Telegraph and Telephone Corp filed Critical Nippon Telegraph and Telephone Corp
Application granted granted Critical
Publication of DE69523219D1 publication Critical patent/DE69523219D1/de
Publication of DE69523219T2 publication Critical patent/DE69523219T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/025Phonemes, fenemes or fenones being the recognition units
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/12Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)
DE69523219T 1994-07-07 1995-07-05 Anpassungsfähiges Lernverfahren zur Mustererkennung Expired - Lifetime DE69523219T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP6156238A JPH0822296A (ja) 1994-07-07 1994-07-07 パターン認識方法
JP6226505A JPH0895592A (ja) 1994-09-21 1994-09-21 パターン認識方法

Publications (2)

Publication Number Publication Date
DE69523219D1 DE69523219D1 (de) 2001-11-22
DE69523219T2 true DE69523219T2 (de) 2002-06-27

Family

ID=26484046

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69523219T Expired - Lifetime DE69523219T2 (de) 1994-07-07 1995-07-05 Anpassungsfähiges Lernverfahren zur Mustererkennung

Country Status (3)

Country Link
US (1) US5793891A (de)
EP (1) EP0691640B1 (de)
DE (1) DE69523219T2 (de)

Families Citing this family (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5960395A (en) * 1996-02-09 1999-09-28 Canon Kabushiki Kaisha Pattern matching method, apparatus and computer readable memory medium for speech recognition using dynamic programming
GB9602699D0 (en) * 1996-02-09 1996-04-10 Canon Kk Pattern matching method and apparatus
US6151575A (en) * 1996-10-28 2000-11-21 Dragon Systems, Inc. Rapid adaptation of speech models
ATE277405T1 (de) * 1997-01-27 2004-10-15 Microsoft Corp Stimmumwandlung
US6073096A (en) * 1998-02-04 2000-06-06 International Business Machines Corporation Speaker adaptation system and method based on class-specific pre-clustering training speakers
JPH11296192A (ja) * 1998-04-10 1999-10-29 Pioneer Electron Corp 音声認識における音声特徴量の補正方法、音声認識方法、音声認識装置及び音声認識プログラムを記録した記録媒体
US6263309B1 (en) * 1998-04-30 2001-07-17 Matsushita Electric Industrial Co., Ltd. Maximum likelihood method for finding an adapted speaker model in eigenvoice space
US6343267B1 (en) 1998-04-30 2002-01-29 Matsushita Electric Industrial Co., Ltd. Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques
JP3156668B2 (ja) * 1998-06-19 2001-04-16 日本電気株式会社 音声認識装置
US6658385B1 (en) * 1999-03-12 2003-12-02 Texas Instruments Incorporated Method for transforming HMMs for speaker-independent recognition in a noisy environment
KR100307623B1 (ko) * 1999-10-21 2001-11-02 윤종용 엠.에이.피 화자 적응 조건에서 파라미터의 분별적 추정 방법 및 장치 및 이를 각각 포함한 음성 인식 방법 및 장치
US6421641B1 (en) * 1999-11-12 2002-07-16 International Business Machines Corporation Methods and apparatus for fast adaptation of a band-quantized speech decoding system
US6526379B1 (en) 1999-11-29 2003-02-25 Matsushita Electric Industrial Co., Ltd. Discriminative clustering methods for automatic speech recognition
US6571208B1 (en) 1999-11-29 2003-05-27 Matsushita Electric Industrial Co., Ltd. Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training
JP2001166789A (ja) * 1999-12-10 2001-06-22 Matsushita Electric Ind Co Ltd 初頭/末尾の音素類似度ベクトルによる中国語の音声認識方法及びその装置
US6920421B2 (en) * 1999-12-28 2005-07-19 Sony Corporation Model adaptive apparatus for performing adaptation of a model used in pattern recognition considering recentness of a received pattern data
FR2808917B1 (fr) * 2000-05-09 2003-12-12 Thomson Csf Procede et dispositif de reconnaissance vocale dans des environnements a niveau de bruit fluctuant
TW473704B (en) * 2000-08-30 2002-01-21 Ind Tech Res Inst Adaptive voice recognition method with noise compensation
JP2002073072A (ja) * 2000-08-31 2002-03-12 Sony Corp モデル適応装置およびモデル適応方法、記録媒体、並びにパターン認識装置
WO2002087201A1 (en) * 2001-04-19 2002-10-31 British Telecommunications Public Limited Company Voice response system
JP2003308091A (ja) * 2002-04-17 2003-10-31 Pioneer Electronic Corp 音声認識装置、音声認識方法および音声認識プログラム
EP1376537B1 (de) * 2002-05-27 2009-04-08 Pioneer Corporation Vorrichtung, Verfahren und computerlesbares Aufzeichnungsmedium zur Erkennung von Schlüsselwörtern in spontaner Sprache
US7571097B2 (en) * 2003-03-13 2009-08-04 Microsoft Corporation Method for training of subspace coded gaussian models
US7516071B2 (en) * 2003-06-30 2009-04-07 International Business Machines Corporation Method of modeling single-enrollment classes in verification and identification tasks
US7539617B2 (en) * 2003-07-01 2009-05-26 France Telecom Method and system for analysis of vocal signals for a compressed representation of speakers using a probability density representing resemblances between a vocal representation of the speaker in a predetermined model and a predetermined set of vocal representations reference speakers
US7496509B2 (en) * 2004-05-28 2009-02-24 International Business Machines Corporation Methods and apparatus for statistical biometric model migration
JP4301102B2 (ja) * 2004-07-22 2009-07-22 ソニー株式会社 音声処理装置および音声処理方法、プログラム、並びに記録媒体
EP1794746A2 (de) * 2004-09-23 2007-06-13 Koninklijke Philips Electronics N.V. Verfahren zum trainieren eines robusten sprecherunabhängigen spracherkennungssystems mit sprecherabhängigen ausdrücken und robustes sprecherabhängiges spracherkennungssystem
US20070033044A1 (en) * 2005-08-03 2007-02-08 Texas Instruments, Incorporated System and method for creating generalized tied-mixture hidden Markov models for automatic speech recognition
US7593572B2 (en) * 2006-02-09 2009-09-22 Microsoft Corporation Ink-parser-parameter optimization
TWI311311B (en) * 2006-11-16 2009-06-21 Inst Information Industr Speech recognition device, method, application program, and computer readable medium for adjusting speech models with selected speech data
US20080162129A1 (en) * 2006-12-29 2008-07-03 Motorola, Inc. Method and apparatus pertaining to the processing of sampled audio content using a multi-resolution speech recognition search process
TWI319563B (en) * 2007-05-31 2010-01-11 Cyberon Corp Method and module for improving personal speech recognition capability
KR101217525B1 (ko) * 2008-12-22 2013-01-18 한국전자통신연구원 비터비 디코더와 이를 이용한 음성 인식 방법
TWI431563B (zh) * 2010-08-03 2014-03-21 Ind Tech Res Inst 語言學習系統、語言學習方法及其程式產品
US8892436B2 (en) * 2010-10-19 2014-11-18 Samsung Electronics Co., Ltd. Front-end processor for speech recognition, and speech recognizing apparatus and method using the same
GB2495110B (en) * 2011-09-28 2014-03-19 Toshiba Res Europ Ltd Antenna combining
US9001976B2 (en) * 2012-05-03 2015-04-07 Nexidia, Inc. Speaker adaptation
US9575952B2 (en) 2014-10-21 2017-02-21 At&T Intellectual Property I, L.P. Unsupervised topic modeling for short texts
US10719115B2 (en) * 2014-12-30 2020-07-21 Avago Technologies International Sales Pte. Limited Isolated word training and detection using generated phoneme concatenation models of audio inputs
CN105989849B (zh) * 2015-06-03 2019-12-03 乐融致新电子科技(天津)有限公司 一种语音增强方法、语音识别方法、聚类方法及装置
CN107564513B (zh) * 2016-06-30 2020-09-08 阿里巴巴集团控股有限公司 语音识别方法及装置
KR102637339B1 (ko) * 2018-08-31 2024-02-16 삼성전자주식회사 음성 인식 모델을 개인화하는 방법 및 장치
CN111191723B (zh) * 2019-12-30 2023-06-20 创新奇智(北京)科技有限公司 基于级联分类器的少样本商品分类系统及分类方法

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4741036A (en) * 1985-01-31 1988-04-26 International Business Machines Corporation Determination of phone weights for markov models in a speech recognition system
US5129002A (en) * 1987-12-16 1992-07-07 Matsushita Electric Industrial Co., Ltd. Pattern recognition apparatus
US5129001A (en) * 1990-04-25 1992-07-07 International Business Machines Corporation Method and apparatus for modeling words with multi-arc markov models
US5544257A (en) * 1992-01-08 1996-08-06 International Business Machines Corporation Continuous parameter hidden Markov model approach to automatic handwriting recognition
US5497447A (en) * 1993-03-08 1996-03-05 International Business Machines Corporation Speech coding apparatus having acoustic prototype vectors generated by tying to elementary models and clustering around reference vectors

Also Published As

Publication number Publication date
US5793891A (en) 1998-08-11
EP0691640A3 (de) 1997-11-26
EP0691640A2 (de) 1996-01-10
EP0691640B1 (de) 2001-10-17
DE69523219D1 (de) 2001-11-22

Similar Documents

Publication Publication Date Title
DE69523219D1 (de) Anpassungsfähiges Lernverfahren zur Mustererkennung
DE69616568T2 (de) Mustererkennung
DE69422446D1 (de) Mustererkennung
DE69610243D1 (de) Verfahren zum Trainieren einer Erkennungsanlage mit Zeichenmustern
DE69523567D1 (de) Verfahren zur handschrift-eingangsaufteilung
DE69517571T2 (de) Verfahren zur Erkennung von Mustern
DE69613293T2 (de) Vorrichtung zur Musteranpassung für Sprach- oder Mustererkennung
DE69610284D1 (de) Verfahren zur robotersteuerung
ATE181880T1 (de) Druckverfahren
DE69416250T2 (de) Vorrichtung zur oberflächenbehandlung
IL111039A0 (en) Handwritten pattern recognizer
DE69417273D1 (de) Verfahren und Vorrichtung zur Mustererkennung
FI953454A (fi) Menetelmä värikorttien tuottamiseksi
DE69737849D1 (de) Vorrichtung zur Musterwiedererkennung
DE69521003D1 (de) Gerät zur Erkennung von dreidimensionalen Formen
DE69425166D1 (de) Verfahren und Gerät zur Mustererkennung
DE69514573D1 (de) Vorrichtung zur Spracherkennung
DE69632135D1 (de) Prozess zur hydrierenden entschwefelung.
DE69603724D1 (de) Vorrichtung zur oberflächenbehandlung
DE69427677D1 (de) Bildmusteridentifikations/Erkennungsverfahren
DE19680889D2 (de) Verfahren zur Entfernung von Zinn
DE69619154D1 (de) Verfahren und Vorrichtung zur Mustererkennung
NO983959D0 (no) Treningsprosess
DE69028002T2 (de) Einrichtung zur Mustererkennung
KR960013697A (ko) 형틀을 이용한 광택 문자 또는 문양의 제조 방법

Legal Events

Date Code Title Description
8364 No opposition during term of opposition