DE69941499D1 - Vorrichtungen und Verfahren zum Lernen und Anwenden eines Abstand-Transition-Modelles - Google Patents

Vorrichtungen und Verfahren zum Lernen und Anwenden eines Abstand-Transition-Modelles

Info

Publication number
DE69941499D1
DE69941499D1 DE69941499T DE69941499T DE69941499D1 DE 69941499 D1 DE69941499 D1 DE 69941499D1 DE 69941499 T DE69941499 T DE 69941499T DE 69941499 T DE69941499 T DE 69941499T DE 69941499 D1 DE69941499 D1 DE 69941499D1
Authority
DE
Germany
Prior art keywords
learning
applying
distance
methods
transition model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69941499T
Other languages
English (en)
Inventor
Tetsujiro Kondo
Norifumi Yoshiwara
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Application granted granted Critical
Publication of DE69941499D1 publication Critical patent/DE69941499D1/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/24Speech recognition using non-acoustical features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24133Distances to prototypes
    • G06F18/24137Distances to cluster centroïds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/72Data preparation, e.g. statistical preprocessing of image or video features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/762Arrangements for image or video recognition or understanding using pattern recognition or machine learning using clustering, e.g. of similar faces in social networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • G06V40/171Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20Movements or behaviour, e.g. gesture recognition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/10Speech classification or search using distance or distortion measures between unknown speech and reference templates

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Psychiatry (AREA)
  • Social Psychology (AREA)
  • Image Analysis (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Character Discrimination (AREA)
  • Radar Systems Or Details Thereof (AREA)
DE69941499T 1998-10-09 1999-10-12 Vorrichtungen und Verfahren zum Lernen und Anwenden eines Abstand-Transition-Modelles Expired - Lifetime DE69941499D1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP28803898 1998-10-09

Publications (1)

Publication Number Publication Date
DE69941499D1 true DE69941499D1 (de) 2009-11-12

Family

ID=17725033

Family Applications (3)

Application Number Title Priority Date Filing Date
DE69941499T Expired - Lifetime DE69941499D1 (de) 1998-10-09 1999-10-12 Vorrichtungen und Verfahren zum Lernen und Anwenden eines Abstand-Transition-Modelles
DE69941999T Expired - Lifetime DE69941999D1 (de) 1998-10-09 1999-10-12 Erkennungsvorrichtung, Erkennungsverfahren und Aufzeichnungsmedium
DE69943018T Expired - Lifetime DE69943018D1 (de) 1998-10-09 1999-10-12 Lernvorrichtung und -verfahren, erkennungsvorrichtung und verfahren, und aufnahme-medium

Family Applications After (2)

Application Number Title Priority Date Filing Date
DE69941999T Expired - Lifetime DE69941999D1 (de) 1998-10-09 1999-10-12 Erkennungsvorrichtung, Erkennungsverfahren und Aufzeichnungsmedium
DE69943018T Expired - Lifetime DE69943018D1 (de) 1998-10-09 1999-10-12 Lernvorrichtung und -verfahren, erkennungsvorrichtung und verfahren, und aufnahme-medium

Country Status (5)

Country Link
US (3) US6449591B1 (de)
EP (4) EP1039446B1 (de)
KR (1) KR100729316B1 (de)
DE (3) DE69941499D1 (de)
WO (1) WO2000022607A1 (de)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1039446B1 (de) * 1998-10-09 2010-12-08 Sony Corporation Lernvorrichtung und -verfahren, erkennungsvorrichtung und verfahren, und aufnahme-medium
CN1202514C (zh) * 2000-11-27 2005-05-18 日本电信电话株式会社 编码和解码语音及其参数的方法、编码器、解码器
US7356466B2 (en) * 2002-06-28 2008-04-08 Samsung Electronics Co., Ltd. Method and apparatus for performing observation probability calculations
US7640164B2 (en) * 2002-07-04 2009-12-29 Denso Corporation System for performing interactive dialog
JP4639784B2 (ja) * 2004-12-06 2011-02-23 ソニー株式会社 学習装置および学習方法、並びにプログラム
JP2006285899A (ja) * 2005-04-05 2006-10-19 Sony Corp 学習装置および学習方法、生成装置および生成方法、並びにプログラム
EP2333718B1 (de) * 2009-01-29 2013-08-28 Nec Corporation Vorrichtung zur auswahl von merkmalsmengen
CN101950376B (zh) * 2009-07-09 2014-10-29 索尼公司 隐马尔可夫模型学习设备和方法
US9197736B2 (en) * 2009-12-31 2015-11-24 Digimarc Corporation Intuitive computing methods and systems
CN102782733B (zh) 2009-12-31 2015-11-25 数字标记公司 采用配备有传感器的智能电话的方法和配置方案
GB2477324A (en) * 2010-02-01 2011-08-03 Rolls Royce Plc Device monitoring
JP2011223287A (ja) * 2010-04-09 2011-11-04 Sony Corp 情報処理装置、情報処理方法、及び、プログラム
US8490056B2 (en) * 2010-04-28 2013-07-16 International Business Machines Corporation Automatic identification of subroutines from test scripts
US9311640B2 (en) 2014-02-11 2016-04-12 Digimarc Corporation Methods and arrangements for smartphone payments and transactions
JP6828741B2 (ja) * 2016-05-16 2021-02-10 ソニー株式会社 情報処理装置
US10332515B2 (en) * 2017-03-14 2019-06-25 Google Llc Query endpointing based on lip detection
WO2019123544A1 (ja) * 2017-12-19 2019-06-27 オリンパス株式会社 データ処理方法およびデータ処理装置

Family Cites Families (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4608708A (en) 1981-12-24 1986-08-26 Nippon Electric Co., Ltd. Pattern matching system
JPS58143396A (ja) * 1982-02-19 1983-08-25 日本電気株式会社 音声認識装置
US5054085A (en) * 1983-05-18 1991-10-01 Speech Systems, Inc. Preprocessing system for speech recognition
US4817158A (en) * 1984-10-19 1989-03-28 International Business Machines Corporation Normalization of speech signals
JP2709386B2 (ja) * 1987-06-24 1998-02-04 株式会社 エイ・ティ・ア−ル自動翻訳電話研究所 スペクトログラムの正規化方法
JPH02195400A (ja) * 1989-01-24 1990-08-01 Canon Inc 音声認識装置
JP2979711B2 (ja) * 1991-04-24 1999-11-15 日本電気株式会社 パターン認識方式および標準パターン学習方式
US5263097A (en) * 1991-07-24 1993-11-16 Texas Instruments Incorporated Parameter normalized features for classification procedures, systems and methods
US5586215A (en) * 1992-05-26 1996-12-17 Ricoh Corporation Neural network acoustic and visual speech recognition system
US5502774A (en) * 1992-06-09 1996-03-26 International Business Machines Corporation Automatic recognition of a consistent message using multiple complimentary sources of information
JPH064093A (ja) * 1992-06-18 1994-01-14 Matsushita Electric Ind Co Ltd Hmm作成装置、hmm記憶装置、尤度計算装置及び、認識装置
US5515475A (en) * 1993-06-24 1996-05-07 Northern Telecom Limited Speech recognition method using a two-pass search
US5692100A (en) * 1994-02-02 1997-11-25 Matsushita Electric Industrial Co., Ltd. Vector quantizer
JP2775140B2 (ja) * 1994-03-18 1998-07-16 株式会社エイ・ティ・アール人間情報通信研究所 パターン認識方法、音声認識方法および音声認識装置
JP3533696B2 (ja) * 1994-03-22 2004-05-31 三菱電機株式会社 音声認識の境界推定方法及び音声認識装置
US6471420B1 (en) * 1994-05-13 2002-10-29 Matsushita Electric Industrial Co., Ltd. Voice selection apparatus voice response apparatus, and game apparatus using word tables from which selected words are output as voice selections
CN1159704C (zh) * 1994-06-13 2004-07-28 松下电器产业株式会社 信号分析装置
JPH08123462A (ja) * 1994-10-27 1996-05-17 Sony Corp 音声認識装置
JPH08211897A (ja) 1995-02-07 1996-08-20 Toyota Motor Corp 音声認識装置
JP3627299B2 (ja) * 1995-07-19 2005-03-09 ソニー株式会社 音声認識方法及び装置
JPH0981183A (ja) * 1995-09-14 1997-03-28 Pioneer Electron Corp 音声モデルの作成方法およびこれを用いた音声認識装置
US5729694A (en) * 1996-02-06 1998-03-17 The Regents Of The University Of California Speech coding, reconstruction and recognition using acoustics and electromagnetic waves
US6006175A (en) * 1996-02-06 1999-12-21 The Regents Of The University Of California Methods and apparatus for non-acoustic speech characterization and recognition
JP3702978B2 (ja) * 1996-12-26 2005-10-05 ソニー株式会社 認識装置および認識方法、並びに学習装置および学習方法
JPH10288038A (ja) * 1997-04-15 1998-10-27 Nissan Motor Co Ltd 直接噴射式ディーゼルエンジン
KR20000001476U (ko) * 1998-06-20 2000-01-25 조병호 특정문장 화자인식에 의한 도어록 장치 고안
US6185529B1 (en) * 1998-09-14 2001-02-06 International Business Machines Corporation Speech recognition aided by lateral profile image
EP1039446B1 (de) * 1998-10-09 2010-12-08 Sony Corporation Lernvorrichtung und -verfahren, erkennungsvorrichtung und verfahren, und aufnahme-medium
JP4345156B2 (ja) * 1998-10-09 2009-10-14 ソニー株式会社 学習装置および学習方法、認識装置および認識方法、並びに記録媒体

Also Published As

Publication number Publication date
EP1863013A2 (de) 2007-12-05
EP1863014A3 (de) 2008-08-06
DE69941999D1 (de) 2010-03-25
EP1039446A1 (de) 2000-09-27
EP1863013A3 (de) 2008-08-06
EP1863014A2 (de) 2007-12-05
EP1039446B1 (de) 2010-12-08
EP2056290B1 (de) 2010-02-03
US20050096902A1 (en) 2005-05-05
US20020184011A1 (en) 2002-12-05
EP1039446A4 (de) 2005-07-20
US6449591B1 (en) 2002-09-10
WO2000022607A1 (fr) 2000-04-20
KR100729316B1 (ko) 2007-06-19
US7072829B2 (en) 2006-07-04
KR20010032920A (ko) 2001-04-25
EP1863014B1 (de) 2009-09-30
EP2056290A1 (de) 2009-05-06
DE69943018D1 (de) 2011-01-20
EP1863013B1 (de) 2013-01-02

Similar Documents

Publication Publication Date Title
DE69510055D1 (de) Verfahren und gerät zum anbringen eines metallfundaments
DE69606784T2 (de) Verfahren und vorrichtung zum positionieren eines magnetoresistiven kopfes
DE59712087D1 (de) Verfahren und vorrichtung zum ansteuern eines kapazitiven stellgliedes
DE69812696D1 (de) Verfahren und vorrichtung zum einstellen eines oder mehrerer projektoren
DE69831642D1 (de) Verfahren und Gerät zur Simulation eines rollenden Reifens
DE59800828D1 (de) Vorrichtung und Verfahren zum Ziehen eines Einkristalls
DE60017228D1 (de) Verfahren zum Identifizieren eines Trainierenden
DE69941499D1 (de) Vorrichtungen und Verfahren zum Lernen und Anwenden eines Abstand-Transition-Modelles
DE69925844D1 (de) Verfahren und vorichtung zum falten eines gassacks
DE59710480D1 (de) Verfahren und vorrichtung zum ansteuern eines kapazitiven stellgliedes
DE69501037T2 (de) Verfahren und Vorrichtung zum Falten eines Etiketts
DE69607003T2 (de) Verfahren und vorrichtung zum steuern eines beweglichen geräts
ATE482737T1 (de) Vorrichtung und verfahren zum positionieren und manipulieren eines gerätes
DE69601123T2 (de) Vorrichtung und Verfahren zum Anbringen eines rohrförmigen Elements um einen Gegenstand
DE50015915D1 (de) Verfahren und Vorrichtung zum Steuern eines Roboters
DE69923876D1 (de) Verfahren und Vorrichtung zum Anbringen eines selbsthebenden Bandes
DE69931728D1 (de) Verfahren und Apparat zum automatischen Aufnehmen einer Wellenform
DE69609113D1 (de) Vorrichtung und Verfahren zum Steuern eines Schreibkopfes
DE10085273T1 (de) Verfahren und Einrichtung zum Konstruieren eines Vorab-eingeplante Befehle-Cache
DE59807532D1 (de) Vorrichtung und Verfahren zum Erstellen eines Einzelpositionbezugwertes in einem Druckprozess
DE59807939D1 (de) Verfahren und Vorrichtung zum Auswerfen eines Gutes
DE69932776D1 (de) Verfahren und Gerät zum Aufzeichnen und Wiedergeben eines übertragenen Programmbeitrags
DE59903273D1 (de) Verfahren zum herstellen eines bauelementes und bauelement
DE69701072T2 (de) Vorrichtung und Verfahren zum Stabilisieren eines Bohrloches
DE19781967T1 (de) Verfahren und Vorrichtung zum Ziehen eines Einkristalls

Legal Events

Date Code Title Description
8364 No opposition during term of opposition