DE69941999D1 - Recognition device, recognition method, and recording medium - Google Patents

Recognition device, recognition method, and recording medium

Info

Publication number
DE69941999D1
Authority
DE
Germany
Prior art keywords
recognition
recording medium
recognition device
recognition method
recording
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69941999T
Other languages
English (en)
Inventor
Tetsujiro Kondo
Norifumi Yoshiwara
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1998-10-09
Filing date
1999-10-12
Publication date
2010-03-25
Application filed by Sony Corp
Application granted
Publication of DE69941999D1
Anticipated expiration
Expired - Lifetime (current legal status)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/24 Speech recognition using non-acoustical features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/10 Pre-processing; Data cleansing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/23 Clustering techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24133 Distances to prototypes
    • G06F18/24137 Distances to cluster centroïds
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/25 Fusion techniques
    • G06F18/253 Fusion techniques of extracted features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/72 Data preparation, e.g. statistical preprocessing of image or video features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/762 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using clustering, e.g. of similar faces in social networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16 Human faces, e.g. facial parts, sketches or expressions
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16 Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168 Feature extraction; Face representation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16 Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168 Feature extraction; Face representation
    • G06V40/171 Local features and components; Facial parts; Occluding parts, e.g. glasses; Geometrical relationships
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20 Movements or behaviour, e.g. gesture recognition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065 Adaptation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/10 Speech classification or search using distance or distortion measures between unknown speech and reference templates
DE69941999T 1998-10-09 1999-10-12 Recognition device, recognition method, and recording medium Expired - Lifetime DE69941999D1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP28803898 1998-10-09

Publications (1)

Publication Number Publication Date
DE69941999D1 (de) 2010-03-25

Family

ID=17725033

Family Applications (3)

Application Number Title Priority Date Filing Date
DE69941499T Expired - Lifetime DE69941499D1 (de) 1998-10-09 1999-10-12 Devices and methods for learning and applying a distance-transition model
DE69943018T Expired - Lifetime DE69943018D1 (de) 1998-10-09 1999-10-12 Learning device and method, recognition device and method, and recording medium
DE69941999T Expired - Lifetime DE69941999D1 (de) 1998-10-09 1999-10-12 Recognition device, recognition method, and recording medium

Family Applications Before (2)

Application Number Title Priority Date Filing Date
DE69941499T Expired - Lifetime DE69941499D1 (de) 1998-10-09 1999-10-12 Devices and methods for learning and applying a distance-transition model
DE69943018T Expired - Lifetime DE69943018D1 (de) 1998-10-09 1999-10-12 Learning device and method, recognition device and method, and recording medium

Country Status (5)

Country Link
US (3) US6449591B1 (de)
EP (4) EP1039446B1 (de)
KR (1) KR100729316B1 (de)
DE (3) DE69941499D1 (de)
WO (1) WO2000022607A1 (de)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69941499D1 (de) * 1998-10-09 2009-11-12 Sony Corp Devices and methods for learning and applying a distance-transition model
CN1202514C (zh) * 2000-11-27 2005-05-18 日本电信电话株式会社 Method for encoding and decoding speech and its parameters, encoder, and decoder
US7356466B2 (en) * 2002-06-28 2008-04-08 Samsung Electronics Co., Ltd. Method and apparatus for performing observation probability calculations
US7640164B2 (en) * 2002-07-04 2009-12-29 Denso Corporation System for performing interactive dialog
JP4639784B2 (ja) * 2004-12-06 2011-02-23 ソニー株式会社 Learning device, learning method, and program
JP2006285899A (ja) * 2005-04-05 2006-10-19 Sony Corp Learning device and learning method, generation device and generation method, and program
JP4766197B2 (ja) * 2009-01-29 2011-09-07 日本電気株式会社 Feature selection device
US8725510B2 (en) * 2009-07-09 2014-05-13 Sony Corporation HMM learning device and method, program, and recording medium
US9197736B2 (en) * 2009-12-31 2015-11-24 Digimarc Corporation Intuitive computing methods and systems
US9143603B2 (en) 2009-12-31 2015-09-22 Digimarc Corporation Methods and arrangements employing sensor-equipped smart phones
GB2477324A (en) * 2010-02-01 2011-08-03 Rolls Royce Plc Device monitoring
JP2011223287A (ja) * 2010-04-09 2011-11-04 Sony Corp Information processing device, information processing method, and program
US8490056B2 (en) * 2010-04-28 2013-07-16 International Business Machines Corporation Automatic identification of subroutines from test scripts
US9311639B2 (en) 2014-02-11 2016-04-12 Digimarc Corporation Methods, apparatus and arrangements for device to device communication
JP6828741B2 (ja) * 2016-05-16 2021-02-10 ソニー株式会社 Information processing device
US10332515B2 (en) * 2017-03-14 2019-06-25 Google Llc Query endpointing based on lip detection
WO2019123544A1 (ja) * 2017-12-19 2019-06-27 オリンパス株式会社 Data processing method and data processing device

Family Cites Families (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4608708A (en) 1981-12-24 1986-08-26 Nippon Electric Co., Ltd. Pattern matching system
JPS58143396A (ja) * 1982-02-19 1983-08-25 日本電気株式会社 Speech recognition device
US5054085A (en) * 1983-05-18 1991-10-01 Speech Systems, Inc. Preprocessing system for speech recognition
US4817158A (en) * 1984-10-19 1989-03-28 International Business Machines Corporation Normalization of speech signals
JP2709386B2 (ja) * 1987-06-24 1998-02-04 株式会社 エイ・ティ・ア−ル自動翻訳電話研究所 Spectrogram normalization method
JPH02195400A (ja) * 1989-01-24 1990-08-01 Canon Inc Speech recognition device
JP2979711B2 (ja) * 1991-04-24 1999-11-15 日本電気株式会社 Pattern recognition system and standard pattern learning system
US5263097A (en) * 1991-07-24 1993-11-16 Texas Instruments Incorporated Parameter normalized features for classification procedures, systems and methods
US5586215A (en) * 1992-05-26 1996-12-17 Ricoh Corporation Neural network acoustic and visual speech recognition system
US5502774A (en) * 1992-06-09 1996-03-26 International Business Machines Corporation Automatic recognition of a consistent message using multiple complimentary sources of information
JPH064093A (ja) * 1992-06-18 1994-01-14 Matsushita Electric Ind Co Ltd HMM creation device, HMM storage device, likelihood calculation device, and recognition device
US5515475A (en) * 1993-06-24 1996-05-07 Northern Telecom Limited Speech recognition method using a two-pass search
US5692100A (en) * 1994-02-02 1997-11-25 Matsushita Electric Industrial Co., Ltd. Vector quantizer
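JP2775140B2 (ja) * 1994-03-18 1998-07-16 株式会社エイ・ティ・アール人間情報通信研究所 Pattern recognition method, speech recognition method, and speech recognition device
JP3533696B2 (ja) * 1994-03-22 2004-05-31 三菱電機株式会社 Boundary estimation method for speech recognition and speech recognition device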
US6471420B1 (en) * 1994-05-13 2002-10-29 Matsushita Electric Industrial Co., Ltd. Voice selection apparatus voice response apparatus, and game apparatus using word tables from which selected words are output as voice selections
KR100324988B1 (ko) * 1994-06-13 2002-08-27 마츠시타 덴끼 산교 가부시키가이샤 Signal analysis device
JPH08123462A (ja) * 1994-10-27 1996-05-17 Sony Corp Speech recognition device
JPH08211897A (ja) * 1995-02-07 1996-08-20 Toyota Motor Corp Speech recognition device
JP3627299B2 (ja) * 1995-07-19 2005-03-09 ソニー株式会社 Speech recognition method and device
JPH0981183A (ja) * 1995-09-14 1997-03-28 Pioneer Electron Corp Method for creating a speech model and speech recognition device using the same
US6006175A (en) * 1996-02-06 1999-12-21 The Regents Of The University Of California Methods and apparatus for non-acoustic speech characterization and recognition
US5729694A (en) * 1996-02-06 1998-03-17 The Regents Of The University Of California Speech coding, reconstruction and recognition using acoustics and electromagnetic waves
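JP3702978B2 (ja) * 1996-12-26 2005-10-05 ソニー株式会社 Recognition device and recognition method, and learning device and learning method
JPH10288038A (ja) * 1997-04-15 1998-10-27 Nissan Motor Co Ltd Direct-injection diesel engine
KR20000001476U (ko) * 1998-06-20 2000-01-25 조병호 Door lock device using speaker recognition of a specific sentence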
US6185529B1 (en) * 1998-09-14 2001-02-06 International Business Machines Corporation Speech recognition aided by lateral profile image
DE69941499D1 (de) * 1998-10-09 2009-11-12 Sony Corp Devices and methods for learning and applying a distance-transition model
JP4345156B2 (ja) * 1998-10-09 2009-10-14 ソニー株式会社 Learning device and learning method, recognition device and recognition method, and recording medium

Also Published As

Publication number Publication date
US20050096902A1 (en) 2005-05-05
EP1863014B1 (de) 2009-09-30
DE69941499D1 (de) 2009-11-12
US7072829B2 (en) 2006-07-04
EP2056290B1 (de) 2010-02-03
US20020184011A1 (en) 2002-12-05
EP1863013A2 (de) 2007-12-05
KR100729316B1 (ko) 2007-06-19
EP1863014A3 (de) 2008-08-06
EP1039446B1 (de) 2010-12-08
KR20010032920A (ko) 2001-04-25
EP1863013B1 (de) 2013-01-02
EP1039446A1 (de) 2000-09-27
EP1863013A3 (de) 2008-08-06
EP2056290A1 (de) 2009-05-06
EP1039446A4 (de) 2005-07-20
WO2000022607A1 (fr) 2000-04-20
EP1863014A2 (de) 2007-12-05
DE69943018D1 (de) 2011-01-20
US6449591B1 (en) 2002-09-10

Similar Documents

Publication Publication Date Title
DE69900901D1 (de) Information recording medium, information recording method, and information recording apparatus
DE69912857D1 (de) Recording apparatus, recording method, and computer-readable storage medium
DE60033543D1 (de) Recording apparatus, recording method, playback apparatus, playback method, and recording medium
DE69704174D1 (de) Information recording method and information recording apparatus
DE69816359D1 (de) Printing apparatus, printing method, and recording medium
DE69842139D1 (de) Storage apparatus, storage control method, and storage medium
DE60237878D1 (de) Recording apparatus, recording method, and recording medium
DE69729621D1 (de) Game device and processing method therefor, and recording medium
DE60106164D1 (de) Information editing apparatus, information editing method, and information recording medium
EP1120787A4 (de) Information recording method, information recording device, and information recording medium
DE69808377T2 (de) Recording medium type identification method and apparatus
DE69627992D1 (de) Information recording medium, recording method, and playback apparatus
DE60040263D1 (de) Recording apparatus, recording method, and disc-shaped recording medium
DE69917994D1 (de) Recording medium and recording apparatus
DE69627741D1 (de) Information recording method, playback method, and playback apparatus
DE60036599D1 (de) Robot device, control method, and recording medium
DE60042185D1 (de) Information processing device, information processing method, and program recording medium
DE69841745D1 (de) Recording apparatus, recording method, and recording medium
DE69935548D1 (de) Access control method, storage apparatus, and storage medium
DE60039147D1 (de) Recording medium, playback apparatus and method
DE69519497D1 (de) Recording head, recording method, and associated apparatus
DE69941999D1 (de) Recognition device, recognition method, and recording medium
DE69926283D1 (de) Playback apparatus, recording apparatus, and recording/playback apparatus
DE69520487T2 (de) Optical recording method, optical recording apparatus, and optical recording medium
DE69932108D1 (de) Recording apparatus, recording method, and storage medium

Legal Events

Date Code Title Description
8364 No opposition during term of opposition