DE69941499D1 - Vorrichtungen und Verfahren zum Lernen und Anwenden eines Abstand-Transition-Modelles - Google Patents
Vorrichtungen und Verfahren zum Lernen und Anwenden eines Abstand-Transition-ModellesInfo
- Publication number
- DE69941499D1 DE69941499D1 DE69941499T DE69941499T DE69941499D1 DE 69941499 D1 DE69941499 D1 DE 69941499D1 DE 69941499 T DE69941499 T DE 69941499T DE 69941499 T DE69941499 T DE 69941499T DE 69941499 D1 DE69941499 D1 DE 69941499D1
- Authority
- DE
- Germany
- Prior art keywords
- learning
- applying
- distance
- methods
- transition model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/24—Speech recognition using non-acoustical features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2413—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
- G06F18/24133—Distances to prototypes
- G06F18/24137—Distances to cluster centroïds
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/253—Fusion techniques of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/72—Data preparation, e.g. statistical preprocessing of image or video features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/762—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using clustering, e.g. of similar faces in social networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/806—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
- G06V40/171—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/20—Movements or behaviour, e.g. gesture recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/10—Speech classification or search using distance or distortion measures between unknown speech and reference templates
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Data Mining & Analysis (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Medical Informatics (AREA)
- Databases & Information Systems (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Psychiatry (AREA)
- Social Psychology (AREA)
- Image Analysis (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Character Discrimination (AREA)
- Radar Systems Or Details Thereof (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP28803898 | 1998-10-09 |
Publications (1)
Publication Number | Publication Date |
---|---|
DE69941499D1 true DE69941499D1 (de) | 2009-11-12 |
Family
ID=17725033
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE69941499T Expired - Lifetime DE69941499D1 (de) | 1998-10-09 | 1999-10-12 | Vorrichtungen und Verfahren zum Lernen und Anwenden eines Abstand-Transition-Modelles |
DE69941999T Expired - Lifetime DE69941999D1 (de) | 1998-10-09 | 1999-10-12 | Erkennungsvorrichtung, Erkennungsverfahren und Aufzeichnungsmedium |
DE69943018T Expired - Lifetime DE69943018D1 (de) | 1998-10-09 | 1999-10-12 | Lernvorrichtung und -verfahren, erkennungsvorrichtung und verfahren, und aufnahme-medium |
Family Applications After (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE69941999T Expired - Lifetime DE69941999D1 (de) | 1998-10-09 | 1999-10-12 | Erkennungsvorrichtung, Erkennungsverfahren und Aufzeichnungsmedium |
DE69943018T Expired - Lifetime DE69943018D1 (de) | 1998-10-09 | 1999-10-12 | Lernvorrichtung und -verfahren, erkennungsvorrichtung und verfahren, und aufnahme-medium |
Country Status (5)
Country | Link |
---|---|
US (3) | US6449591B1 (de) |
EP (4) | EP1039446B1 (de) |
KR (1) | KR100729316B1 (de) |
DE (3) | DE69941499D1 (de) |
WO (1) | WO2000022607A1 (de) |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1039446B1 (de) * | 1998-10-09 | 2010-12-08 | Sony Corporation | Lernvorrichtung und -verfahren, erkennungsvorrichtung und verfahren, und aufnahme-medium |
CN1202514C (zh) * | 2000-11-27 | 2005-05-18 | 日本电信电话株式会社 | 编码和解码语音及其参数的方法、编码器、解码器 |
US7356466B2 (en) * | 2002-06-28 | 2008-04-08 | Samsung Electronics Co., Ltd. | Method and apparatus for performing observation probability calculations |
US7640164B2 (en) * | 2002-07-04 | 2009-12-29 | Denso Corporation | System for performing interactive dialog |
JP4639784B2 (ja) * | 2004-12-06 | 2011-02-23 | ソニー株式会社 | 学習装置および学習方法、並びにプログラム |
JP2006285899A (ja) * | 2005-04-05 | 2006-10-19 | Sony Corp | 学習装置および学習方法、生成装置および生成方法、並びにプログラム |
EP2333718B1 (de) * | 2009-01-29 | 2013-08-28 | Nec Corporation | Vorrichtung zur auswahl von merkmalsmengen |
CN101950376B (zh) * | 2009-07-09 | 2014-10-29 | 索尼公司 | 隐马尔可夫模型学习设备和方法 |
US9197736B2 (en) * | 2009-12-31 | 2015-11-24 | Digimarc Corporation | Intuitive computing methods and systems |
CN102782733B (zh) | 2009-12-31 | 2015-11-25 | 数字标记公司 | 采用配备有传感器的智能电话的方法和配置方案 |
GB2477324A (en) * | 2010-02-01 | 2011-08-03 | Rolls Royce Plc | Device monitoring |
JP2011223287A (ja) * | 2010-04-09 | 2011-11-04 | Sony Corp | 情報処理装置、情報処理方法、及び、プログラム |
US8490056B2 (en) * | 2010-04-28 | 2013-07-16 | International Business Machines Corporation | Automatic identification of subroutines from test scripts |
US9311640B2 (en) | 2014-02-11 | 2016-04-12 | Digimarc Corporation | Methods and arrangements for smartphone payments and transactions |
JP6828741B2 (ja) * | 2016-05-16 | 2021-02-10 | ソニー株式会社 | 情報処理装置 |
US10332515B2 (en) * | 2017-03-14 | 2019-06-25 | Google Llc | Query endpointing based on lip detection |
WO2019123544A1 (ja) * | 2017-12-19 | 2019-06-27 | オリンパス株式会社 | データ処理方法およびデータ処理装置 |
Family Cites Families (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4608708A (en) | 1981-12-24 | 1986-08-26 | Nippon Electric Co., Ltd. | Pattern matching system |
JPS58143396A (ja) * | 1982-02-19 | 1983-08-25 | 日本電気株式会社 | 音声認識装置 |
US5054085A (en) * | 1983-05-18 | 1991-10-01 | Speech Systems, Inc. | Preprocessing system for speech recognition |
US4817158A (en) * | 1984-10-19 | 1989-03-28 | International Business Machines Corporation | Normalization of speech signals |
JP2709386B2 (ja) * | 1987-06-24 | 1998-02-04 | 株式会社 エイ・ティ・ア−ル自動翻訳電話研究所 | スペクトログラムの正規化方法 |
JPH02195400A (ja) * | 1989-01-24 | 1990-08-01 | Canon Inc | 音声認識装置 |
JP2979711B2 (ja) * | 1991-04-24 | 1999-11-15 | 日本電気株式会社 | パターン認識方式および標準パターン学習方式 |
US5263097A (en) * | 1991-07-24 | 1993-11-16 | Texas Instruments Incorporated | Parameter normalized features for classification procedures, systems and methods |
US5586215A (en) * | 1992-05-26 | 1996-12-17 | Ricoh Corporation | Neural network acoustic and visual speech recognition system |
US5502774A (en) * | 1992-06-09 | 1996-03-26 | International Business Machines Corporation | Automatic recognition of a consistent message using multiple complimentary sources of information |
JPH064093A (ja) * | 1992-06-18 | 1994-01-14 | Matsushita Electric Ind Co Ltd | Hmm作成装置、hmm記憶装置、尤度計算装置及び、認識装置 |
US5515475A (en) * | 1993-06-24 | 1996-05-07 | Northern Telecom Limited | Speech recognition method using a two-pass search |
US5692100A (en) * | 1994-02-02 | 1997-11-25 | Matsushita Electric Industrial Co., Ltd. | Vector quantizer |
JP2775140B2 (ja) * | 1994-03-18 | 1998-07-16 | 株式会社エイ・ティ・アール人間情報通信研究所 | パターン認識方法、音声認識方法および音声認識装置 |
JP3533696B2 (ja) * | 1994-03-22 | 2004-05-31 | 三菱電機株式会社 | 音声認識の境界推定方法及び音声認識装置 |
US6471420B1 (en) * | 1994-05-13 | 2002-10-29 | Matsushita Electric Industrial Co., Ltd. | Voice selection apparatus voice response apparatus, and game apparatus using word tables from which selected words are output as voice selections |
CN1159704C (zh) * | 1994-06-13 | 2004-07-28 | 松下电器产业株式会社 | 信号分析装置 |
JPH08123462A (ja) * | 1994-10-27 | 1996-05-17 | Sony Corp | 音声認識装置 |
JPH08211897A (ja) | 1995-02-07 | 1996-08-20 | Toyota Motor Corp | 音声認識装置 |
JP3627299B2 (ja) * | 1995-07-19 | 2005-03-09 | ソニー株式会社 | 音声認識方法及び装置 |
JPH0981183A (ja) * | 1995-09-14 | 1997-03-28 | Pioneer Electron Corp | 音声モデルの作成方法およびこれを用いた音声認識装置 |
US5729694A (en) * | 1996-02-06 | 1998-03-17 | The Regents Of The University Of California | Speech coding, reconstruction and recognition using acoustics and electromagnetic waves |
US6006175A (en) * | 1996-02-06 | 1999-12-21 | The Regents Of The University Of California | Methods and apparatus for non-acoustic speech characterization and recognition |
JP3702978B2 (ja) * | 1996-12-26 | 2005-10-05 | ソニー株式会社 | 認識装置および認識方法、並びに学習装置および学習方法 |
JPH10288038A (ja) * | 1997-04-15 | 1998-10-27 | Nissan Motor Co Ltd | 直接噴射式ディーゼルエンジン |
KR20000001476U (ko) * | 1998-06-20 | 2000-01-25 | 조병호 | 특정문장 화자인식에 의한 도어록 장치 고안 |
US6185529B1 (en) * | 1998-09-14 | 2001-02-06 | International Business Machines Corporation | Speech recognition aided by lateral profile image |
EP1039446B1 (de) * | 1998-10-09 | 2010-12-08 | Sony Corporation | Lernvorrichtung und -verfahren, erkennungsvorrichtung und verfahren, und aufnahme-medium |
JP4345156B2 (ja) * | 1998-10-09 | 2009-10-14 | ソニー株式会社 | 学習装置および学習方法、認識装置および認識方法、並びに記録媒体 |
-
1999
- 1999-10-12 EP EP99970495A patent/EP1039446B1/de not_active Expired - Lifetime
- 1999-10-12 DE DE69941499T patent/DE69941499D1/de not_active Expired - Lifetime
- 1999-10-12 DE DE69941999T patent/DE69941999D1/de not_active Expired - Lifetime
- 1999-10-12 EP EP07116722A patent/EP1863014B1/de not_active Expired - Lifetime
- 1999-10-12 KR KR1020007006263A patent/KR100729316B1/ko not_active IP Right Cessation
- 1999-10-12 EP EP09151870A patent/EP2056290B1/de not_active Expired - Lifetime
- 1999-10-12 DE DE69943018T patent/DE69943018D1/de not_active Expired - Lifetime
- 1999-10-12 EP EP07117038A patent/EP1863013B1/de not_active Expired - Lifetime
- 1999-10-12 WO PCT/JP1999/005619 patent/WO2000022607A1/ja not_active Application Discontinuation
-
2000
- 2000-05-31 US US09/584,260 patent/US6449591B1/en not_active Expired - Fee Related
-
2002
- 2002-06-10 US US10/167,104 patent/US7072829B2/en not_active Expired - Fee Related
-
2004
- 2004-12-09 US US11/009,337 patent/US20050096902A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
EP1863013A2 (de) | 2007-12-05 |
EP1863014A3 (de) | 2008-08-06 |
DE69941999D1 (de) | 2010-03-25 |
EP1039446A1 (de) | 2000-09-27 |
EP1863013A3 (de) | 2008-08-06 |
EP1863014A2 (de) | 2007-12-05 |
EP1039446B1 (de) | 2010-12-08 |
EP2056290B1 (de) | 2010-02-03 |
US20050096902A1 (en) | 2005-05-05 |
US20020184011A1 (en) | 2002-12-05 |
EP1039446A4 (de) | 2005-07-20 |
US6449591B1 (en) | 2002-09-10 |
WO2000022607A1 (fr) | 2000-04-20 |
KR100729316B1 (ko) | 2007-06-19 |
US7072829B2 (en) | 2006-07-04 |
KR20010032920A (ko) | 2001-04-25 |
EP1863014B1 (de) | 2009-09-30 |
EP2056290A1 (de) | 2009-05-06 |
DE69943018D1 (de) | 2011-01-20 |
EP1863013B1 (de) | 2013-01-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69510055D1 (de) | Verfahren und gerät zum anbringen eines metallfundaments | |
DE69606784T2 (de) | Verfahren und vorrichtung zum positionieren eines magnetoresistiven kopfes | |
DE59712087D1 (de) | Verfahren und vorrichtung zum ansteuern eines kapazitiven stellgliedes | |
DE69812696D1 (de) | Verfahren und vorrichtung zum einstellen eines oder mehrerer projektoren | |
DE69831642D1 (de) | Verfahren und Gerät zur Simulation eines rollenden Reifens | |
DE59800828D1 (de) | Vorrichtung und Verfahren zum Ziehen eines Einkristalls | |
DE60017228D1 (de) | Verfahren zum Identifizieren eines Trainierenden | |
DE69941499D1 (de) | Vorrichtungen und Verfahren zum Lernen und Anwenden eines Abstand-Transition-Modelles | |
DE69925844D1 (de) | Verfahren und vorichtung zum falten eines gassacks | |
DE59710480D1 (de) | Verfahren und vorrichtung zum ansteuern eines kapazitiven stellgliedes | |
DE69501037T2 (de) | Verfahren und Vorrichtung zum Falten eines Etiketts | |
DE69607003T2 (de) | Verfahren und vorrichtung zum steuern eines beweglichen geräts | |
ATE482737T1 (de) | Vorrichtung und verfahren zum positionieren und manipulieren eines gerätes | |
DE69601123T2 (de) | Vorrichtung und Verfahren zum Anbringen eines rohrförmigen Elements um einen Gegenstand | |
DE50015915D1 (de) | Verfahren und Vorrichtung zum Steuern eines Roboters | |
DE69923876D1 (de) | Verfahren und Vorrichtung zum Anbringen eines selbsthebenden Bandes | |
DE69931728D1 (de) | Verfahren und Apparat zum automatischen Aufnehmen einer Wellenform | |
DE69609113D1 (de) | Vorrichtung und Verfahren zum Steuern eines Schreibkopfes | |
DE10085273T1 (de) | Verfahren und Einrichtung zum Konstruieren eines Vorab-eingeplante Befehle-Cache | |
DE59807532D1 (de) | Vorrichtung und Verfahren zum Erstellen eines Einzelpositionbezugwertes in einem Druckprozess | |
DE59807939D1 (de) | Verfahren und Vorrichtung zum Auswerfen eines Gutes | |
DE69932776D1 (de) | Verfahren und Gerät zum Aufzeichnen und Wiedergeben eines übertragenen Programmbeitrags | |
DE59903273D1 (de) | Verfahren zum herstellen eines bauelementes und bauelement | |
DE69701072T2 (de) | Vorrichtung und Verfahren zum Stabilisieren eines Bohrloches | |
DE19781967T1 (de) | Verfahren und Vorrichtung zum Ziehen eines Einkristalls |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition |