JP5351958B2 - Semantic event detection for digital content records
- Publication number
- JP5351958B2 (application number JP2011512451A)
- Authority
- JP
- Japan
- Prior art keywords
- event
- concept
- semantic
- image
- visual
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/5838—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using colour
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/10—Terrestrial scenes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/7847—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content
- G06F16/785—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content using colour or luminescence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/7847—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content
- G06F16/7854—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content using shape
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/7847—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content
- G06F16/7857—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content using texture
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
Description
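- Obtain the affinity matrix $A_{ij} = S(x_i, x_j)$ for $i \neq j$, with $A_{ii} = 0$.
- Define the diagonal matrix $D_{ii} = \sum_j A_{ij}$, and obtain $L = D^{-1/2} A D^{-1/2}$.
- Find the eigenvectors of $L$ corresponding to the $n$ largest eigenvalues, taken in descending order, and stack them as the columns of a matrix $U$.
- Obtain the matrix $V$ from $U$ by renormalizing the rows of $U$ to have unit length.
- Treat each row of $V$ as a point in $\mathbb{R}^n$ (the $i$-th row corresponding to the original $i$-th data point), and cluster all points into $n$ clusters via the K-means algorithm.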
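The spectral clustering steps listed above can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the patent's implementation: the function name `spectral_cluster`, the farthest-first initialization of K-means, and the small epsilon guards are assumptions introduced here, and the similarity matrix `S` is taken as given because the text above does not define the similarity function.

```python
import numpy as np

def spectral_cluster(S, n, n_iter=50):
    """Group points into n clusters from a symmetric pairwise similarity matrix S.

    Hypothetical helper: follows the listed steps, with a plain Lloyd-style
    K-means and farthest-first initialization (both are assumptions).
    """
    A = np.array(S, dtype=float)
    np.fill_diagonal(A, 0.0)                       # A_ii = 0
    d = A.sum(axis=1)                              # D_ii = sum_j A_ij
    d_inv_sqrt = 1.0 / np.sqrt(np.maximum(d, 1e-12))
    L = d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]   # D^{-1/2} A D^{-1/2}

    # eigh returns eigenvalues in ascending order, so the last n columns
    # are the eigenvectors of the n largest eigenvalues.
    _, eigvecs = np.linalg.eigh(L)
    U = eigvecs[:, -n:]

    # Renormalize each row of U to unit length to obtain V.
    V = U / np.maximum(np.linalg.norm(U, axis=1, keepdims=True), 1e-12)

    # K-means on the rows of V; centers start from farthest-first picks.
    centers = [V[0]]
    for _ in range(1, n):
        dist = np.min([((V - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(V[dist.argmax()])
    centers = np.array(centers)

    for _ in range(n_iter):
        dist = ((V[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        new_centers = np.array([
            V[labels == k].mean(axis=0) if np.any(labels == k) else centers[k]
            for k in range(n)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels

# Toy example: two groups of three records, similar within a group only.
S = np.full((6, 6), 0.01)
S[:3, :3] = 1.0
S[3:, 3:] = 1.0
labels = spectral_cluster(S, n=2)
```

In the toy example the first three records should share one label and the last three another. In the setting of the claims, each resulting cluster would correspond to one codeword of the codebook.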
Claims (2)
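- A method for facilitating semantic event classification of a group of image records related to an event, the method comprising:
extracting a plurality of visual features from each of the image records;
generating a plurality of concept scores for each of the image records using the visual features;
generating, based on the concept scores of the image records, a BOF feature vector for describing each event by mapping the concept scores of the image records of the event to a predetermined codebook corresponding to a semantic event; and
supplying the mapped feature vector to a semantic event classifier that generates a detection score indicating the probability that the semantic event appears in the event,
wherein each concept score corresponds to a visual concept and indicates the probability that the image record contains the visual concept.
- The method for facilitating semantic event classification of a group of image records related to an event according to claim 1, having a training process comprising:
determining pairwise similarities between pairs of the image records;
generating the codebook for each semantic event by applying spectral clustering to group, based on the determined pairwise similarities, the training image records of the semantic event into different clusters, each cluster corresponding to one codeword;
mapping the concept scores of the image records of the training events to the codebook corresponding to the semantic event to generate a BOF feature vector for describing each training event; and
training the event classifier based on the BOF feature vectors corresponding to the training events.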
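To make the mapping step of the claims more concrete, the following sketch builds a BOF feature vector from per-image concept scores and turns it into a detection score. It rests on several assumptions that are not stated in the claims: each image is assigned to its nearest codeword by Euclidean distance, the BOF vector is a normalized histogram of those assignments, and a linear model with a sigmoid stands in for the semantic event classifier. The names `bof_vector`, `detection_score`, `weights`, and `bias` are hypothetical.

```python
import numpy as np

def bof_vector(concept_scores, codebook):
    """Map the concept-score vectors of one event's images onto a codebook.

    concept_scores: (num_images, num_concepts), one row per image record.
    codebook:       (num_codewords, num_concepts), one row per codeword.
    Assumption: hard assignment to the nearest codeword (Euclidean distance);
    the event must contain at least one image.
    """
    scores = np.asarray(concept_scores, dtype=float)
    words = np.asarray(codebook, dtype=float)
    dist = ((scores[:, None, :] - words[None, :, :]) ** 2).sum(axis=2)
    nearest = dist.argmin(axis=1)
    hist = np.bincount(nearest, minlength=len(words)).astype(float)
    return hist / hist.sum()           # fraction of images per codeword

def detection_score(bof, weights, bias=0.0):
    """Stand-in classifier (assumption): linear score squashed to (0, 1)."""
    z = float(np.dot(weights, bof) + bias)
    return 1.0 / (1.0 + np.exp(-z))

# Toy event: three images scored against two visual concepts.
scores = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9]]
codebook = [[1.0, 0.0], [0.0, 1.0]]    # hypothetical two-codeword codebook
bof = bof_vector(scores, codebook)
score = detection_score(bof, weights=[2.0, -2.0])
```

Here two of the three images fall on the first codeword, so the histogram leans toward it and the stand-in classifier returns a score above 0.5. A trained classifier would replace the hand-picked weights.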
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US5820108P | 2008-06-02 | 2008-06-02 | |
US61/058,201 | 2008-06-02 | ||
US12/331,927 US8358856B2 (en) | 2008-06-02 | 2008-12-10 | Semantic event detection for digital content records |
US12/331,927 | 2008-12-10 | ||
PCT/US2009/003160 WO2009148518A2 (en) | 2008-06-02 | 2009-05-22 | Semantic event detection for digital content records |
Publications (3)
Publication Number | Publication Date |
---|---|
JP2011525012A (ja) | 2011-09-08 |
JP2011525012A5 (ja) | 2012-05-17 |
JP5351958B2 (ja) | 2013-11-27 |
Family ID: 41379891
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2011512451A, Expired - Fee Related, JP5351958B2 (ja) | 2008-06-02 | 2009-05-22 | Semantic event detection for digital content records |
Country Status (4)
Country | Link |
---|---|
US (1) | US8358856B2 (ja) |
EP (1) | EP2289021B1 (ja) |
JP (1) | JP5351958B2 (ja) |
WO (1) | WO2009148518A2 (ja) |
Families Citing this family (40)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20100052676A (ko) * | 2008-11-11 | 2010-05-20 | 삼성전자주식회사 | Apparatus and method for content albuming |
US8611677B2 (en) * | 2008-11-19 | 2013-12-17 | Intellectual Ventures Fund 83 Llc | Method for event-based semantic classification |
US8406460B2 (en) | 2010-04-27 | 2013-03-26 | Intellectual Ventures Fund 83 Llc | Automated template layout method |
US8406461B2 (en) | 2010-04-27 | 2013-03-26 | Intellectual Ventures Fund 83 Llc | Automated template layout system |
US8970720B2 (en) | 2010-07-26 | 2015-03-03 | Apple Inc. | Automatic digital camera photography mode selection |
JP5598159B2 (ja) * | 2010-08-23 | 2014-10-01 | 株式会社ニコン | Image processing device, imaging system, image processing method, and program |
US20130132377A1 (en) * | 2010-08-26 | 2013-05-23 | Zhe Lin | Systems and Methods for Localized Bag-of-Features Retrieval |
JP5649425B2 (ja) * | 2010-12-06 | 2015-01-07 | 株式会社東芝 | Video search device |
US8923607B1 (en) | 2010-12-08 | 2014-12-30 | Google Inc. | Learning sports highlights using event detection |
US8635197B2 (en) | 2011-02-28 | 2014-01-21 | International Business Machines Corporation | Systems and methods for efficient development of a rule-based system using crowd-sourcing |
US20120275714A1 (en) * | 2011-04-27 | 2012-11-01 | Yuli Gao | Determination of an image selection representative of a storyline |
US9055276B2 (en) | 2011-07-29 | 2015-06-09 | Apple Inc. | Camera having processing customized for identified persons |
US8983940B2 (en) | 2011-09-02 | 2015-03-17 | Adobe Systems Incorporated | K-nearest neighbor re-ranking |
US8634660B2 (en) * | 2011-09-07 | 2014-01-21 | Intellectual Ventures Fund 83 Llc | Event classification method using lit candle detection |
US8634661B2 (en) | 2011-09-07 | 2014-01-21 | Intellectual Ventures Fund 83 Llc | Event classification method using light source detection |
US20130058577A1 (en) * | 2011-09-07 | 2013-03-07 | Peter O. Stubler | Event classification method for related digital images |
US8805116B2 (en) | 2011-09-17 | 2014-08-12 | Adobe Systems Incorporated | Methods and apparatus for visual search |
EA201590485A1 (ru) * | 2012-09-05 | 2015-12-30 | Элемент, Инк. | System and method for biometric authentication using camera-equipped devices |
US8880563B2 (en) | 2012-09-21 | 2014-11-04 | Adobe Systems Incorporated | Image search by query object segmentation |
US9898685B2 (en) | 2014-04-29 | 2018-02-20 | At&T Intellectual Property I, L.P. | Method and apparatus for analyzing media content |
US9451335B2 (en) | 2014-04-29 | 2016-09-20 | At&T Intellectual Property I, Lp | Method and apparatus for augmenting media content |
US9913135B2 (en) | 2014-05-13 | 2018-03-06 | Element, Inc. | System and method for electronic key provisioning and access management in connection with mobile devices |
JP6415607B2 (ja) | 2014-06-03 | 2018-10-31 | エレメント,インク. | Attendance authentication and management in connection with mobile devices |
CN105335595A (zh) | 2014-06-30 | 2016-02-17 | 杜比实验室特许公司 | Perception-based multimedia processing |
CN104133917B (zh) * | 2014-08-15 | 2018-08-10 | 百度在线网络技术(北京)有限公司 | Method and device for classified storage of photos |
AU2014218444B2 (en) | 2014-08-29 | 2017-06-15 | Canon Kabushiki Kaisha | Dynamic feature selection for joint probabilistic recognition |
US10572735B2 (en) * | 2015-03-31 | 2020-02-25 | Beijing Shunyuan Kaihua Technology Limited | Detect sports video highlights for mobile computing devices |
CN104915685A (zh) * | 2015-07-02 | 2015-09-16 | 北京联合大学 | Image representation method based on multi-rectangle partitioning |
KR102225088B1 (ko) * | 2015-10-26 | 2021-03-08 | 에스케이텔레콤 주식회사 | Method and apparatus for generating tags based on context information |
US9961202B2 (en) * | 2015-12-31 | 2018-05-01 | Nice Ltd. | Automated call classification |
US11205103B2 (en) | 2016-12-09 | 2021-12-21 | The Research Foundation for the State University | Semisupervised autoencoder for sentiment analysis |
US20190019107A1 (en) * | 2017-07-12 | 2019-01-17 | Samsung Electronics Co., Ltd. | Method of machine learning by remote storage device and remote storage device employing method of machine learning |
MX2020002941A (es) | 2017-09-18 | 2022-05-31 | Element Inc | Methods, systems, and media for detecting spoofing in mobile authentication |
CN108090199B (zh) * | 2017-12-22 | 2020-02-21 | 浙江大学 | Method for semantic information extraction and visualization of large image sets |
EP3938953A4 (en) | 2019-03-12 | 2022-12-28 | Element, Inc. | FACIAL RECOGNITION SPOOFING DETECTION WITH MOBILE DEVICES |
US11586861B2 (en) | 2019-09-13 | 2023-02-21 | Toyota Research Institute, Inc. | Embeddings + SVM for teaching traversability |
CN110781963B (zh) * | 2019-10-28 | 2022-03-04 | 西安电子科技大学 | Aerial target grouping method based on K-means clustering |
US11507248B2 (en) | 2019-12-16 | 2022-11-22 | Element Inc. | Methods, systems, and media for anti-spoofing using eye-tracking |
CN111221984B (zh) * | 2020-01-15 | 2024-03-01 | 北京百度网讯科技有限公司 | Multimodal content processing method, apparatus, device, and storage medium |
US20230036109A1 (en) * | 2020-02-27 | 2023-02-02 | Panasonic Intellectual Property Management Co., Ltd. | Image processing device and image processing method |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6832006B2 (en) * | 2001-07-23 | 2004-12-14 | Eastman Kodak Company | System and method for controlling image compression based on image emphasis |
US7039239B2 (en) * | 2002-02-07 | 2006-05-02 | Eastman Kodak Company | Method for image region classification using unsupervised and supervised learning |
US20030233232A1 (en) * | 2002-06-12 | 2003-12-18 | Lucent Technologies Inc. | System and method for measuring domain independence of semantic classes |
US7383260B2 (en) * | 2004-08-03 | 2008-06-03 | International Business Machines Corporation | Method and apparatus for ontology-based classification of media content |
US7545978B2 (en) * | 2005-07-01 | 2009-06-09 | International Business Machines Corporation | Methods and apparatus for filtering video packets for large-scale video stream monitoring |
JP2007317077A (ja) * | 2006-05-29 | 2007-12-06 | Fujifilm Corp | Image classification apparatus, method, and program |
US8165406B2 (en) * | 2007-12-12 | 2012-04-24 | Microsoft Corp. | Interactive concept learning in image search |
- 2008-12-10: US, application US12/331,927, patent US8358856B2, not active (Expired - Fee Related)
- 2009-05-22: WO, application PCT/US2009/003160, patent WO2009148518A2, active (Application Filing)
- 2009-05-22: EP, application EP09758682A, patent EP2289021B1, not active (Not-in-force)
- 2009-05-22: JP, application JP2011512451A, patent JP5351958B2, not active (Expired - Fee Related)
Also Published As
Publication number | Publication date |
---|---|
US20090297032A1 (en) | 2009-12-03 |
EP2289021A2 (en) | 2011-03-02 |
JP2011525012A (ja) | 2011-09-08 |
WO2009148518A2 (en) | 2009-12-10 |
EP2289021B1 (en) | 2013-01-02 |
US8358856B2 (en) | 2013-01-22 |
WO2009148518A3 (en) | 2010-01-28 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
JP5351958B2 (ja) | Semantic event detection for digital content records | |
US8213725B2 (en) | Semantic event detection using cross-domain knowledge | |
Ali et al. | A novel image retrieval based on visual words integration of SIFT and SURF | |
US20230376527A1 (en) | Generating congruous metadata for multimedia | |
US9317781B2 (en) | Multiple cluster instance learning for image classification | |
US8548256B2 (en) | Method for fast scene matching | |
Quelhas et al. | A thousand words in a scene | |
Galleguillos et al. | Weakly supervised object localization with stable segmentations | |
US8837820B2 (en) | Image selection based on photographic style | |
US8533204B2 (en) | Text-based searching of image data | |
US20100226582A1 (en) | Assigning labels to images in a collection | |
JP2016134175A (ja) | ワイルドカードを用いてテキスト−画像クエリを実施するための方法およびシステム | |
Amores et al. | Context-based object-class recognition and retrieval by generalized correlograms | |
Abdullah et al. | Fixed partitioning and salient points with MPEG-7 cluster correlograms for image categorization | |
Demirkus et al. | Hierarchical temporal graphical model for head pose estimation and subsequent attribute classification in real-world videos | |
Jiang | Super: towards real-time event recognition in internet videos | |
Abraham et al. | Automatically classifying crime scene images using machine learning methodologies | |
Oussama et al. | A fast weighted multi-view Bayesian learning scheme with deep learning for text-based image retrieval from unlabeled galleries | |
Wu et al. | Discriminative two-level feature selection for realistic human action recognition | |
Jiang et al. | Semantic event detection for consumer photo and video collections | |
Tao | Visual concept detection and real time object detection | |
Borovikov et al. | Face matching for post-disaster family reunification | |
Chen et al. | An efficient framework for location-based scene matching in image databases | |
Jain | Enhanced image and video representation for visual recognition | |
Shiue et al. | Image retrieval using a scale-invariant feature transform bag-of-features model with salient object detection |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| A521 | Written amendment | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20120322 |
| A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621; Effective date: 20120322 |
| A521 | Written amendment | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20130311 |
| A711 | Notification of change in applicant | Free format text: JAPANESE INTERMEDIATE CODE: A711; Effective date: 20130403 |
| A977 | Report on retrieval | Free format text: JAPANESE INTERMEDIATE CODE: A971007; Effective date: 20130731 |
| TRDD | Decision of grant or rejection written | |
| A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01; Effective date: 20130806 |
| A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61; Effective date: 20130823 |
| R150 | Certificate of patent or registration of utility model | Free format text: JAPANESE INTERMEDIATE CODE: R150 |
| R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
| R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
| LAPS | Cancellation because of no payment of annual fees | |