JP7395767B2 - 情報処理装置、情報処理方法及び情報処理プログラム - Google Patents

情報処理装置、情報処理方法及び情報処理プログラム Download PDF

Info

Publication number
JP7395767B2
JP7395767B2 JP2022557886A JP2022557886A JP7395767B2 JP 7395767 B2 JP7395767 B2 JP 7395767B2 JP 2022557886 A JP2022557886 A JP 2022557886A JP 2022557886 A JP2022557886 A JP 2022557886A JP 7395767 B2 JP7395767 B2 JP 7395767B2
Authority
JP
Japan
Prior art keywords
heat map
input image
intermediate heat
machine learning
information processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2022557886A
Other languages
English (en)
Japanese (ja)
Other versions
JPWO2023053364A5 (https=
JPWO2023053364A1 (https=
Inventor
ヒヤ ロイ
満 中澤
ビヨン シュテンガー
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Rakuten Group Inc
Original Assignee
Rakuten Group Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Rakuten Group Inc filed Critical Rakuten Group Inc
Publication of JPWO2023053364A1 publication Critical patent/JPWO2023053364A1/ja
Publication of JPWO2023053364A5 publication Critical patent/JPWO2023053364A5/ja
Application granted granted Critical
Publication of JP7395767B2 publication Critical patent/JP7395767B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/00Two-dimensional [2D] image generation
    • G06T11/10Texturing; Colouring; Generation of textures or colours
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G06V10/443Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
    • G06V10/449Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
    • G06V10/451Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
    • G06V10/454Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20112Image segmentation details
    • G06T2207/20132Image cropping
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30196Human being; Person

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Computing Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Image Analysis (AREA)
JP2022557886A 2021-09-30 2021-09-30 情報処理装置、情報処理方法及び情報処理プログラム Active JP7395767B2 (ja)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2021/036195 WO2023053364A1 (ja) 2021-09-30 2021-09-30 情報処理装置、情報処理方法及び情報処理プログラム

Publications (3)

Publication Number Publication Date
JPWO2023053364A1 JPWO2023053364A1 (https=) 2023-04-06
JPWO2023053364A5 JPWO2023053364A5 (https=) 2023-09-06
JP7395767B2 true JP7395767B2 (ja) 2023-12-11

Family

ID=85782009

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022557886A Active JP7395767B2 (ja) 2021-09-30 2021-09-30 情報処理装置、情報処理方法及び情報処理プログラム

Country Status (4)

Country Link
US (1) US12597176B2 (https=)
EP (1) EP4184432A4 (https=)
JP (1) JP7395767B2 (https=)
WO (1) WO2023053364A1 (https=)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2025164632A1 (ja) * 2024-01-31 2025-08-07 京セラ株式会社 学習方法、学習装置、学習システム、制御プログラムおよび記録媒体

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7594075B1 (ja) 2023-11-17 2024-12-03 楽天グループ株式会社 画像生成装置、画像生成方法、および画像生成プログラム

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000075889A (ja) 1998-09-01 2000-03-14 Oki Electric Ind Co Ltd 音声認識システム及び音声認識方法
US20090208118A1 (en) 2008-02-19 2009-08-20 Xerox Corporation Context dependent intelligent thumbnail images
US20190057515A1 (en) 2017-08-15 2019-02-21 Siemens Healthcare Gmbh Internal Body Marker Prediction From Surface Data In Medical Imaging
JP2019032773A (ja) 2017-08-09 2019-02-28 キヤノン株式会社 画像処理装置、画像処理方法
JP2020516427A (ja) 2017-04-11 2020-06-11 ケイロン メディカル テクノロジーズ リミテッド 腫瘍進行のrecist評価
JP2020149641A (ja) 2019-03-15 2020-09-17 オムロン株式会社 物体追跡装置および物体追跡方法
JP2021081793A (ja) 2019-11-14 2021-05-27 キヤノン株式会社 情報処理装置、情報処理装置の制御方法およびプログラム
JP2021516646A (ja) 2019-02-28 2021-07-08 上海商▲湯▼▲臨▼港智能科技有限公司 車両のドアロック解除方法及び装置、システム、車両、電子機器並びに記憶媒体
JP2021103347A (ja) 2019-12-24 2021-07-15 キヤノン株式会社 情報処理装置、情報処理方法及びプログラム

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150170053A1 (en) * 2013-12-13 2015-06-18 Microsoft Corporation Personalized machine learning models
JP6960722B2 (ja) * 2016-05-27 2021-11-05 ヤフー株式会社 生成装置、生成方法、及び生成プログラム
US10909401B2 (en) 2018-05-29 2021-02-02 Sri International Attention-based explanations for artificial intelligence behavior
GB201812050D0 (en) * 2018-07-24 2018-09-05 Dysis Medical Ltd Computer classification of biological tissue
US12014530B2 (en) * 2018-12-21 2024-06-18 Hitachi High-Tech Corporation Image recognition device and method
CN111488475B (zh) * 2019-01-29 2025-08-19 北京三星通信技术研究有限公司 图像检索方法、装置、电子设备及计算机可读存储介质
JP6929322B2 (ja) * 2019-05-31 2021-09-01 楽天グループ株式会社 データ拡張システム、データ拡張方法、及びプログラム
JP2021005301A (ja) * 2019-06-27 2021-01-14 株式会社パスコ 建物抽出処理装置及びプログラム
US11532036B2 (en) * 2019-11-04 2022-12-20 Adobe Inc. Digital image ordering using object position and aesthetics
US12217501B2 (en) * 2019-12-24 2025-02-04 Nec Corporation Identification apparatus, object identification method, learning apparatus, learning method, and recording medium
US20220180528A1 (en) * 2020-02-10 2022-06-09 Nvidia Corporation Disentanglement of image attributes using a neural network
CN111611240B (zh) * 2020-04-17 2024-09-06 第四范式(北京)技术有限公司 执行自动机器学习过程的方法、装置及设备
CN111629212B (zh) 2020-04-30 2023-01-20 网宿科技股份有限公司 一种对视频进行转码的方法和装置
US11657230B2 (en) * 2020-06-12 2023-05-23 Adobe Inc. Referring image segmentation
US12004871B1 (en) * 2020-08-05 2024-06-11 Amazon Technologies, Inc. Personalized three-dimensional body models and body change journey
CN111709533B (zh) * 2020-08-19 2021-03-30 腾讯科技(深圳)有限公司 机器学习模型的分布式训练方法、装置以及计算机设备
US12008811B2 (en) * 2020-12-30 2024-06-11 Snap Inc. Machine learning-based selection of a representative video frame within a messaging application
US12406023B1 (en) 2021-01-04 2025-09-02 Nvidia Corporation Neural network training method
CN112802034B (zh) 2021-02-04 2024-04-12 精英数智科技股份有限公司 图像分割、识别方法、模型构建方法、装置及电子设备
US12175703B2 (en) * 2021-02-19 2024-12-24 Nvidia Corporation Single-stage category-level object pose estimation
US11636663B2 (en) * 2021-02-19 2023-04-25 Microsoft Technology Licensing, Llc Localizing relevant objects in multi-object images
US12437523B2 (en) * 2021-04-26 2025-10-07 Jidoka Technologies Private Limited Anomaly detection using a convolutional neural network and feature based memories
US12164556B2 (en) * 2021-06-01 2024-12-10 Google Llc Smart suggestions for image zoom regions
US20230069310A1 (en) * 2021-08-10 2023-03-02 Nvidia Corporation Object classification using one or more neural networks
US20230153374A1 (en) * 2021-11-16 2023-05-18 Nvidia Corporation High-precision matrix multiplication for neural networks
US12417602B2 (en) * 2023-02-27 2025-09-16 Nvidia Corporation Text-driven 3D object stylization using neural networks

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000075889A (ja) 1998-09-01 2000-03-14 Oki Electric Ind Co Ltd 音声認識システム及び音声認識方法
US20090208118A1 (en) 2008-02-19 2009-08-20 Xerox Corporation Context dependent intelligent thumbnail images
JP2020516427A (ja) 2017-04-11 2020-06-11 ケイロン メディカル テクノロジーズ リミテッド 腫瘍進行のrecist評価
JP2019032773A (ja) 2017-08-09 2019-02-28 キヤノン株式会社 画像処理装置、画像処理方法
US20190057515A1 (en) 2017-08-15 2019-02-21 Siemens Healthcare Gmbh Internal Body Marker Prediction From Surface Data In Medical Imaging
JP2021516646A (ja) 2019-02-28 2021-07-08 上海商▲湯▼▲臨▼港智能科技有限公司 車両のドアロック解除方法及び装置、システム、車両、電子機器並びに記憶媒体
JP2020149641A (ja) 2019-03-15 2020-09-17 オムロン株式会社 物体追跡装置および物体追跡方法
JP2021081793A (ja) 2019-11-14 2021-05-27 キヤノン株式会社 情報処理装置、情報処理装置の制御方法およびプログラム
JP2021103347A (ja) 2019-12-24 2021-07-15 キヤノン株式会社 情報処理装置、情報処理方法及びプログラム

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2025164632A1 (ja) * 2024-01-31 2025-08-07 京セラ株式会社 学習方法、学習装置、学習システム、制御プログラムおよび記録媒体

Also Published As

Publication number Publication date
EP4184432A1 (en) 2023-05-24
US20240362831A1 (en) 2024-10-31
WO2023053364A1 (ja) 2023-04-06
US12597176B2 (en) 2026-04-07
EP4184432A4 (en) 2023-10-11
JPWO2023053364A1 (https=) 2023-04-06

Similar Documents

Publication Publication Date Title
US9501724B1 (en) Font recognition and font similarity learning using a deep neural network
CN111274981B (zh) 目标检测网络构建方法及装置、目标检测方法
US20230153965A1 (en) Image processing method and related device
JP2023527615A (ja) 目標対象検出モデルのトレーニング方法、目標対象検出方法、機器、電子機器、記憶媒体及びコンピュータプログラム
CN110569839B (zh) 一种基于ctpn和crnn的银行卡号识别方法
Chen et al. Improved seam carving combining with 3D saliency for image retargeting
JP7395767B2 (ja) 情報処理装置、情報処理方法及び情報処理プログラム
CN110827373A (zh) 广告图片生成方法、装置以及存储介质
JP2022185144A (ja) 対象検出方法、対象検出モデルのレーニング方法および装置
WO2025016121A1 (zh) 图像处理方法、装置、电子设备和存储介质
CN114581657B (zh) 基于多尺度条形空洞卷积的图像语义分割方法、设备和介质
CN115294636A (zh) 一种基于自注意力机制的人脸聚类方法和装置
WO2023093851A1 (zh) 图像裁剪方法、装置及电子设备
CN110276818A (zh) 用于自动合成内容感知填充的交互式系统
KR20190117838A (ko) 객체 인식 시스템 및 그 방법
CN118228743B (zh) 一种基于文图注意力机制的多模态机器翻译方法及装置
CN107533760A (zh) 一种图像分割方法和装置
WO2023133285A1 (en) Anti-aliasing of object borders with alpha blending of multiple segmented 3d surfaces
CN109034070A (zh) 一种置换混叠图像盲分离方法及装置
JP7265686B1 (ja) 情報処理装置、情報処理方法及び情報処理プログラム
JP7265690B2 (ja) 情報処理装置、情報処理方法及び情報処理プログラム
Lee Automatic photomosaic algorithm through adaptive tiling and block matching
Bie et al. Intent-aware image cloning
Lee et al. CartoonModes: Cartoon stylization of video objects through modal analysis
JP5745370B2 (ja) 特定領域抽出装置及び特定領域抽出プログラム

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20220922

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20220922

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20220922

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20230110

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20230301

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20230523

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20230606

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20230815

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20230927

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20231121

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20231129

R150 Certificate of patent or registration of utility model

Ref document number: 7395767

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150