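JP7034265B2 - Computer-implemented method, computer program product, and computer processing system for performing machine learning for an image classification task, and advanced driver assistance system for a motor vehicle - Google Patents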

Info

Publication number
JP7034265B2
JP7034265B2 JP2020514281A JP2020514281A JP7034265B2 JP 7034265 B2 JP7034265 B2 JP 7034265B2 JP 2020514281 A JP2020514281 A JP 2020514281A JP 2020514281 A JP2020514281 A JP 2020514281A JP 7034265 B2 JP7034265 B2 JP 7034265B2
Authority
JP
Japan
Prior art keywords
image
machine learning
computer
training
label
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2020514281A
Other languages
English (en)
Japanese (ja)
Other versions
JP2020534594A5
JP2020534594A (ja)
Inventor
拓 井上
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2017-09-21
Filing date
2018-09-20
Publication date
2022-03-11
Application filed by International Business Machines Corp
Publication of JP2020534594A
Publication of JP2020534594A5
Application granted
Publication of JP7034265B2
Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74 Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75 Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06F18/2155 Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • G06F18/24155 Bayesian classification
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/0464 Convolutional networks [CNN, ConvNet]
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/09 Supervised learning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774 Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06V10/7753 Incorporation of unlabelled data, e.g. multiple instance learning [MIL]
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G06V20/41 Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/29 Graphical models, e.g. Bayesian networks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/044 Recurrent networks, e.g. Hopfield networks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20076 Probabilistic image processing
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20081 Training; Learning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Physics (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Multimedia (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Medical Informatics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)
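JP2020514281A 2017-09-21 2018-09-20 Computer-implemented method, computer program product, and computer processing system for performing machine learning for an image classification task, and advanced driver assistance system for a motor vehicle Active JP7034265B2 (ja)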

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US15/711,756 2017-09-21
US15/711,756 US10496902B2 (en) 2017-09-21 2017-09-21 Data augmentation for image classification tasks
US15/843,687 US10614346B2 (en) 2017-09-21 2017-12-15 Data augmentation for image classification tasks
US15/843,687 2017-12-15
PCT/IB2018/057257 WO2019058300A1 (en) 2017-09-21 2018-09-20 INCREASING DATA FOR IMAGE CLASSIFICATION TASKS

Publications (3)

Publication Number Publication Date
JP2020534594A (ja) 2020-11-26
JP2020534594A5 2021-01-14
JP7034265B2 (ja) 2022-03-11

Family

ID=65719394

Family Applications (1)

Application Number Title Priority Date Filing Date
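JP2020514281A Active JP7034265B2 (ja) 2017-09-21 2018-09-20 Computer-implemented method, computer program product, and computer processing system for performing machine learning for an image classification task, and advanced driver assistance system for a motor vehicle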

Country Status (5)

Country Link
US (4) US10496902B2
JP (1) JP7034265B2
CN (1) CN111033523B
GB (1) GB2580002B
WO (1) WO2019058300A1

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10496902B2 (en) * 2017-09-21 2019-12-03 International Business Machines Corporation Data augmentation for image classification tasks
US10740400B2 (en) * 2018-08-28 2020-08-11 Google Llc Image analysis for results of textual image queries
US11544500B2 (en) * 2019-02-12 2023-01-03 International Business Machines Corporation Data augmentation for image classification tasks
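CN109978071A (zh) * 2019-04-03 2019-07-05 西北工业大学 Hyperspectral image classification method based on data augmentation and classifier fusion
CN111798000A (zh) * 2019-04-09 2020-10-20 Oppo广东移动通信有限公司 Data optimization method and apparatus, storage medium, and electronic device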
EP3731155B1 (en) * 2019-04-25 2025-09-17 ABB Schweiz AG Apparatus and method for drive selection using machine learning
US11520521B2 (en) * 2019-06-20 2022-12-06 Western Digital Technologies, Inc. Storage controller having data augmentation components for use with non-volatile memory die
WO2021006622A1 (en) 2019-07-09 2021-01-14 Samsung Electronics Co., Ltd. Electronic apparatus and controlling method thereof
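CN110796176B (zh) * 2019-10-09 2022-12-16 武汉大学 High-resolution image classification method and system based on pixel pairs and weighted voting
CN111275080B (zh) * 2020-01-14 2021-01-08 腾讯科技(深圳)有限公司 Artificial-intelligence-based image classification model training method, classification method, and apparatus
CN111507378A (zh) * 2020-03-24 2020-08-07 华为技术有限公司 Method and apparatus for training an image processing model
KR102528405B1 (ko) * 2020-04-08 2023-05-02 에스케이텔레콤 주식회사 Image classification apparatus and method using a neural network trained to classify images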
CA3166581A1 (en) * 2020-05-22 2021-11-25 Parisa Darvish Zadeh Varcheie Method and system for training inspection equipment for automatic defect classification
CN111885280B (zh) * 2020-07-17 2021-04-13 电子科技大学 Hybrid convolutional neural network in-loop filtering method for video coding
US12159213B2 (en) 2020-11-04 2024-12-03 Intrinsic Innovation Llc Source-agnostic image processing
US11170581B1 (en) * 2020-11-12 2021-11-09 Intrinsic Innovation Llc Supervised domain adaptation
US20240005643A1 (en) * 2020-12-09 2024-01-04 Sony Group Corporation Information processing apparatus, information processing method, computer program, imaging device, vehicle device, and medical robot device
CN114691912A (zh) 2020-12-25 2022-07-01 日本电气株式会社 Image processing method, device, and computer-readable storage medium
US12266098B2 (en) * 2021-11-11 2025-04-01 International Business Machines Corporation Improving model performance by artificial blending of healthy tissue
CN116318481A (zh) * 2021-12-20 2023-06-23 华为技术有限公司 Communication method and apparatus
US20230205873A1 (en) * 2021-12-29 2023-06-29 Micron Technology, Inc. Training procedure change determination to detect attack
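CN116523760A (zh) * 2022-01-21 2023-08-01 戴尔产品有限公司 Data augmentation method, device, and computer program product
JP7815035B2 (ja) * 2022-06-01 2026-02-17 株式会社東芝 Representation learning apparatus, method, and program
KR102898537B1 (ko) * 2022-07-05 2025-12-10 광주과학기술원 Method for early stopping of neural network training using unclassified data
CN115618962B (zh) * 2022-10-18 2023-05-23 支付宝(杭州)信息技术有限公司 Model training method, and business risk control method and apparatus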
US12586356B2 (en) 2023-02-02 2026-03-24 Ford Global Technologies, Llc Artificial image generation with traffic signs

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
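JP2014123366A (ja) 2012-12-21 2014-07-03 Honda Motor Co Ltd Three-dimensional human model applied to pedestrian posture classification
JP2019003565A (ja) 2017-06-19 2019-01-10 コニカミノルタ株式会社 Image processing apparatus, image processing method, and image processing program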

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8416847B2 (en) * 1998-12-21 2013-04-09 Zin Stai Pte. In, Llc Separate plane compression using plurality of compression methods including ZLN and ZLD methods
US7082211B2 (en) * 2002-05-31 2006-07-25 Eastman Kodak Company Method and system for enhancing portrait images
CN101452575B (zh) * 2008-12-12 2010-07-28 北京航空航天大学 Neural-network-based adaptive image enhancement method
US8873813B2 (en) * 2012-09-17 2014-10-28 Z Advanced Computing, Inc. Application of Z-webs and Z-factors to analytics, search engine, learning, recognition, natural language, and other utilities
CN103310229B (zh) 2013-06-15 2016-09-07 浙江大学 Multi-task machine learning method and apparatus for image classification
US9536293B2 (en) 2014-07-30 2017-01-03 Adobe Systems Incorporated Image assessment using deep convolutional neural networks
CN104318266B (zh) * 2014-10-19 2017-06-13 温州大学 Intelligent image analysis, processing, and early-warning method
GB2532075A (en) * 2014-11-10 2016-05-11 Lego As System and method for toy recognition and detection based on convolutional neural networks
US20160140438A1 (en) 2014-11-13 2016-05-19 Nec Laboratories America, Inc. Hyper-class Augmented and Regularized Deep Learning for Fine-grained Image Classification
EP3029606A3 (en) 2014-11-14 2016-09-14 Thomson Licensing Method and apparatus for image classification with joint feature adaptation and classifier learning
US9501707B2 (en) * 2015-04-16 2016-11-22 Xerox Corporation Method and system for bootstrapping an OCR engine for license plate recognition
CN106296623B (zh) * 2015-06-10 2019-07-05 腾讯科技(深圳)有限公司 Picture processing method and apparatus
US10176642B2 (en) * 2015-07-17 2019-01-08 Bao Tran Systems and methods for computer assisted operation
US9684967B2 (en) 2015-10-23 2017-06-20 International Business Machines Corporation Imaging segmentation using multi-scale machine learning approach
US10068171B2 (en) 2015-11-12 2018-09-04 Conduent Business Services, Llc Multi-layer fusion in a convolutional neural network for image classification
US9721334B2 (en) * 2015-12-03 2017-08-01 International Business Machines Corporation Work-piece defect inspection via optical images and CT images
US10373073B2 (en) 2016-01-11 2019-08-06 International Business Machines Corporation Creating deep learning models using feature augmentation
US9864931B2 (en) * 2016-04-13 2018-01-09 Conduent Business Services, Llc Target domain characterization for data augmentation
CN106169081B (zh) 2016-06-29 2019-07-05 北京工业大学 Image classification and processing method based on different illuminance levels
US20180137391A1 (en) * 2016-11-13 2018-05-17 Imagry (Israel) Ltd. System and method for training image classifier
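CN106709907A (zh) * 2016-12-08 2017-05-24 上海联影医疗科技有限公司 MR image processing method and apparatus
CN106934815A (zh) * 2017-02-27 2017-07-07 南京理工大学 Active contour model image segmentation method based on hybrid regions
CN107169939A (zh) * 2017-05-31 2017-09-15 广东欧珀移动通信有限公司 Image processing method and related product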
US10496902B2 (en) * 2017-09-21 2019-12-03 International Business Machines Corporation Data augmentation for image classification tasks

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
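JP2014123366A (ja) 2012-12-21 2014-07-03 Honda Motor Co Ltd Three-dimensional human model applied to pedestrian posture classification
JP2019003565A (ja) 2017-06-19 2019-01-10 コニカミノルタ株式会社 Image processing apparatus, image processing method, and image processing program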

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
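酒向 慎司 and 2 others, A study of fingerspelling recognition using a 3D model and CNN, IEICE Technical Report, Japan, The Institute of Electronics, Information and Communication Engineers, May 20, 2017, Vol. 117, No. 66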

Also Published As

Publication number Publication date
US11238317B2 (en) 2022-02-01
US10614346B2 (en) 2020-04-07
US11120309B2 (en) 2021-09-14
US20190087694A1 (en) 2019-03-21
US20200175343A1 (en) 2020-06-04
CN111033523B (zh) 2023-12-29
GB2580002B (en) 2021-01-13
US20200082229A1 (en) 2020-03-12
US20190087695A1 (en) 2019-03-21
CN111033523A (zh) 2020-04-17
WO2019058300A1 (en) 2019-03-28
GB2580002A (en) 2020-07-08
US10496902B2 (en) 2019-12-03
JP2020534594A (ja) 2020-11-26
GB202005186D0 (en) 2020-05-20

Similar Documents

Publication Publication Date Title
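JP7034265B2 (ja) Computer-implemented method, computer program product, and computer processing system for performing machine learning for an image classification task, and advanced driver assistance system for a motor vehicle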
US11989237B2 (en) Natural language interaction with automated machine learning systems
US10503827B2 (en) Supervised training for word embedding
US10748102B2 (en) Just in time learning driven by point of sale or other data and metrics
US10915529B2 (en) Selecting an optimal combination of systems for query processing
US20220358358A1 (en) Accelerating inference of neural network models via dynamic early exits
US20230118271A1 (en) Tuning a container orchestrator
US10750022B2 (en) Enhancing customer service processing using data analytics and cognitive computing
US20180157554A1 (en) Resolving conflicts between multiple software and hardware processes
US10262266B2 (en) Identifying and analyzing impact of an event on relationships
US20200211536A1 (en) Visually exploring and prioritizing discordant conversations with conversational bots
US11150934B2 (en) Region based processing and storage of data
US11157554B2 (en) Video response generation and modification
US10902461B2 (en) Environmental modification using tone model analysis
US9892091B2 (en) Computing intersection cardinality
JP7816894B2 (ja) Separation of acoustic information and linguistic information in a neural transducer model for end-to-end speech recognition
US20200026963A1 (en) Reducing computational costs of deep reinforcement learning by gated convolutional neural network
US11188968B2 (en) Component based review system
US20230153318A1 (en) Shape and data format conversion for accelerators
US20230069628A1 (en) External language model fusing method for speech recognition

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20201117

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20210222

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20220209

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20220222

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20220301

R150 Certificate of patent or registration of utility model

Ref document number: 7034265

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150