JP7034265B2 - 画像分類タスクの機械学習を実施するためのコンピュータ実装方法、コンピュータ・プログラム製品、およびコンピュータ処理システム、ならびに自動車のための先進運転者支援システム - Google Patents
画像分類タスクの機械学習を実施するためのコンピュータ実装方法、コンピュータ・プログラム製品、およびコンピュータ処理システム、ならびに自動車のための先進運転者支援システム Download PDFInfo
- Publication number
- JP7034265B2 JP7034265B2 JP2020514281A JP2020514281A JP7034265B2 JP 7034265 B2 JP7034265 B2 JP 7034265B2 JP 2020514281 A JP2020514281 A JP 2020514281A JP 2020514281 A JP2020514281 A JP 2020514281A JP 7034265 B2 JP7034265 B2 JP 7034265B2
- Authority
- JP
- Japan
- Prior art keywords
- image
- machine learning
- computer
- training
- label
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/74—Image or video pattern matching; Proximity measures in feature spaces
- G06V10/75—Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
- G06F18/2155—Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
- G06F18/24155—Bayesian classification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/774—Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
- G06V10/7753—Incorporation of unlabelled data, e.g. multiple instance learning [MIL]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/29—Graphical models, e.g. Bayesian networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20076—Probabilistic image processing
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computational Linguistics (AREA)
- Databases & Information Systems (AREA)
- Mathematical Physics (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Multimedia (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Medical Informatics (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Probability & Statistics with Applications (AREA)
- Image Analysis (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US15/711,756 | 2017-09-21 | ||
| US15/711,756 US10496902B2 (en) | 2017-09-21 | 2017-09-21 | Data augmentation for image classification tasks |
| US15/843,687 US10614346B2 (en) | 2017-09-21 | 2017-12-15 | Data augmentation for image classification tasks |
| US15/843,687 | 2017-12-15 | ||
| PCT/IB2018/057257 WO2019058300A1 (en) | 2017-09-21 | 2018-09-20 | INCREASING DATA FOR IMAGE CLASSIFICATION TASKS |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2020534594A JP2020534594A (ja) | 2020-11-26 |
| JP2020534594A5 JP2020534594A5 (https=) | 2021-01-14 |
| JP7034265B2 true JP7034265B2 (ja) | 2022-03-11 |
Family
ID=65719394
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2020514281A Active JP7034265B2 (ja) | 2017-09-21 | 2018-09-20 | 画像分類タスクの機械学習を実施するためのコンピュータ実装方法、コンピュータ・プログラム製品、およびコンピュータ処理システム、ならびに自動車のための先進運転者支援システム |
Country Status (5)
| Country | Link |
|---|---|
| US (4) | US10496902B2 (https=) |
| JP (1) | JP7034265B2 (https=) |
| CN (1) | CN111033523B (https=) |
| GB (1) | GB2580002B (https=) |
| WO (1) | WO2019058300A1 (https=) |
Families Citing this family (26)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10496902B2 (en) * | 2017-09-21 | 2019-12-03 | International Business Machines Corporation | Data augmentation for image classification tasks |
| US10740400B2 (en) * | 2018-08-28 | 2020-08-11 | Google Llc | Image analysis for results of textual image queries |
| US11544500B2 (en) * | 2019-02-12 | 2023-01-03 | International Business Machines Corporation | Data augmentation for image classification tasks |
| CN109978071A (zh) * | 2019-04-03 | 2019-07-05 | 西北工业大学 | 基于数据增广和分类器融合的高光谱图像分类方法 |
| CN111798000A (zh) * | 2019-04-09 | 2020-10-20 | Oppo广东移动通信有限公司 | 数据优化方法、装置、存储介质及电子设备 |
| EP3731155B1 (en) * | 2019-04-25 | 2025-09-17 | ABB Schweiz AG | Apparatus and method for drive selection using machine learning |
| US11520521B2 (en) * | 2019-06-20 | 2022-12-06 | Western Digital Technologies, Inc. | Storage controller having data augmentation components for use with non-volatile memory die |
| WO2021006622A1 (en) | 2019-07-09 | 2021-01-14 | Samsung Electronics Co., Ltd. | Electronic apparatus and controlling method thereof |
| CN110796176B (zh) * | 2019-10-09 | 2022-12-16 | 武汉大学 | 一种基于像素对和加权投票的高分辨率影像分类方法和系统 |
| CN111275080B (zh) * | 2020-01-14 | 2021-01-08 | 腾讯科技(深圳)有限公司 | 基于人工智能的图像分类模型训练方法、分类方法及装置 |
| CN111507378A (zh) * | 2020-03-24 | 2020-08-07 | 华为技术有限公司 | 训练图像处理模型的方法和装置 |
| KR102528405B1 (ko) * | 2020-04-08 | 2023-05-02 | 에스케이텔레콤 주식회사 | 이미지를 분류하도록 훈련된 신경망을 이용하는 이미지 분류 장치 및 방법 |
| CA3166581A1 (en) * | 2020-05-22 | 2021-11-25 | Parisa Darvish Zadeh Varcheie | Method and system for training inspection equipment for automatic defect classification |
| CN111885280B (zh) * | 2020-07-17 | 2021-04-13 | 电子科技大学 | 一种混合卷积神经网络视频编码环路滤波方法 |
| US12159213B2 (en) | 2020-11-04 | 2024-12-03 | Intrinsic Innovation Llc | Source-agnostic image processing |
| US11170581B1 (en) * | 2020-11-12 | 2021-11-09 | Intrinsic Innovation Llc | Supervised domain adaptation |
| US20240005643A1 (en) * | 2020-12-09 | 2024-01-04 | Sony Group Corporation | Information processing apparatus, information processing method, computer program, imaging device, vehicle device, and medical robot device |
| CN114691912A (zh) | 2020-12-25 | 2022-07-01 | 日本电气株式会社 | 图像处理的方法、设备和计算机可读存储介质 |
| US12266098B2 (en) * | 2021-11-11 | 2025-04-01 | International Business Machines Corporation | Improving model performance by artificial blending of healthy tissue |
| CN116318481A (zh) * | 2021-12-20 | 2023-06-23 | 华为技术有限公司 | 一种通信方法及装置 |
| US20230205873A1 (en) * | 2021-12-29 | 2023-06-29 | Micron Technology, Inc. | Training procedure change determination to detect attack |
| CN116523760A (zh) * | 2022-01-21 | 2023-08-01 | 戴尔产品有限公司 | 数据增强的方法、设备和计算机程序产品 |
| JP7815035B2 (ja) * | 2022-06-01 | 2026-02-17 | 株式会社東芝 | 表現学習装置、方法及びプログラム |
| KR102898537B1 (ko) * | 2022-07-05 | 2025-12-10 | 광주과학기술원 | 미분류 데이터를 이용하여 신경망의 학습을 조기 종료하는 방법 |
| CN115618962B (zh) * | 2022-10-18 | 2023-05-23 | 支付宝(杭州)信息技术有限公司 | 一种模型训练的方法、业务风控的方法及装置 |
| US12586356B2 (en) | 2023-02-02 | 2026-03-24 | Ford Global Technologies, Llc | Artificial image generation with traffic signs |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2014123366A (ja) | 2012-12-21 | 2014-07-03 | Honda Motor Co Ltd | 歩行者姿勢分類に適用される3次元人間モデル |
| JP2019003565A (ja) | 2017-06-19 | 2019-01-10 | コニカミノルタ株式会社 | 画像処理装置、画像処理方法、及び画像処理プログラム |
Family Cites Families (24)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8416847B2 (en) * | 1998-12-21 | 2013-04-09 | Zin Stai Pte. In, Llc | Separate plane compression using plurality of compression methods including ZLN and ZLD methods |
| US7082211B2 (en) * | 2002-05-31 | 2006-07-25 | Eastman Kodak Company | Method and system for enhancing portrait images |
| CN101452575B (zh) * | 2008-12-12 | 2010-07-28 | 北京航空航天大学 | 一种基于神经网络的图像自适应增强方法 |
| US8873813B2 (en) * | 2012-09-17 | 2014-10-28 | Z Advanced Computing, Inc. | Application of Z-webs and Z-factors to analytics, search engine, learning, recognition, natural language, and other utilities |
| CN103310229B (zh) | 2013-06-15 | 2016-09-07 | 浙江大学 | 一种用于图像分类的多任务机器学习方法及其装置 |
| US9536293B2 (en) | 2014-07-30 | 2017-01-03 | Adobe Systems Incorporated | Image assessment using deep convolutional neural networks |
| CN104318266B (zh) * | 2014-10-19 | 2017-06-13 | 温州大学 | 一种图像智能分析处理预警方法 |
| GB2532075A (en) * | 2014-11-10 | 2016-05-11 | Lego As | System and method for toy recognition and detection based on convolutional neural networks |
| US20160140438A1 (en) | 2014-11-13 | 2016-05-19 | Nec Laboratories America, Inc. | Hyper-class Augmented and Regularized Deep Learning for Fine-grained Image Classification |
| EP3029606A3 (en) | 2014-11-14 | 2016-09-14 | Thomson Licensing | Method and apparatus for image classification with joint feature adaptation and classifier learning |
| US9501707B2 (en) * | 2015-04-16 | 2016-11-22 | Xerox Corporation | Method and system for bootstrapping an OCR engine for license plate recognition |
| CN106296623B (zh) * | 2015-06-10 | 2019-07-05 | 腾讯科技(深圳)有限公司 | 一种图片处理方法及装置 |
| US10176642B2 (en) * | 2015-07-17 | 2019-01-08 | Bao Tran | Systems and methods for computer assisted operation |
| US9684967B2 (en) | 2015-10-23 | 2017-06-20 | International Business Machines Corporation | Imaging segmentation using multi-scale machine learning approach |
| US10068171B2 (en) | 2015-11-12 | 2018-09-04 | Conduent Business Services, Llc | Multi-layer fusion in a convolutional neural network for image classification |
| US9721334B2 (en) * | 2015-12-03 | 2017-08-01 | International Business Machines Corporation | Work-piece defect inspection via optical images and CT images |
| US10373073B2 (en) | 2016-01-11 | 2019-08-06 | International Business Machines Corporation | Creating deep learning models using feature augmentation |
| US9864931B2 (en) * | 2016-04-13 | 2018-01-09 | Conduent Business Services, Llc | Target domain characterization for data augmentation |
| CN106169081B (zh) | 2016-06-29 | 2019-07-05 | 北京工业大学 | 一种基于不同照度的图像分类及处理方法 |
| US20180137391A1 (en) * | 2016-11-13 | 2018-05-17 | Imagry (Israel) Ltd. | System and method for training image classifier |
| CN106709907A (zh) * | 2016-12-08 | 2017-05-24 | 上海联影医疗科技有限公司 | Mr图像的处理方法及装置 |
| CN106934815A (zh) * | 2017-02-27 | 2017-07-07 | 南京理工大学 | 基于混合区域的活动轮廓模型图像分割方法 |
| CN107169939A (zh) * | 2017-05-31 | 2017-09-15 | 广东欧珀移动通信有限公司 | 图像处理方法及相关产品 |
| US10496902B2 (en) * | 2017-09-21 | 2019-12-03 | International Business Machines Corporation | Data augmentation for image classification tasks |
-
2017
- 2017-09-21 US US15/711,756 patent/US10496902B2/en active Active
- 2017-12-15 US US15/843,687 patent/US10614346B2/en active Active
-
2018
- 2018-09-20 CN CN201880054715.1A patent/CN111033523B/zh active Active
- 2018-09-20 GB GB2005186.8A patent/GB2580002B/en active Active
- 2018-09-20 JP JP2020514281A patent/JP7034265B2/ja active Active
- 2018-09-20 WO PCT/IB2018/057257 patent/WO2019058300A1/en not_active Ceased
-
2019
- 2019-11-12 US US16/681,373 patent/US11120309B2/en not_active Expired - Fee Related
-
2020
- 2020-02-04 US US16/781,411 patent/US11238317B2/en active Active
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2014123366A (ja) | 2012-12-21 | 2014-07-03 | Honda Motor Co Ltd | 歩行者姿勢分類に適用される3次元人間モデル |
| JP2019003565A (ja) | 2017-06-19 | 2019-01-10 | コニカミノルタ株式会社 | 画像処理装置、画像処理方法、及び画像処理プログラム |
Non-Patent Citations (1)
| Title |
|---|
| 酒向 慎司 他2名,3次元モデルとCNNを用いた指文字認識の検討,電子情報通信学会技術研究報告,日本,一般社団法人電子情報通信学会,2017年05月20日,Vol.117 No.66 |
Also Published As
| Publication number | Publication date |
|---|---|
| US11238317B2 (en) | 2022-02-01 |
| US10614346B2 (en) | 2020-04-07 |
| US11120309B2 (en) | 2021-09-14 |
| US20190087694A1 (en) | 2019-03-21 |
| US20200175343A1 (en) | 2020-06-04 |
| CN111033523B (zh) | 2023-12-29 |
| GB2580002B (en) | 2021-01-13 |
| US20200082229A1 (en) | 2020-03-12 |
| US20190087695A1 (en) | 2019-03-21 |
| CN111033523A (zh) | 2020-04-17 |
| WO2019058300A1 (en) | 2019-03-28 |
| GB2580002A (en) | 2020-07-08 |
| US10496902B2 (en) | 2019-12-03 |
| JP2020534594A (ja) | 2020-11-26 |
| GB202005186D0 (en) | 2020-05-20 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7034265B2 (ja) | 画像分類タスクの機械学習を実施するためのコンピュータ実装方法、コンピュータ・プログラム製品、およびコンピュータ処理システム、ならびに自動車のための先進運転者支援システム | |
| US11989237B2 (en) | Natural language interaction with automated machine learning systems | |
| US10503827B2 (en) | Supervised training for word embedding | |
| US10748102B2 (en) | Just in time learning driven by point of sale or other data and metrics | |
| US10915529B2 (en) | Selecting an optimal combination of systems for query processing | |
| US20220358358A1 (en) | Accelerating inference of neural network models via dynamic early exits | |
| US20230118271A1 (en) | Tuning a container orchestrator | |
| US10750022B2 (en) | Enhancing customer service processing using data analytics and cognitive computing | |
| US20180157554A1 (en) | Resolving conflicts between multiple software and hardware processes | |
| US10262266B2 (en) | Identifying and analyzing impact of an event on relationships | |
| US20200211536A1 (en) | Visually exploring and prioritizing discordant conversations with conversational bots | |
| US11150934B2 (en) | Region based processing and storage of data | |
| US11157554B2 (en) | Video response generation and modification | |
| US10902461B2 (en) | Environmental modification using tone model analysis | |
| US9892091B2 (en) | Computing intersection cardinality | |
| JP7816894B2 (ja) | エンドツーエンド音声認識のためのニューラルトランスデューサモデルにおける音響情報と言語情報との分離 | |
| US20200026963A1 (en) | Reducing computational costs of deep reinforcement learning by gated convolutional neural network | |
| US11188968B2 (en) | Component based review system | |
| US20230153318A1 (en) | Shape and data format conversion for accelerators | |
| US20230069628A1 (en) | External language model fusing method for speech recognition |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20201117 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20210222 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20220209 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20220222 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20220301 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 7034265 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |