CN119213447A - 训练程序、训练方法以及信息处理装置 - Google Patents

训练程序、训练方法以及信息处理装置 Download PDF

Info

Publication number
CN119213447A
CN119213447A CN202280095608.XA CN202280095608A CN119213447A CN 119213447 A CN119213447 A CN 119213447A CN 202280095608 A CN202280095608 A CN 202280095608A CN 119213447 A CN119213447 A CN 119213447A
Authority
CN
China
Prior art keywords
data
object feature
training
image data
objects
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202280095608.XA
Other languages
English (en)
Chinese (zh)
Inventor
長谷川创
广本正之
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Publication of CN119213447A publication Critical patent/CN119213447A/zh
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06V10/7753Incorporation of unlabelled data, e.g. multiple instance learning [MIL]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • G06N3/0455Auto-encoder networks; Encoder-decoder networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/0895Weakly supervised learning, e.g. semi-supervised or self-supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Multimedia (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)
CN202280095608.XA 2022-06-15 2022-06-15 训练程序、训练方法以及信息处理装置 Pending CN119213447A (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2022/024037 WO2023243020A1 (ja) 2022-06-15 2022-06-15 訓練プログラム,訓練方法及び情報処理装置

Publications (1)

Publication Number Publication Date
CN119213447A true CN119213447A (zh) 2024-12-27

Family

ID=89192495

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202280095608.XA Pending CN119213447A (zh) 2022-06-15 2022-06-15 训练程序、训练方法以及信息处理装置

Country Status (5)

Country Link
US (1) US20250029373A1 (https=)
EP (1) EP4542463A4 (https=)
JP (1) JP7794314B2 (https=)
CN (1) CN119213447A (https=)
WO (1) WO2023243020A1 (https=)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4788525B2 (ja) * 2006-08-30 2011-10-05 日本電気株式会社 物体識別パラメータ学習システム、物体識別パラメータ学習方法および物体識別パラメータ学習用プログラム
FI20185863A1 (fi) 2018-10-13 2020-04-14 Iprally Tech Oy Järjestelmä luonnollisen kielen dokumenttien hakemiseksi
US12475384B2 (en) 2020-11-09 2025-11-18 Adobe Inc. Self-supervised visual-relationship probing

Also Published As

Publication number Publication date
EP4542463A1 (en) 2025-04-23
EP4542463A4 (en) 2025-07-23
JP7794314B2 (ja) 2026-01-06
US20250029373A1 (en) 2025-01-23
JPWO2023243020A1 (https=) 2023-12-21
WO2023243020A1 (ja) 2023-12-21

Similar Documents

Publication Publication Date Title
Shehzad et al. Graph transformers: A survey
He et al. Diff-font: Diffusion model for robust one-shot font generation
Wang et al. A deep convolutional neural network for topology optimization with perceptible generalization ability
Yu et al. Neighborhood rough sets based multi-label classification for automatic image annotation
US11816185B1 (en) Multi-view image analysis using neural networks
US20160350930A1 (en) Joint Depth Estimation and Semantic Segmentation from a Single Image
Sun et al. PGCNet: patch graph convolutional network for point cloud segmentation of indoor scenes
Xiao et al. Semi-supervised semantic segmentation with cross teacher training
Han et al. A novel approach to pre-extracting support vectors based on the theory of belief functions
US20220270341A1 (en) Method and device of inputting annotation of object boundary information
Ferrante et al. Deformable registration through learning of context-specific metric aggregation
Huang et al. MuraNet: Multi-task floor plan recognition with relation attention
Belharbi et al. Deep neural networks regularization for structured output prediction
Zhu et al. DiffSwinTr: A diffusion model using 3D Swin Transformer for brain tumor segmentation
Behzadi et al. Taming connectedness in machine-learning-based topology optimization with connectivity graphs
Yu et al. Leverage cross-attention for end-to-end open-vocabulary panoptic reconstruction
Chen et al. Unsupervised domain adaptation of dynamic extension networks based on class decision boundaries
CN116415632B (zh) 用于神经网络预测域的局部可解释性的方法和系统
Li et al. SketchMLP: effectively utilize rasterized images and drawing sequences for sketch recognition
Ning et al. HSBNet: fusing semantics and anisotropic thermal diffusion fields for boundary-aware point cloud segmentation
Xie et al. Semantic image segmentation method with multiple adjacency trees and multiscale features
JPWO2017168601A1 (ja) 類似画像検索方法およびシステム
CN119213447A (zh) 训练程序、训练方法以及信息处理装置
CN114565752A (zh) 一种基于类不可知前景挖掘的图像弱监督目标检测方法
Saboori et al. Adversarial discriminative active Deep Learning for domain adaptation in hyperspectral images classification

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination