CN121605423A - 学习装置、学习方法以及程序 - Google Patents

学习装置、学习方法以及程序

Info

Publication number
CN121605423A
CN121605423A CN202480042444.3A CN202480042444A CN121605423A CN 121605423 A CN121605423 A CN 121605423A CN 202480042444 A CN202480042444 A CN 202480042444A CN 121605423 A CN121605423 A CN 121605423A
Authority
CN
China
Prior art keywords
data
input
determined
error
correct answer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202480042444.3A
Other languages
English (en)
Chinese (zh)
Inventor
细见直希
翠辉久
田中骏平
杨巍
杉浦孔明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Keio School
Honda Motor Co Ltd
Original Assignee
Keio School
Honda Motor Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Keio School, Honda Motor Co Ltd filed Critical Keio School
Publication of CN121605423A publication Critical patent/CN121605423A/zh
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/56Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/56Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • G06V20/58Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)
CN202480042444.3A 2023-06-26 2024-05-16 学习装置、学习方法以及程序 Pending CN121605423A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US18/213,980 US12579820B2 (en) 2023-06-26 2023-06-26 Learning apparatus and learning method
US18/213,980 2023-06-26
PCT/JP2024/018221 WO2025004590A1 (ja) 2023-06-26 2024-05-16 学習装置、学習方法、及びプログラム

Publications (1)

Publication Number Publication Date
CN121605423A true CN121605423A (zh) 2026-03-03

Family

ID=93929730

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202480042444.3A Pending CN121605423A (zh) 2023-06-26 2024-05-16 学习装置、学习方法以及程序

Country Status (5)

Country Link
US (1) US12579820B2 (https=)
EP (1) EP4730218A1 (https=)
JP (1) JPWO2025004590A1 (https=)
CN (1) CN121605423A (https=)
WO (1) WO2025004590A1 (https=)

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5169273B2 (ja) 2008-02-12 2013-03-27 株式会社安川電機 移動ロボットの制御装置および移動ロボットシステム
JP2015149013A (ja) 2014-02-07 2015-08-20 トヨタ自動車株式会社 移動体の目標位置の設定方法
US10977501B2 (en) 2018-12-21 2021-04-13 Waymo Llc Object classification using extra-regional context
US11574142B2 (en) * 2020-07-30 2023-02-07 Adobe Inc. Semantic image manipulation using visual-semantic joint embeddings
US11468688B2 (en) * 2020-07-31 2022-10-11 Toyota Motor Engineering & Manufacturing North America, Inc. Vehicle sensor data sharing
US12394085B2 (en) * 2020-11-16 2025-08-19 Waymo Llc Long range distance estimation using reference objects
US11663294B2 (en) * 2021-03-16 2023-05-30 Toyota Research Institute, Inc. System and method for training a model using localized textual supervision
US12271792B2 (en) * 2021-05-26 2025-04-08 Salesforce, Inc. Systems and methods for vision-and-language representation learning
WO2023101679A1 (en) 2021-12-02 2023-06-08 Innopeak Technology, Inc. Text-image cross-modal retrieval based on virtual word expansion
US12254707B2 (en) * 2022-09-28 2025-03-18 Lemon Inc. Pre-training for scene text detection
US20240257536A1 (en) * 2023-01-30 2024-08-01 Argo AI, LLC System, Method, and Computer Program Product for Streaming Data Mining with Text-Image Joint Embeddings
CN116310920B (zh) 2023-03-20 2025-08-08 重庆邮电大学 一种基于场景上下文感知的图像隐私预测方法
US20250095393A1 (en) * 2023-09-20 2025-03-20 Adobe Inc. Text-augmented object centric relationship detection
US11978271B1 (en) * 2023-10-27 2024-05-07 Google Llc Instance level scene recognition with a vision language model

Also Published As

Publication number Publication date
JPWO2025004590A1 (https=) 2025-01-02
EP4730218A1 (en) 2026-04-22
WO2025004590A1 (ja) 2025-01-02
US20240428597A1 (en) 2024-12-26
US12579820B2 (en) 2026-03-17

Similar Documents

Publication Publication Date Title
CN113826119B (zh) 纯注意力的计算机视觉
CN116635877B (zh) 模型生成装置、推测装置、模型生成方法以及存储介质
JP2017059207A (ja) 画像認識方法
CN110268338B (zh) 使用视觉输入进行代理导航
JP2018156451A (ja) ネットワーク学習装置、ネットワーク学習システム、ネットワーク学習方法およびプログラム
CN115803587A (zh) 模型生成装置及方法、路径搜索装置以及模型生成程序
CN111598087A (zh) 不规则文字的识别方法、装置、计算机设备及存储介质
CN118228743B (zh) 一种基于文图注意力机制的多模态机器翻译方法及装置
CN118537834A (zh) 车辆感知信息获取方法、装置、设备及存储介质
CN118229593A (zh) 掩模版校正方法、装置、计算机设备及可读存储介质
Zhang et al. Distilling diffusion models to efficient 3d lidar scene completion
CN116363750A (zh) 人体姿态预测方法、装置、设备及可读存储介质
CN115856874A (zh) 毫米波雷达点云降噪方法、装置、设备及存储介质
JP2021068141A (ja) 領域分割装置、領域分割方法および領域分割プログラム
CN121605423A (zh) 学习装置、学习方法以及程序
CN118535565B (zh) 前端表单数据验证方法及装置
CN120043542A (zh) 基于知识蒸馏的矢量地图构建模型训练方法及装置
CN118297987B (zh) 目标跟踪方法、装置、设备及存储介质
CN117152223B (zh) 深度图像生成方法、系统、电子设备及可读存储介质
CN110610228A (zh) 信息处理装置、信息处理方法及记录介质
CN117523671A (zh) 一种基于深度学习的群体行为识别方法与系统
CN113592980B (zh) 招牌拓扑关系的构建方法、装置、电子设备和存储介质
US20250315971A1 (en) Learning apparatus, estimation apparatus, learning method, estimation method, and storage medium
CN113869186A (zh) 模型训练方法、装置、电子设备和计算机可读存储介质
CN120725178A (zh) 学习装置、推定装置、学习方法、推定方法、存储介质及程序产品

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination