JPWO2025004590A1 - Google Patents

Info

Publication number
JPWO2025004590A1
Authority
JP
Japan
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2025529506A
Other languages
Japanese (ja)
Other versions
JPWO2025004590A5

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/50 Context or environment of the image
    • G06V20/56 Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/50 Context or environment of the image
    • G06V20/56 Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • G06V20/58 Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)
JP2025529506A 2023-06-26 2024-05-16 Pending JPWO2025004590A1

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US18/213,980 US12579820B2 (en) 2023-06-26 2023-06-26 Learning apparatus and learning method
PCT/JP2024/018221 WO2025004590A1 (ja) 2023-06-26 2024-05-16 Learning apparatus, learning method, and program

Publications (2)

Publication Number Publication Date
JPWO2025004590A1 2025-01-02
JPWO2025004590A5 2026-04-13

Family

ID=93929730

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2025529506A Pending JPWO2025004590A1 2023-06-26 2024-05-16

Country Status (5)

Country Link
US (1) US12579820B2
EP (1) EP4730218A1
JP (1) JPWO2025004590A1
CN (1) CN121605423A
WO (1) WO2025004590A1

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
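JP5169273B2 (ja) 2008-02-12 2013-03-27 Yaskawa Electric Corporation Control device for mobile robot and mobile robot system
JP2015149013A (ja) 2014-02-07 2015-08-20 Toyota Motor Corporation Method for setting target position of moving body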
US10977501B2 (en) 2018-12-21 2021-04-13 Waymo Llc Object classification using extra-regional context
US11574142B2 (en) * 2020-07-30 2023-02-07 Adobe Inc. Semantic image manipulation using visual-semantic joint embeddings
US11468688B2 (en) * 2020-07-31 2022-10-11 Toyota Motor Engineering & Manufacturing North America, Inc. Vehicle sensor data sharing
US12394085B2 (en) * 2020-11-16 2025-08-19 Waymo Llc Long range distance estimation using reference objects
US11663294B2 (en) * 2021-03-16 2023-05-30 Toyota Research Institute, Inc. System and method for training a model using localized textual supervision
US12271792B2 (en) * 2021-05-26 2025-04-08 Salesforce, Inc. Systems and methods for vision-and-language representation learning
WO2023101679A1 (en) 2021-12-02 2023-06-08 Innopeak Technology, Inc. Text-image cross-modal retrieval based on virtual word expansion
US12254707B2 (en) * 2022-09-28 2025-03-18 Lemon Inc. Pre-training for scene text detection
US20240257536A1 (en) * 2023-01-30 2024-08-01 Argo AI, LLC System, Method, and Computer Program Product for Streaming Data Mining with Text-Image Joint Embeddings
CN116310920B (zh) 2023-03-20 2025-08-08 Chongqing University of Posts and Telecommunications Image privacy prediction method based on scene context awareness
US20250095393A1 (en) * 2023-09-20 2025-03-20 Adobe Inc. Text-augmented object centric relationship detection
US11978271B1 (en) * 2023-10-27 2024-05-07 Google Llc Instance level scene recognition with a vision language model

Also Published As

Publication number Publication date
EP4730218A1 (en) 2026-04-22
WO2025004590A1 (ja) 2025-01-02
US20240428597A1 (en) 2024-12-26
CN121605423A (zh) 2026-03-03
US12579820B2 (en) 2026-03-17

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20251219

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20251219