JP7420236B2 - 学習装置、学習方法および学習プログラム - Google Patents

学習装置、学習方法および学習プログラム Download PDF

Info

Publication number
JP7420236B2
JP7420236B2 JP2022522086A JP2022522086A JP7420236B2 JP 7420236 B2 JP7420236 B2 JP 7420236B2 JP 2022522086 A JP2022522086 A JP 2022522086A JP 2022522086 A JP2022522086 A JP 2022522086A JP 7420236 B2 JP7420236 B2 JP 7420236B2
Authority
JP
Japan
Prior art keywords
target
learning
objective function
output
outputs
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2022522086A
Other languages
English (en)
Japanese (ja)
Other versions
JPWO2021229625A5 (https=
JPWO2021229625A1 (https=
Inventor
大 窪田
力 江藤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Publication of JPWO2021229625A1 publication Critical patent/JPWO2021229625A1/ja
Publication of JPWO2021229625A5 publication Critical patent/JPWO2021229625A5/ja
Application granted granted Critical
Publication of JP7420236B2 publication Critical patent/JP7420236B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/092Reinforcement learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Mathematical Physics (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Machine Translation (AREA)
JP2022522086A 2020-05-11 2020-05-11 学習装置、学習方法および学習プログラム Active JP7420236B2 (ja)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2020/018767 WO2021229625A1 (ja) 2020-05-11 2020-05-11 学習装置、学習方法および学習プログラム

Publications (3)

Publication Number Publication Date
JPWO2021229625A1 JPWO2021229625A1 (https=) 2021-11-18
JPWO2021229625A5 JPWO2021229625A5 (https=) 2023-01-24
JP7420236B2 true JP7420236B2 (ja) 2024-01-23

Family

ID=78525971

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022522086A Active JP7420236B2 (ja) 2020-05-11 2020-05-11 学習装置、学習方法および学習プログラム

Country Status (3)

Country Link
US (1) US20230281506A1 (https=)
JP (1) JP7420236B2 (https=)
WO (1) WO2021229625A1 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20250161753A1 (en) * 2022-03-30 2025-05-22 Nec Corporation Workout support apparatus, workout support method, training apparatus, and storage medium

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190390867A1 (en) 2019-07-03 2019-12-26 Lg Electronics Inc. Air conditioner and method for operating the air conditioner

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10902347B2 (en) * 2017-04-11 2021-01-26 International Business Machines Corporation Rule creation using MDP and inverse reinforcement learning
US11308401B2 (en) * 2018-01-31 2022-04-19 Royal Bank Of Canada Interactive reinforcement learning with dynamic reuse of prior knowledge
WO2020159692A1 (en) * 2019-01-28 2020-08-06 Mayo Foundation For Medical Education And Research Estimating latent reward functions from experiences
US20220390909A1 (en) * 2019-11-14 2022-12-08 Nec Corporation Learning device, learning method, and learning program
US20220318917A1 (en) * 2019-12-25 2022-10-06 Nec Corporation Intention feature value extraction device, learning device, method, and program
JP7464115B2 (ja) * 2020-05-11 2024-04-09 日本電気株式会社 学習装置、学習方法および学習プログラム

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190390867A1 (en) 2019-07-03 2019-12-26 Lg Electronics Inc. Air conditioner and method for operating the air conditioner

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
増山 岳人 ほか,逆強化学習による学習者の選好を考慮した報酬関数の推定,第32回日本ロボット学会学術講演会 ,2014年09月29日,1102~1104頁

Also Published As

Publication number Publication date
WO2021229625A1 (ja) 2021-11-18
JPWO2021229625A1 (https=) 2021-11-18
US20230281506A1 (en) 2023-09-07

Similar Documents

Publication Publication Date Title
Walsh et al. Exploring compact reinforcement-learning representations with linear regression
JP7529145B2 (ja) 学習装置、学習方法および学習プログラム
CN118154089B (zh) 基于供应链管理的智能仓储系统及方法
CN113287124A (zh) 用于搭乘订单派遣的系统和方法
CN115146764B (zh) 一种预测模型的训练方法、装置、电子设备及存储介质
CN119067273A (zh) 用于物流路线规划的方法及系统
CN119558385B (zh) 一种具有高泛化能力的分层联邦学习方法
CN118760100A (zh) 一种大规模模糊柔性作业车间调度方法及相关设备
JP7464115B2 (ja) 学習装置、学習方法および学習プログラム
Gaidar et al. Mathematical method for optimising the transport and logistics industry
Xu et al. Empty container repositioning problem using a reinforcement learning framework with multi-weight adaptive reward function
Workneh et al. Learning to schedule (L2S): Adaptive job shop scheduling using double deep Q network
JP7420236B2 (ja) 学習装置、学習方法および学習プログラム
Li et al. A reinforcement learning framework for efficient task allocation among AGVs in smart warehouse
Chen et al. Genetic programming with reinforcement learning trained transformer for real-world dynamic scheduling problems
Su et al. Multi-objective optimization for dynamic logistics scheduling based on hierarchical deep reinforcement learning
Huang et al. Deep reinforcement learning for solving car resequencing with selectivity banks in automotive assembly shops
Zabraoui et al. Ai-driven optimization of vehicle routing problems in supply chain: integrating graph neural networks and reinforcement learning
US20240037452A1 (en) Learning device, learning method, and learning program
CN114298870A (zh) 一种路径规划方法、装置、电子设备及计算机可读介质
Workneh et al. Deep q network method for dynamic job shop scheduling problem
WO2025223072A1 (zh) 排产计划生成方法、装置和计算机可读存储介质
Oh et al. Applying multi-agent reinforcement learning and graph neural networks to flexible job shop scheduling problem
Beier et al. Towards supervised learning of optimal replenishment policies
Hamada Neural Network Intelligence Model for Software Projects Cost Prediction

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20221026

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20221026

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20230919

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20231031

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20231212

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20231225

R151 Written notification of patent or utility model registration

Ref document number: 7420236

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R151