JP2023157746A5 - Google Patents

Publication number
JP2023157746A5
Authority
JP
Japan
Prior art keywords
agent
action
agents
inference
state
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2022067840A
Other languages
English (en)
Japanese (ja)
Other versions
JP2023157746A (ja)
Filing date
2022-04-15
Publication date
2025-03-10
Application filed
Priority to JP2022067840A (JP2023157746A)
Priority claimed from JP2022067840A (JP2023157746A)
Priority to US18/115,081 (US20230334406A1)
Publication of JP2023157746A
Publication of JP2023157746A5
Legal status: Pending

JP2022067840A 2022-04-15 2022-04-15 Inference apparatus, generation method, and generation program Pending JP2023157746A (ja)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2022067840A JP2023157746A (ja) 2022-04-15 2022-04-15 Inference apparatus, generation method, and generation program
US18/115,081 US20230334406A1 (en) 2022-04-15 2023-02-28 Inference apparatus, inference method, and recording medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2022067840A JP2023157746A (ja) 2022-04-15 2022-04-15 Inference apparatus, generation method, and generation program

Publications (2)

Publication Number Publication Date
JP2023157746A (ja) 2023-10-26
JP2023157746A5 2025-03-10

Family

ID=88308056

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022067840A Pending JP2023157746A (ja) Inference apparatus, generation method, and generation program 2022-04-15 2022-04-15

Country Status (2)

Country Link
US (1) US20230334406A1 (en)
JP (1) JP2023157746A (ja)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
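CN117407514B (zh) * 2023-11-28 2024-07-09 星环信息科技(上海)股份有限公司 Solution plan generation method, apparatus, device, and storage medium
CN120338122B (zh) * 2025-06-20 2025-11-11 厦门渊亭信息科技有限公司 Evaluation method and apparatus for large-model applications based on multi-agent collaboration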

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8874477B2 (en) * 2005-10-04 2014-10-28 Steven Mark Hoffberg Multifactorial optimization system and method
US20190339087A1 (en) * 2018-05-03 2019-11-07 Didi Research America, Llc Deep reinforcement learning for optimizing carpooling policies
WO2020086214A1 (en) * 2018-10-26 2020-04-30 Dow Global Technologies Llc Deep reinforcement learning for production scheduling
WO2021092263A1 (en) * 2019-11-05 2021-05-14 Strong Force Vcn Portfolio 2019, Llc Control tower and enterprise management platform for value chain networks
US11514363B2 (en) * 2019-12-06 2022-11-29 Microsoft Technology Licensing, Llc Using a recursive reinforcement model to determine an agent action
WO2021226925A1 (en) * 2020-05-14 2021-11-18 Beijing Didi Infinity Technology And Development Co., Ltd. Method and system for constructing virtual environment for ride-hailing platforms
CN112288341B (zh) * 2020-12-29 2021-04-13 青岛泛钛客科技有限公司 Credit factory order scheduling method and apparatus based on multi-agent reinforcement learning
US20220292434A1 (en) * 2021-03-09 2022-09-15 Microsoft Technology Licensing, Llc Resource planning for delivery of goods

Similar Documents

Publication Publication Date Title
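JP2023157746A5
CN115128960B (zh) Biped robot motion control method and system based on deep reinforcement learning
CN115509233B (zh) Robot path planning method and system based on a prioritized experience replay mechanism
CN111546349A (zh) New deep reinforcement learning method for humanoid robot gait planning
JPWO2020240720A5
WO2018227820A1 (zh) Method and apparatus for controlling robotic arm motion, storage medium, and terminal device
CN117973218B (zh) Reducer design method based on a multi-strategy improved dung beetle algorithm
CN113887708A (zh) Mean-field-based multi-agent learning method, storage medium, and electronic device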
Lu et al. Streaming variational probabilistic principal component analysis for monitoring of nonstationary process
JPWO2021005776A5 (ja) Object detection device, learning method, and program
CN111111200B (zh) Combat strategy generation method and apparatus
Acerbi et al. Predation and the phasing of sleep: an evolutionary individual-based model
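JPWO2022044335A5
JP2020151592A5
CN109031959B (zh) Cooperative control method and control system for non-identical nonlinear systems
CN116384594A (zh) Crop pest early-warning method, system, terminal, and medium based on big data analysis
JP7667054B2 (ja) Fluid balance management system, inference device, trained model generation device, trained model generation method, and computer program
CN110302539A (zh) Game strategy calculation method, apparatus, system, and readable storage medium
JP2021010739A5
CN109526701B (zh) Drip irrigation control method and apparatus
WO2023130800A1 (zh) Method, apparatus, device, and medium for generating a projection of a master-controlled object
CN115167106A (zh) Proportional-integral-derivative (PID) parameter tuning method based on a deaerator system
CN114842905B (zh) Method and apparatus for predicting equilibrium point stability, storage medium, and computer device
CN112948240A (zh) Game regression testing method, apparatus, device, and storage medium
JPWO2022064610A5 (ja) Object detection device, trained model generation method, and program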