MX2014007056A - Multi-agent reinforcement learning for integrated and networked adaptive traffic signal control
Info
- Publication number
- MX2014007056A
- Authority
- MX
- Mexico
- Prior art keywords
- integrated
- reinforcement learning
- agent
- signal control
- traffic signal
Classifications
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/07—Controlling traffic signals
- G08G1/081—Plural intersections under common control
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/07—Controlling traffic signals
- G08G1/081—Plural intersections under common control
- G08G1/083—Controlling the allocation of time between phases of a cycle
Landscapes
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
- Traffic Control Systems (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
A system and method of multi-agent reinforcement learning for integrated and networked adaptive traffic controllers (MARLIN-ATC). Agents linked to traffic signals generate control actions for an optimal control policy based on traffic conditions at their own intersection and at one or more other intersections. Each agent provides a control action by considering the control policy for its intersection and for one or more neighbouring intersections. Because of the cascading effect of the system, each agent implicitly considers the whole traffic environment, which results in a globally optimized control policy.
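As a rough illustration of the idea in the abstract, the following Python sketch shows one signal agent that learns over a joint state (its own and one neighbour's traffic condition) and weighs its actions by an estimate of the neighbour's policy. It is a minimal sketch under stated assumptions, not the patented method: the class name `IntersectionAgent`, the two-action set, the queue-level state encoding, the frequency-count neighbour model, and all parameter values are hypothetical choices made for this example.

```python
import random
from collections import defaultdict

ACTIONS = (0, 1)  # hypothetical action set: 0 = keep current phase, 1 = switch phase


class IntersectionAgent:
    """Tabular Q-learning agent that conditions on one neighbour's state and action."""

    def __init__(self, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        # Q-values indexed by (joint_state, own_action, neighbour_action).
        self.q = defaultdict(float)
        # Per-state counts of the neighbour's observed actions (start at 1 each).
        self.nbr_counts = defaultdict(lambda: [1] * len(ACTIONS))
        self.rng = random.Random(seed)

    def neighbour_policy(self, joint_state):
        """Estimate the neighbour's action probabilities from observed counts."""
        counts = self.nbr_counts[joint_state]
        total = sum(counts)
        return [c / total for c in counts]

    def expected_q(self, joint_state, own_a):
        """Average Q over the neighbour's estimated policy."""
        probs = self.neighbour_policy(joint_state)
        return sum(p * self.q[(joint_state, own_a, nbr_a)]
                   for nbr_a, p in zip(ACTIONS, probs))

    def act(self, joint_state):
        """Epsilon-greedy choice against the estimated neighbour policy."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(ACTIONS)
        return max(ACTIONS, key=lambda a: self.expected_q(joint_state, a))

    def update(self, joint_state, own_a, nbr_a, reward, next_joint_state):
        """Record the neighbour's action, then apply a one-step Q-learning update."""
        self.nbr_counts[joint_state][nbr_a] += 1
        best_next = max(self.expected_q(next_joint_state, a) for a in ACTIONS)
        key = (joint_state, own_a, nbr_a)
        self.q[key] += self.alpha * (reward + self.gamma * best_next - self.q[key])


# Usage: joint state = (own queue level, neighbour queue level), an assumed encoding.
agent = IntersectionAgent(epsilon=0.0)
state = (3, 1)
agent.update(state, own_a=1, nbr_a=0, reward=-2.0, next_joint_state=(2, 1))
chosen = agent.act(state)  # greedy action after one penalised "switch"
```

In a network, each intersection would run such an agent for each of its neighbours, and those neighbours would do the same with theirs, which is one way to picture the cascading effect the abstract describes. The reward here is an arbitrary negative number standing in for a delay penalty.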
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201161576637P | 2011-12-16 | 2011-12-16 | |
PCT/CA2012/050887 WO2013086629A1 (en) | 2011-12-16 | 2012-12-10 | Multi-agent reinforcement learning for integrated and networked adaptive traffic signal control |
Publications (2)
Publication Number | Publication Date |
---|---|
MX2014007056A (es) | 2015-03-06 |
MX344434B (es) | 2016-12-15 |
Family
ID=48611761
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
MX2014007056A MX344434B (es) | 2011-12-16 | 2012-12-10 | Multi-agent reinforcement learning for integrated and networked adaptive traffic signal control |
Country Status (4)
Country | Link |
---|---|
US (1) | US9818297B2 (es) |
CA (1) | CA2859049C (es) |
MX (1) | MX344434B (es) |
WO (1) | WO2013086629A1 (es) |
Families Citing this family (50)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9679258B2 (en) * | 2013-10-08 | 2017-06-13 | Google Inc. | Methods and apparatus for reinforcement learning |
US20150301510A1 (en) * | 2014-04-22 | 2015-10-22 | Siegmund Düll | Controlling a Target System |
US9483938B1 (en) | 2015-08-28 | 2016-11-01 | International Business Machines Corporation | Diagnostic system, method, and recording medium for signalized transportation networks |
US10839302B2 (en) | 2015-11-24 | 2020-11-17 | The Research Foundation For The State University Of New York | Approximate value iteration with complex returns by bounding |
US10719777B2 (en) | 2016-07-28 | 2020-07-21 | AT&T Intellectual Property I, L.P. | Optimization of multiple services via machine learning |
CN106412049A (zh) * | 2016-09-26 | 2017-02-15 | 北京东土科技股份有限公司 | Intelligent traffic cloud control system |
US20180165602A1 (en) | 2016-12-14 | 2018-06-14 | Microsoft Technology Licensing, Llc | Scalability of reinforcement learning by separation of concerns |
CN106846836B (zh) * | 2017-02-28 | 2019-05-24 | 许昌学院 | Method and system for controlling signal light timing at a single intersection |
US9972199B1 (en) * | 2017-03-08 | 2018-05-15 | Fujitsu Limited | Traffic signal control that incorporates non-motorized traffic information |
US10002530B1 (en) | 2017-03-08 | 2018-06-19 | Fujitsu Limited | Traffic signal control using multiple Q-learning categories |
CN106910351B (zh) * | 2017-04-19 | 2019-10-11 | 大连理工大学 | Adaptive traffic signal control method based on deep reinforcement learning |
US10872526B2 (en) * | 2017-09-19 | 2020-12-22 | Continental Automotive Systems, Inc. | Adaptive traffic control system and method for operating same |
EP3467718A1 (en) * | 2017-10-04 | 2019-04-10 | Prowler.io Limited | Machine learning system |
US11568236B2 (en) | 2018-01-25 | 2023-01-31 | The Research Foundation For The State University Of New York | Framework and methods of diverse exploration for fast and safe policy improvement |
CN110114806A (zh) * | 2018-02-28 | 2019-08-09 | 华为技术有限公司 | Signal light control method, related device, and system |
EP3782143B1 (en) * | 2018-04-20 | 2023-08-09 | The Governing Council of the University of Toronto | Method and system for multimodal deep traffic signal control |
US11610165B2 (en) * | 2018-05-09 | 2023-03-21 | Volvo Car Corporation | Method and system for orchestrating multi-party services using semi-cooperative nash equilibrium based on artificial intelligence, neural network models,reinforcement learning and finite-state automata |
US20190347933A1 (en) * | 2018-05-11 | 2019-11-14 | Virtual Traffic Lights, LLC | Method of implementing an intelligent traffic control apparatus having a reinforcement learning based partial traffic detection control system, and an intelligent traffic control apparatus implemented thereby |
CN110861634B (zh) * | 2018-08-14 | 2023-01-17 | 本田技研工业株式会社 | Interaction-aware decision making |
US11482106B2 (en) | 2018-09-04 | 2022-10-25 | Udayan Kanade | Adaptive traffic signal with adaptive countdown timers |
CN109785619B (zh) * | 2019-01-21 | 2021-06-22 | 南京邮电大学 | Regional traffic signal coordination and optimization control system and control method thereof |
US11416743B2 (en) | 2019-04-25 | 2022-08-16 | International Business Machines Corporation | Swarm fair deep reinforcement learning |
GB2583747B (en) * | 2019-05-08 | 2023-12-06 | Vivacity Labs Ltd | Traffic control system |
WO2020227959A1 (en) | 2019-05-15 | 2020-11-19 | Advanced New Technologies Co., Ltd. | Determining action selection policies of an execution device |
US11176368B2 (en) | 2019-06-13 | 2021-11-16 | International Business Machines Corporation | Visually focused first-person neural network interpretation |
US11217094B2 (en) | 2019-06-25 | 2022-01-04 | Board Of Regents, The University Of Texas System | Collaborative distributed agent-based traffic light system and method of use |
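CN110930734A (zh) * | 2019-11-30 | 2020-03-27 | 天津大学 | Intelligent control method for traffic lights during idle periods based on reinforcement learning |
CN111127910A (zh) * | 2019-12-18 | 2020-05-08 | 上海天壤智能科技有限公司 | Traffic signal adjustment method, system, and medium |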
US12026186B2 (en) | 2020-01-27 | 2024-07-02 | International Business Machines Corporation | Managing query systems for responding to queries based on attributes associated with a given query |
US11080602B1 (en) | 2020-06-27 | 2021-08-03 | Sas Institute Inc. | Universal attention-based reinforcement learning model for control systems |
US20220035640A1 (en) * | 2020-07-28 | 2022-02-03 | Electronic Arts Inc. | Trainable agent for traversing user interface |
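CN112133109A (zh) * | 2020-08-10 | 2020-12-25 | 北方工业大学 | Method for establishing a multi-direction space-occupancy balancing control model for a single intersection |
CN112215364B (zh) * | 2020-09-17 | 2023-11-17 | 天津(滨海)人工智能军民融合创新中心 | Enemy-friend deep deterministic policy method and system based on reinforcement learning |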
US11783702B2 (en) * | 2020-09-18 | 2023-10-10 | Huawei Cloud Computing Technologies Co., Ltd | Method and system for adaptive cycle-level traffic signal control |
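CN112099510B (zh) * | 2020-09-25 | 2022-10-18 | 东南大学 | Agent control method based on device-edge-cloud collaboration |
CN112233434A (zh) * | 2020-10-10 | 2021-01-15 | 扬州大学 | Agent-based coordinated traffic signal control system and method for urban intersections |
CN112488310A (zh) * | 2020-11-11 | 2021-03-12 | 厦门渊亭信息科技有限公司 | Method for automatically generating multi-agent group cooperation strategies |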
US11883746B2 (en) * | 2021-02-23 | 2024-01-30 | Electronic Arts Inc. | Adversarial reinforcement learning for procedural content generation and improved generalization |
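CN113077642B (zh) * | 2021-04-01 | 2022-06-21 | 武汉理工大学 | Traffic signal light control method, apparatus, and computer-readable storage medium |
CN113435112B (zh) * | 2021-06-10 | 2024-02-13 | 大连海事大学 | Traffic signal control method using neighbour-aware multi-agent reinforcement learning |
CN113763723B (zh) * | 2021-09-06 | 2023-01-17 | 武汉理工大学 | Traffic signal light control system and method based on reinforcement learning and dynamic timing |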
WO2023161947A1 (en) * | 2022-02-25 | 2023-08-31 | Telefonaktiebolaget Lm Ericsson (Publ) | Handling heterogeneous computation in multi-agent reinforcement learning |
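CN114973660B (zh) * | 2022-05-13 | 2023-10-24 | 黄河科技学院 | Traffic decision-making method using a model-linearization iterative update approach |
CN115083175B (zh) * | 2022-06-23 | 2023-11-03 | 北京百度网讯科技有限公司 | Signal management and control method based on vehicle-road cooperation, related apparatus, and program product |
CN115457781B (zh) * | 2022-09-13 | 2023-07-11 | 内蒙古工业大学 | Intelligent traffic signal light control method based on multi-agent deep reinforcement learning |
CN115457782B (zh) * | 2022-09-19 | 2023-11-03 | 吉林大学 | Conflict-free cooperation method for autonomous vehicles at intersections based on deep reinforcement learning |
CN115631638B (zh) * | 2022-12-07 | 2023-03-21 | 武汉理工大学三亚科教创新园 | Traffic light control method and system for a controlled area based on multi-agent reinforcement learning |
CN116129635B (zh) * | 2022-12-27 | 2023-11-21 | 重庆邮电大学 | Platoon-based intelligent scheduling method and system for an isolated unsignalized intersection |
CN117973538B (zh) * | 2024-01-30 | 2024-08-06 | 西南交通大学 | Energy management method for an interconnected traction power supply system based on a multi-substation game |
CN118053311A (zh) * | 2024-04-16 | 2024-05-17 | 联易云科(北京)科技有限公司 | Traffic signal control method and apparatus based on a multi-agent reinforcement learning model |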
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3662329A (en) | 1968-08-20 | 1972-05-09 | Gulf & Western Industries | Multi-phase traffic control system |
US3818429A (en) | 1971-07-28 | 1974-06-18 | Singer Co | Multi-intersection traffic control system |
US4323970A (en) | 1979-06-22 | 1982-04-06 | Siemens Aktiengesellschaft | Method and circuit arrangement for generating setting signals for signal generators of a traffic signal system, particularly a street traffic signal system |
US5357436A (en) | 1992-10-21 | 1994-10-18 | Rockwell International Corporation | Fuzzy logic traffic signal control system |
US5668717A (en) * | 1993-06-04 | 1997-09-16 | The Johns Hopkins University | Method and apparatus for model-free optimal signal timing for system-wide traffic control |
JP3399421B2 (ja) | 1999-11-05 | 2003-04-21 | 住友電気工業株式会社 | Traffic signal control device |
US6690292B1 (en) | 2000-06-06 | 2004-02-10 | Bellsouth Intellectual Property Corporation | Method and system for monitoring vehicular traffic using a wireless communications network |
US6617981B2 (en) | 2001-06-06 | 2003-09-09 | John Basinger | Traffic control method for multiple intersections |
US6985090B2 (en) | 2001-08-29 | 2006-01-10 | Siemens Aktiengesellschaft | Method and arrangement for controlling a system of multiple traffic signals |
JP3680815B2 (ja) | 2002-05-13 | 2005-08-10 | 住友電気工業株式会社 | Traffic signal control method |
US7688224B2 (en) | 2003-10-14 | 2010-03-30 | Siemens Industry, Inc. | Method and system for collecting traffic data, monitoring traffic, and automated enforcement at a centralized station |
US7590589B2 (en) * | 2004-09-10 | 2009-09-15 | Hoffberg Steven M | Game theoretic prioritization scheme for mobile ad hoc networks permitting hierarchal deference |
US20070273552A1 (en) | 2006-05-24 | 2007-11-29 | Bellsouth Intellectual Property Corporation | Control of traffic flow by sensing traffic states |
US20080204277A1 (en) | 2007-02-27 | 2008-08-28 | Roy Sumner | Adaptive traffic signal phase change system |
DE102008049568A1 (de) | 2008-09-30 | 2010-04-08 | Siemens Aktiengesellschaft | Method for optimizing traffic control at a traffic-signal-controlled node in a road traffic network |
US8040254B2 (en) | 2009-01-06 | 2011-10-18 | International Business Machines Corporation | Method and system for controlling and adjusting traffic light timing patterns |
GB0916204D0 (en) * | 2009-09-16 | 2009-10-28 | Road Safety Man Ltd | Traffic signal control system and method |
GB201009974D0 (en) * | 2010-06-15 | 2010-07-21 | Trinity College Dublin | Decentralised autonomic system and method for use in an urban traffic control environment |
US8554456B2 (en) * | 2011-07-05 | 2013-10-08 | International Business Machines Corporation | Intelligent traffic control mesh |
-
2012
- 2012-12-10 MX MX2014007056A patent/MX344434B/es active IP Right Grant
- 2012-12-10 US US14/364,998 patent/US9818297B2/en active Active
- 2012-12-10 WO PCT/CA2012/050887 patent/WO2013086629A1/en active Application Filing
- 2012-12-10 CA CA2859049A patent/CA2859049C/en active Active
Also Published As
Publication number | Publication date |
---|---|
WO2013086629A1 (en) | 2013-06-20 |
CA2859049A1 (en) | 2013-06-20 |
MX344434B (es) | 2016-12-15 |
US9818297B2 (en) | 2017-11-14 |
US20150102945A1 (en) | 2015-04-16 |
CA2859049C (en) | 2018-06-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MX344434B (es) | Multi-agent reinforcement learning for integrated and networked adaptive traffic signal control | |
WO2011133860A3 (en) | Systems and methods for providing haptic effects | |
EA201690256A1 (ru) | System and method for planning a site for vehicles | |
MX2018000620A (es) | Operator log analysis system | |
TW201612879A (en) | Display device | |
WO2013102932A3 (en) | System and method facilitating forecasting, optimization and visualization of energy data for industry | |
GB2457620A (en) | Event based process configuration | |
IN2014DN08342A (es) | ||
WO2013153441A8 (en) | Secure zone for digital communications | |
EA201001725A1 (ru) | Systems and methods for regulating the pace of powered systems travelling along a route | |
WO2011155961A3 (en) | Method for quantitative resilience estimation of industrial control systems | |
EP3044062A4 (en) | Method and system for adaptive cruise control and vehicle | |
WO2013033625A3 (en) | Systems and methods for switching a relay at zero cross | |
MY189296A (en) | Driving assistance apparatus | |
HK1187312A1 (zh) | Vehicle, vehicle smart key device, and vehicle remote-control driving system and method | |
IN2014CN00467A (es) | ||
SG195202A1 (en) | Method and device for acquiring distributed duration for traffic lights | |
WO2011051501A3 (de) | Training simulation system for drone systems | |
WO2012134447A3 (en) | Flight control laws for full envelope banked turns | |
EP3780003A4 (en) | PREDICTION SYSTEM, MODEL GENERATION SYSTEM, PROCEDURE AND PROGRAM | |
WO2011143610A3 (en) | Process and system for recovering phosphorus from wastewater | |
WO2013012780A3 (en) | Systems and method for a crossing equipment controller | |
EP3570740A4 (en) | APPARATUS, METHODS AND SYSTEMS FOR USING IMAGINED DIRECTION TO DEFINE ACTIONS, FUNCTIONS, OR EXECUTION | |
ATE549228T1 (de) | Vehicle detection system and method | |
TW200637221A (en) | A multiplexer and methods thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG | Grant or registration |