MX2014007056A - Multi-agent reinforcement learning for integrated and networked adaptive traffic signal control

Info

Publication number
MX2014007056A
Authority
MX
Mexico
Prior art keywords
integrated
reinforcement learning
agent
signal control
traffic signal
Prior art date
Application number
MX2014007056A
Other languages
English (en)
Other versions
MX344434B (es)
Inventor
Samah El-Tantawy
Baher Abdulhai
Original Assignee
Pragmatek Transp Innovations Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pragmatek Transp Innovations Inc
Publication of MX2014007056A
Publication of MX344434B

Classifications

    • G PHYSICS
    • G08 SIGNALLING
    • G08G TRAFFIC CONTROL SYSTEMS
    • G08G1/00 Traffic control systems for road vehicles
    • G08G1/07 Controlling traffic signals
    • G08G1/081 Plural intersections under common control
    • G08G1/083 Controlling the allocation of time between phases of a cycle

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Traffic Control Systems (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

A multi-agent reinforcement learning system and method for integrated and networked adaptive traffic controllers (MARLIN-ATC). Agents linked to traffic signals generate control actions for an optimal control policy based on traffic conditions at the intersection and at one or more other intersections. Each agent provides a control action by considering the control policy for its own intersection and for one or more neighbouring intersections. Because of the cascading effect of the system, each agent implicitly considers the entire traffic environment, which results in a globally optimized control policy.
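The abstract does not specify a learning rule, so the following is only a minimal sketch of the idea it describes: an agent at one intersection chooses a signal action from a state that combines its own traffic conditions with those of a neighbouring intersection. Tabular Q-learning is an assumption made for illustration, and the class name `IntersectionAgent`, the action labels, the queue-length state encoding, and the reward value are all hypothetical rather than taken from the patent.

```python
import random
from collections import defaultdict


class IntersectionAgent:
    """Hypothetical agent for one signalized intersection (illustrative only)."""

    def __init__(self, actions, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.actions = list(actions)   # e.g. which phase receives green next
        self.alpha = alpha             # learning rate (assumed value)
        self.gamma = gamma             # discount factor (assumed value)
        self.epsilon = epsilon         # exploration probability (assumed value)
        self.rng = random.Random(seed)
        self.q = defaultdict(float)    # (joint_state, action) -> estimated value

    def choose(self, local_state, neighbour_state):
        """Pick an action from the joint (own, neighbour) traffic state."""
        state = (local_state, neighbour_state)
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)   # explore
        return max(self.actions, key=lambda a: self.q[(state, a)])  # exploit

    def update(self, state, action, reward, next_state):
        """Standard Q-learning update on the joint state."""
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        td_target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (td_target - self.q[(state, action)])


# Usage: queue lengths (north-south, east-west) at this and a neighbouring intersection.
agent = IntersectionAgent(actions=["NS_green", "EW_green"], epsilon=0.0)
state = ((3, 1), (2, 0))
next_state = ((1, 2), (2, 1))
agent.update(state, "NS_green", reward=-3.0, next_state=next_state)  # reward: negative delay
chosen = agent.choose(state[0], state[1])
```

Because the neighbour's conditions are part of each agent's state, and that neighbour in turn observes its own neighbours, information propagates across the network without any agent modelling the whole network explicitly. This is one simple way to read the cascading effect the abstract mentions; the patented method may differ in how neighbours' policies enter the decision.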

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201161576637P 2011-12-16 2011-12-16
PCT/CA2012/050887 WO2013086629A1 (en) 2011-12-16 2012-12-10 Multi-agent reinforcement learning for integrated and networked adaptive traffic signal control

Publications (2)

Publication Number Publication Date
MX2014007056A (es) 2015-03-06
MX344434B (es) 2016-12-15

Family

ID=48611761

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2014007056A MX344434B (es) 2011-12-16 2012-12-10 Multi-agent reinforcement learning for integrated and networked adaptive traffic signal control

Country Status (4)

Country Link
US (1) US9818297B2 (es)
CA (1) CA2859049C (es)
MX (1) MX344434B (es)
WO (1) WO2013086629A1 (es)

Families Citing this family (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9679258B2 (en) * 2013-10-08 2017-06-13 Google Inc. Methods and apparatus for reinforcement learning
US20150301510A1 (en) * 2014-04-22 2015-10-22 Siegmund Düll Controlling a Target System
US9483938B1 (en) 2015-08-28 2016-11-01 International Business Machines Corporation Diagnostic system, method, and recording medium for signalized transportation networks
US10839302B2 (en) 2015-11-24 2020-11-17 The Research Foundation For The State University Of New York Approximate value iteration with complex returns by bounding
US10719777B2 (en) 2016-07-28 2020-07-21 At&T Intellectual Propery I, L.P. Optimization of multiple services via machine learning
CN106412049A (zh) * 2016-09-26 2017-02-15 北京东土科技股份有限公司 Intelligent transportation cloud control system
US20180165602A1 (en) 2016-12-14 2018-06-14 Microsoft Technology Licensing, Llc Scalability of reinforcement learning by separation of concerns
CN106846836B (zh) * 2017-02-28 2019-05-24 许昌学院 Signal light timing control method and system for a single intersection
US9972199B1 (en) * 2017-03-08 2018-05-15 Fujitsu Limited Traffic signal control that incorporates non-motorized traffic information
US10002530B1 (en) 2017-03-08 2018-06-19 Fujitsu Limited Traffic signal control using multiple Q-learning categories
CN106910351B (zh) * 2017-04-19 2019-10-11 大连理工大学 Adaptive traffic signal control method based on deep reinforcement learning
US10872526B2 (en) * 2017-09-19 2020-12-22 Continental Automotive Systems, Inc. Adaptive traffic control system and method for operating same
EP3467718A1 (en) * 2017-10-04 2019-04-10 Prowler.io Limited Machine learning system
US11568236B2 (en) 2018-01-25 2023-01-31 The Research Foundation For The State University Of New York Framework and methods of diverse exploration for fast and safe policy improvement
CN110114806A (zh) * 2018-02-28 2019-08-09 华为技术有限公司 Signal light control method, related device, and system
EP3782143B1 (en) * 2018-04-20 2023-08-09 The Governing Council of the University of Toronto Method and system for multimodal deep traffic signal control
US11610165B2 (en) * 2018-05-09 2023-03-21 Volvo Car Corporation Method and system for orchestrating multi-party services using semi-cooperative nash equilibrium based on artificial intelligence, neural network models,reinforcement learning and finite-state automata
US20190347933A1 (en) * 2018-05-11 2019-11-14 Virtual Traffic Lights, LLC Method of implementing an intelligent traffic control apparatus having a reinforcement learning based partial traffic detection control system, and an intelligent traffic control apparatus implemented thereby
CN110861634B (zh) * 2018-08-14 2023-01-17 本田技研工业株式会社 Interaction-aware decision making
US11482106B2 (en) 2018-09-04 2022-10-25 Udayan Kanade Adaptive traffic signal with adaptive countdown timers
CN109785619B (zh) * 2019-01-21 2021-06-22 南京邮电大学 Regional traffic signal coordinated optimization control system and control method thereof
US11416743B2 (en) 2019-04-25 2022-08-16 International Business Machines Corporation Swarm fair deep reinforcement learning
GB2583747B (en) * 2019-05-08 2023-12-06 Vivacity Labs Ltd Traffic control system
WO2020227959A1 (en) 2019-05-15 2020-11-19 Advanced New Technologies Co., Ltd. Determining action selection policies of an execution device
US11176368B2 (en) 2019-06-13 2021-11-16 International Business Machines Corporation Visually focused first-person neural network interpretation
US11217094B2 (en) 2019-06-25 2022-01-04 Board Of Regents, The University Of Texas System Collaborative distributed agent-based traffic light system and method of use
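CN110930734A (zh) * 2019-11-30 2020-03-27 天津大学 Intelligent control method for traffic lights during idle periods based on reinforcement learning
CN111127910A (zh) * 2019-12-18 2020-05-08 上海天壤智能科技有限公司 Traffic signal adjustment method, system, and medium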
US12026186B2 (en) 2020-01-27 2024-07-02 International Business Machines Corporation Managing query systems for responding to queries based on attributes associated with a given query
US11080602B1 (en) 2020-06-27 2021-08-03 Sas Institute Inc. Universal attention-based reinforcement learning model for control systems
US20220035640A1 (en) * 2020-07-28 2022-02-03 Electronic Arts Inc. Trainable agent for traversing user interface
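CN112133109A (zh) * 2020-08-10 2020-12-25 北方工业大学 Method for establishing a balanced control model of multi-directional space occupancy at a single intersection
CN112215364B (zh) * 2020-09-17 2023-11-17 天津(滨海)人工智能军民融合创新中心 Enemy-friend deep deterministic policy method and system based on reinforcement learning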
US11783702B2 (en) * 2020-09-18 2023-10-10 Huawei Cloud Computing Technologies Co., Ltd Method and system for adaptive cycle-level traffic signal control
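CN112099510B (zh) * 2020-09-25 2022-10-18 东南大学 Agent control method based on device-edge-cloud collaboration
CN112233434A (zh) * 2020-10-10 2021-01-15 扬州大学 Agent-based coordinated control system and method for urban intersection traffic signals
CN112488310A (zh) * 2020-11-11 2021-03-12 厦门渊亭信息科技有限公司 Method for automatically generating multi-agent group cooperation strategies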
US11883746B2 (en) * 2021-02-23 2024-01-30 Electronic Arts Inc. Adversarial reinforcement learning for procedural content generation and improved generalization
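CN113077642B (zh) * 2021-04-01 2022-06-21 武汉理工大学 Traffic signal light control method, device, and computer-readable storage medium
CN113435112B (zh) * 2021-06-10 2024-02-13 大连海事大学 Traffic signal control method using neighbor-aware multi-agent reinforcement learning
CN113763723B (zh) * 2021-09-06 2023-01-17 武汉理工大学 Traffic signal light control system and method based on reinforcement learning and dynamic timing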
WO2023161947A1 (en) * 2022-02-25 2023-08-31 Telefonaktiebolaget Lm Ericsson (Publ) Handling heterogeneous computation in multi-agent reinforcement learning
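CN114973660B (zh) * 2022-05-13 2023-10-24 黄河科技学院 Traffic decision-making method using a model linearization iterative update approach
CN115083175B (zh) * 2022-06-23 2023-11-03 北京百度网讯科技有限公司 Signal management and control method based on vehicle-road cooperation, related apparatus, and program product
CN115457781B (zh) * 2022-09-13 2023-07-11 内蒙古工业大学 Intelligent traffic signal light control method based on multi-agent deep reinforcement learning
CN115457782B (zh) * 2022-09-19 2023-11-03 吉林大学 Conflict-free cooperation method for autonomous vehicles at intersections based on deep reinforcement learning
CN115631638B (zh) * 2022-12-07 2023-03-21 武汉理工大学三亚科教创新园 Traffic light control method and system for a controlled area based on multi-agent reinforcement learning
CN116129635B (zh) * 2022-12-27 2023-11-21 重庆邮电大学 Platoon-based intelligent scheduling method and system for an isolated unsignalized intersection
CN117973538B (zh) * 2024-01-30 2024-08-06 西南交通大学 Energy management method for an interconnected traction power supply system based on a multi-substation game
CN118053311A (zh) * 2024-04-16 2024-05-17 联易云科(北京)科技有限公司 Traffic signal control method and apparatus based on a multi-agent reinforcement learning model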

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3662329A (en) 1968-08-20 1972-05-09 Gulf & Western Industries Multi-phase traffic control system
US3818429A (en) 1971-07-28 1974-06-18 Singer Co Multi-intersection traffic control system
US4323970A (en) 1979-06-22 1982-04-06 Siemens Aktiengesellschaft Method and circuit arrangement for generating setting signals for signal generators of a traffic signal system, particularly a street traffic signal system
US5357436A (en) 1992-10-21 1994-10-18 Rockwell International Corporation Fuzzy logic traffic signal control system
US5668717A (en) * 1993-06-04 1997-09-16 The Johns Hopkins University Method and apparatus for model-free optimal signal timing for system-wide traffic control
JP3399421B2 (ja) 1999-11-05 2003-04-21 住友電気工業株式会社 Traffic signal control device
US6690292B1 (en) 2000-06-06 2004-02-10 Bellsouth Intellectual Property Corporation Method and system for monitoring vehicular traffic using a wireless communications network
US6617981B2 (en) 2001-06-06 2003-09-09 John Basinger Traffic control method for multiple intersections
US6985090B2 (en) 2001-08-29 2006-01-10 Siemens Aktiengesellschaft Method and arrangement for controlling a system of multiple traffic signals
JP3680815B2 (ja) 2002-05-13 2005-08-10 住友電気工業株式会社 Traffic signal control method
US7688224B2 (en) 2003-10-14 2010-03-30 Siemens Industry, Inc. Method and system for collecting traffic data, monitoring traffic, and automated enforcement at a centralized station
US7590589B2 (en) * 2004-09-10 2009-09-15 Hoffberg Steven M Game theoretic prioritization scheme for mobile ad hoc networks permitting hierarchal deference
US20070273552A1 (en) 2006-05-24 2007-11-29 Bellsouth Intellectual Property Corporation Control of traffic flow by sensing traffic states
US20080204277A1 (en) 2007-02-27 2008-08-28 Roy Sumner Adaptive traffic signal phase change system
DE102008049568A1 (de) 2008-09-30 2010-04-08 Siemens Aktiengesellschaft Method for optimizing traffic control at a signal-controlled node in a road traffic network
US8040254B2 (en) 2009-01-06 2011-10-18 International Business Machines Corporation Method and system for controlling and adjusting traffic light timing patterns
GB0916204D0 (en) * 2009-09-16 2009-10-28 Road Safety Man Ltd Traffic signal control system and method
GB201009974D0 (en) * 2010-06-15 2010-07-21 Trinity College Dublin Decentralised autonomic system and method for use in an urban traffic control environment
US8554456B2 (en) * 2011-07-05 2013-10-08 International Business Machines Corporation Intelligent traffic control mesh

Also Published As

Publication number Publication date
WO2013086629A1 (en) 2013-06-20
CA2859049A1 (en) 2013-06-20
MX344434B (es) 2016-12-15
US9818297B2 (en) 2017-11-14
US20150102945A1 (en) 2015-04-16
CA2859049C (en) 2018-06-12

Legal Events

Date Code Title Description
FG Grant or registration