现代制造工程 ›› 2024, Vol. 523 ›› Issue (4): 80-86.doi: 10.16731/j.cnki.1671-3133.2024.04.011

• 车辆工程制造技术 • 上一篇    下一篇

基于强化学习的车队速度规划与能量管理联合优化*

卢兵1, 刘腾2,3, 霍为炜2,3   

  1. 1 北京理工大学机械与车辆学院,北京 100081;
    2 北京信息科技大学机电工程学院,北京 100192;
    3 新能源汽车北京实验室,北京 100192
  • 收稿日期:2023-06-16 出版日期:2024-04-18 发布日期:2024-05-31
  • 通讯作者: 刘腾,硕士,主要从事基于深度强化学习的燃料电池能量管理的研究工作。E-mail: liut_bistu@163.com
  • 作者简介:卢兵,博士,主要从事燃料电池能量管理的研究工作。E-mail:lubingev@sina.com;。
  • 基金资助:
    *国家自然科学基金面上项目(52077007)

Co-optimization of vehicle queue speed planning and energy management with reinforcement learning

LU Bing1, LIU Teng2,3, HUO Weiwei2,3   

  1. 1 School of Mechanical Engineering,Beijing Institute of Technology,Beijing 100081,China;
    2 Mechanical Electrical Engineering School,Beijing Information Science & Technology University, Beijing 100192,China;
    3 Collaborative Innovation Center for Electric Vehicles,Beijing 100192,China
  • Received:2023-06-16 Online:2024-04-18 Published:2024-05-31

摘要: 近年来,随着智能交通系统的快速发展,包括车车通信、车路通信等短距离实时无线通信信息,以及道路交通通行信息等远距离交通信息,使得车辆能够实时获知周围车辆运动情况以及前方交通环境情况,有利于提高车辆对周围交通环境的感知能力,以实现合理的出行安排与行驶控制,从而提高车辆的使用性能。为实现车队在多信号灯场景下的节能驾驶,提出了一种基于强化学习的车队速度规划与能量管理联合优化方法。通过SUMO平台,建立了包括5辆汽车的车队通过多个信号灯的场景,结果表明,提出的方法在舒适性、经济性和效率方面均优于传统的驾驶模型(Intelligent Driver Model,IDM)策略。

关键词: 多智能体强化学习, 能量管理, 燃料电池汽车, 联合优化

Abstract: In recent years,with the rapid development of intelligent transportation system,including vehicle-vehicle communication,vehicle-road communication and other short-range real-time wireless communication information,as well as road traffic information and other long-distance traffic information,so that the vehicle is able to obtain real-time knowledge of the surrounding vehicle movement as well as the traffic environment in front of it,which is conducive to improving the vehicle′s perception of the surrounding traffic environment in order to achieve reasonable travel arrangements and driving control,thus improving the vehicle performance. In order to realize the energy-saving driving of fleet in multi-signal light scenarios,a co-optimization method based on reinforcement learning for vehicle queuq speed planning and energy management was proposed.Through the SUMO platform,a scenario including five vehicles for a fleet of vehicles passing through multiple signals was established.The results show that the proposed method outperforms the traditional driving model Intelligent Driver Model (IDM) in terms of comfort,economy and efficiency.

Key words: multi-agent reinforcement learning, energy management, fuel cell vehicles, co-optimization

中图分类号: 


版权所有 © 《现代制造工程》编辑部 
地址:北京市东城区东四块玉南街28号 邮编:100061 电话:010-67126028 电子信箱:2645173083@qq.com
本系统由北京玛格泰克科技发展有限公司设计开发 技术支持:support@magtech.com.cn
访问总数:,当日访问:,当前在线: