报告讲座

当前位置：首页 > 学院要闻 > 报告讲座 > 正文

分类

肖峰：Optimizing online matching for ride-sourcing services with multi-agent deep reinforcement learning

报告时间：2019年07月12日（周五）下午16:00-17:30

报告地点：屯溪路校区三立苑331室

报告人：肖峰教授

工作单位：西南财经大学工商管理学院

举办单位：太阳商城贵宾会2017Cm

个人简介：

肖峰，教授，博士生导师，西南财经大学大数据研究院副院长。国家自然科学基金优秀青年基金获得者，教育部长江学者青年学者（2017）。主持多项国家和省部级课题。研究方向包括道路拥挤收费、网络建模和优化、博弈论、机器学习与交通数据挖掘、智能交通系统等。在管理科学与工程交通研究领域著名国际期刊和会议如Transportation Science，TransportationResearch Part A、B、C、D, ISTTT等发表多篇论文。

报告简介：

Ride-sourcing servicesare now reshaping the way people travel by effectively connecting drivers andpassengers through mobile internets. Online matching between idle drivers andwaiting passengers is one of the most key components in a ride-sourcing system.The average pickup distance or time is an important measurement of systemefficiency since it affects both passengers’ waiting time and drivers’utilization rate. It is naturally expected that a more effective bipartitematching (with smaller average pickup time) can be implemented if the platformaccumulates more idle drivers and waiting passengers in the matching pool. Aspecific passenger request can also benefit from a delayed matching sincehe/she may be matched with closer idle drivers after waiting for a few seconds.Motivated by the potential benefits of delayed matching, this paper establishesa two-stage framework which incorporates a combinatorial optimization andmulti-agent deep reinforcement learning methods. The multi-agent reinforcementlearning methods are used to dynamically determine the delayed time for eachpassenger request (or the time at which each request enters the matching pool),while the combinatorial optimization conducts an optimal bipartite matchingbetween idle drivers and waiting passengers in the matching pool. Tworeinforcement learning methods, spatio-temporal multi-agent deep Q learning(ST-M-DQN) and spatio-temporal multi-agent actor-critic (ST-M-A2C) aredeveloped. Through extensive empirical experiments with a well-designed simulator,we show that the proposed framework is able to remarkably improve systemperformances.

领导与决策机构

学院党政领导

学院党委委员会

学院纪律委员会

学院教学委员会

学院学术委员会

服务与管理机构

学院兼职秘书

学院党政办公室

学院教学办公室

学院科研办公室

学生工作办公室

教学机构

车辆工程系

能源科学与动力系

制冷与低温工程系

道路与交通工程系

运输与物流工程系

实验中心

科研机构

汽车工程技术研究院

科研团队

学科平台

报告讲座

肖峰：Optimizing online matching for ride-sourcing services with multi-agent deep reinforcement learning

肖 峰：Optimizing online matching for ride-sourcing services with multi-agent deep reinforcement learning

肖峰：Optimizing online matching for ride-sourcing services with multi-agent deep reinforcement learning