学术报告《 Data-Driven Control for Intelligent Systems: From Smart Grid to Robotics》-通知公告-新闻中心-江苏省计算机学会

欢迎访问江苏省计算机学会网站！

设为首页 | 收藏本站

当前位置：首页 > 新闻中心 > 通知公告

新闻中心

通知公告

学术报告《 Data-Driven Control for Intelligent Systems: From Smart Grid to Robotics》

发布时间：2020-12-15 10:41:17

南京大学计算机科学与技术系

软件新技术与产业化协同创新中心

摘要：

Deep neural networks have proven to be successful in many identification tasks, however, from model-based control perspective, these networks are difficult to work with because they are typically nonlinear and nonconvex. Therefore, many systems are still identified and controlled based on simple linear models despite their poor representation capability. In the first part of the talk, I will introduce our recent work on bridging the gap between model accuracy and control tractability faced by neural networks, by explicitly constructing networks that are convex with respect to their inputs. We show that these input convex networks (ICNN) can be trained to obtain accurate models of complex physical systems. Experiment results demonstrate the good potential of the proposed ICNN approach in a variety of control applications. In particular we show that in the MuJoCo robotics locomotion tasks, we could achieve over 10% higher performance using 5× less time compared with state-of-the-art model-based reinforcement learning method; and in the building HVAC control example, our method achieved up to 20% energy reduction compared with classic linear models. Besides the complexity of system dynamics, physical systems often encompass a myriad of uncertainties that come from human behavior and the environment. In the second part of the talk, I will introduce a work in collaboration with DeepMind on learning reinforcement learning policies that are robust to perturbations in the environment dynamics. We propose a new data-driven algorithm for incorporating robustness into standard RL called Data-Driven Robust Maximum a-posteriori Policy Optimization (DDR-MPO). This algorithm first learns several transition models from the environment perturbation datasets, then incorporating the transition models into the simulator as an uncertainty set. We show that DDR-MPO outperforms standard RL algorithms in a variety of MuJoCo domains under different perturbed environments.

报告人简介:

石媛媛 (Yuanyuan Shi) 于2011-2015年就读于南京大学工程管理学院自动化专业，获得工程学士学位。2015-2020年于美国华盛顿大学西雅图分校（University of Washington, Seattle）电子及计算机工程学院获得博士学位，导师为Baosen Zhang。目前在加州理工学院跟随Anima Anandkumar与Adam Wierman进行博士后研究。将于2021年入职加州大学圣地亚哥分校 (UCSD)电子及计算机工程系担任助理教授。她的主要研究方向为智能控制, 机器学习及其应用在智能电网，物流以及机器人系统。她已在IEEE Transactions on Automatic Control, IEEE Transactions on Power Systems等国际期刊以及ICLR, NeurIPS, CDC, ACC等机器学习和控制会议上发表多篇论文。曾在Google DeepMind, 京东北美研究院等多处实习，以及获得由MIT授予的2018女性学术新星奖 (Rising Stars in EECS)。

时间:12月18日 10:00-11:30

腾讯会议ID: 832 554 424

上一篇：青年学者学术报告《Inventing New Sensing Modalities on Commodity Smartphones》
下一篇：关于召开江苏省计算机学会计算机伦理与职业修养专委会 2020 年学术年会和换届选举大会第一轮通知

友情链接：

江苏省科学技术协会中国计算机学会南京大学南京大学计算机科技与技术系南京大学软件学院东南大学计算机科学与工程学院江苏经贸职业技术学院南京信息职业技术学院南京工业职业技术学院江苏海事职业技术学院常州信息职业技术学院国网电力科学研究院电子科技集团第28研究所江南计算技术研究所

Copyright (c) 版权所有江苏省计算机学会   南京网站建设公司
秘书处办公室地址：江苏省南京市仙林大道163号邮编：210023   电话/传真：025-89680909
秘书处市内联络点地址：江苏省南京市汉口路22号邮编：210093 电话/传真：025-86635622
电子邮箱：jscs@nju.edu.cn 网址：www.jscs.org.cn 技术支持：南京成旭通信息技术有限公司

网站备案号：苏ICP备14049275号-1

您是本站第28457555位来客！

在线客服