学术报告第201978期

作者：时间：2019-11-26 浏览：

题目：Learning from Data to Select Optimal Decisions and Policies

报告人：Yannis Paschalidis（波士顿大学）

报告时间：2019年11月27日 9:30--10:30

报告地点：南一楼中 311

报告摘要：

Making optimal decisions in the absence of an accurate model predicting their impact

is a hard problem, particularly for sequential decision making under

uncertainty. Essentially one is faced with the double challenge of not knowing a

transition model and having to deal with the curse-of-dimensionality. I will consider two related settings. In the first, simpler setting, we are interested in making a single decision. A motivating example is to select the type of treatment to be recommended for a specific patient. I will present a new prediction-based prescriptive model that (i) predicts the outcome under each action using a robust nonlinear model, and (ii) adopts a randomized prescriptive policy determined by the predicted outcomes. The predictive model combines a new distributionally robust learning approach with K-Nearest Neighbors (K-NN) regression,which helps to capture potential nonlinearities embedded in the data. We establish out-of-sample guarantees for the predictive model and the optimality of the randomized policy in terms of the expected true future outcome. I will discuss applications in making treatment recommendations for patients with hypertension and for diabetic patients.

In the second setting, I will consider a Markov Decision Process and develop a robust

method for learning the policy and transition probability model from data (state,

action, next state tuples). We propose two robust maximum likelihood estimation

algorithms for learning the transition probability model and policy, respectively. An

upper bound is established on the regret, which is the difference between the average

reward of the estimated policy under the estimated transition probabilities and that

of the original unknown policy under the true (unknown) transition probabilities. We

provide a sample complexity result showing that we can achieve a low regret with a

relatively small amount of training samples. I will discuss applications in making

treatment decisions to optimize disease progression and also in robotics.

报告人简介：

Yannis Paschalidis is a Professor and Data Science Fellow in Electrical and Computer

Engineering, Systems Engineering, and Biomedical Engineering at Boston University. He is the Director of the Center for Information and Systems Engineering (CISE). He

obtained a Diploma (1991) from the National Technical University of Athens, Greece,

and an M.S. (1993) and a Ph.D. (1996) from the Massachusetts Institute of Technology (MIT), all in Electrical Engineering and Computer Science. He has been at Boston University since 1996. His current research interests lie in the fields of systems and control, networks, optimization, operations research, computational biology, and medical informatics.

Prof. Paschalidis' work has been recognized with a CAREER award (2000) from the

U.S. National Science Foundation, the second prize in the 1997 George E. Nicholson

paper competition by INFORMS, the best student paper award at the 9th Intl. Symposium of Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt 2011) won by one of his Ph.D. students for a joint paper, an IBM/IEEE Smarter Planet Challenge Award, and a finalist best paper award at the IEEE International Conference on Robotics and Automation (ICRA). His work on protein docking (with his collaborators) has been recognized for best performance in modeling selected protein-protein complexes against 64 other predictor groups (2009 Protein Interaction Evaluation Meeting). His recent work on health informatics won an IEEE Computer Society Crowd Sourcing Prize and a best paper award by the International Medical Informatics Associations (IMIA). He was an invited participant at the 2002 Frontiers of Engineering Symposium organized by the National Academy of Engineering, and at the 2014 National Academies Keck Futures Initiative (NAFKI) Conference. Prof. Paschalidis is a Fellow of the IEEE and the founding Editor-in-Chief of the IEEE Transactions on Control of Network Systems.

上一篇：学术报告第201979期

下一篇：学术报告第201976期

科学研究

喻园.AIA学术论坛

学术报告第201978期

友情链接

联系我们