| View topic as text - SCI Forum (http://artsoncqu.eicp.top/scibbs/index.asp) -- Institute of Statistics (http://artsoncqu.eicp.top/scibbs/list.asp?boardid=65) ---- Academic talk: Professor Yuhong Yang, Department of Statistics, University of Minnesota (http://artsoncqu.eicp.top/scibbs/dispbbs.asp?boardid=65&id=12113) |
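| -- Author: 夜莺 -- Posted: 2013/7/6 21:24:22 -- Academic talk: Professor Yuhong Yang, Department of Statistics, University of Minnesota. Time: 10:00 a.m., July 7, 2013. Venue: Room 301, Department of Statistics and Actuarial Science, College of Mathematics and Statistics, Huxi Campus, Chongqing University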
Multi-Armed Bandit with Covariates
Yuhong Yang, School of Statistics, University of Minnesota
The multi-armed bandit problem is an important optimization game that requires an exploration-exploitation tradeoff to achieve optimal total reward. Motivated by industrial applications such as online advertising and adaptive clinical trial design, we consider a setting where the rewards of bandit machines are associated with covariates, and accurate estimation of the corresponding mean reward functions plays an important role in the performance of the allocation rules. We establish strong consistency of nonparametric methods and derive their rates of convergence. Model selection and combination results are also presented. This is joint work with Wei Qian. |