Academic Talk 260: Reinforcement Learning - School of Computer Engineering and Science, Shanghai University





Published: 2013/11/15, posted by Liu Hua (刘华)

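Topic: Reinforcement Learning
Reinforcement Learning: An Introduction
Speaker: Bernard Manderick
Professor, Artificial Intelligence Lab, Vrije Universiteit Brussel, Belgium
Time: Friday, 15 November 2013, 2:00 PM
Venue: Room 1104, Computer Building, East Area, Main Campus, Shanghai University
Host: Dr. Liu Yue (刘悦)
Abstract: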
In this talk we explore a computational approach to learning from interaction. That is, we adopt the perspective of an artificial intelligence researcher. We explore designs for machines that are effective in solving learning problems of scientific interest, and we evaluate those designs through mathematical analysis or computational experiments. The approach we explore, called reinforcement learning, focuses on goal-directed learning from interaction.
About the speaker:
Since 1994, Bernard Manderick has been a professor in the Artificial Intelligence Lab at the Department of Computer Science of the Vrije Universiteit Brussel. Currently, the AI Lab consists of 3 professors, 3 senior researchers, 8 postdocs, and 15 PhD students. For more information, see the homepage of the AI Lab: http://ai.vub.ac.be. He is (co-)author of over 130 papers covering several machine learning techniques (genetic algorithms, support vector machines, reinforcement learning, Bayesian networks, ...) and several of their applications (bioinformatics, text mining, evolvable hardware, music classification, ...). He is (co-)supervisor of 11 PhD theses and currently (co-)supervises 8 PhD students. Finally, he has been coordinator of over 25 research projects and is involved in several international research collaborations.
For his PhD thesis, "Selectionism as a Basis for Categorization and Adaptation", Bernard Manderick received the IBM Prize for Informatics awarded by the Fund for Scientific Research (FWO). He also received a Science and Technology Agency Fellowship from the European Commission (DG XII), sponsored by the Japanese Government, and an ERCIM (European Research Consortium for Informatics and Mathematics) Fellowship.
