首页-科学研究_学术预告

学术报告299-隐藏数据的发现

发布日期:  2014/11/07  刘华   浏览次数: 部门: 未知   返回

报告人:巩志国 副教授[University of Macau, China]
报告时间:11月 08日(周六)14: 30~16: 00
报告地点:校本部东区计算机学院大楼1001室
邀 请 人:骆祥峰 教授
内容摘要:
A large number of web data repositories are hidden behind restrictive web interfaces, making it an important challenge to enable data analytics over these hidden web databases. Most existing techniques assume a form-like web interface which consists solely of categorical attributes (or numeric ones that can be discretized). Nonetheless, many real-world web interfaces (of hidden databases) also feature checkbox interfaces - e.g., the specification of a set of desired features, such as A/C, navigation, etc., for a car-search website like Yahoo!~Autos. We find that, for the purpose of data analytics, such checkbox-represented attributes differ fundamentally from the categorical/numerical ones that were traditionally studied. In this paper, we address the problem of data analytics over hidden databases with checkbox interfaces. Extensive experiments on both synthetic and real datasets demonstrate the accuracy and efficiency of our proposed.
Brief Biography:
Zhiguo Gong got his PhD degree in Computer Science from Chinese Academy of Science in 1998, and is currently an associate Professor in the Department of Computer and Information Science, University of Macau, Macau, China. His research interests include Database, Web Information Retrieval and Web Mining.
.



上一条:学术报告300-新媒体时代与数据时代的可视化

下一条:学术报告298-计算流体力学的直接建模