Novel Methods in Data Summarization, Anonymization, and Indexing
Speaker: Pangiotis Karras, National University of Singapore
Location: Warren Weaver Hall 1302
Date: April 6, 2011, 11:30 a.m.
Host: Denis Zorin
This talk will present highlights of my research in three areas of data management and mining. The first area concerns data summarization under space or accuracy constraints. We will mention instances of novel algorithms and structures for synopsis construction I have introduced, which outperform previous state-of-the-art in both accuracy and efficiency, while going beyond conventional assumptions in the area to show how some problems are not as hard as previously thought. The second area concerns the transformation of relational and transaction (set-valued) data in order to satisfy a privacy constraint. We will shortly outline two such instances, namely an algorithm that satisfies the l-diversity model, and a novel model for transaction data publishing with algorithms therefor. Last, we will discuss the problem of indexing and answering complex queries on semi-structured Semantic Web data, and the Hexastore, a sextuple index structure I introduced for that purpose. The focus will be on the core ideas on each topic, the pedagogic insights one can find in them. The end of the talk will outline the directions of my future research plans.
Refreshments will be offered starting 15 minutes prior to the scheduled start of the talk.