While fixing a broken link in my previous post, I found another interesting webinar hosted by MySQL. This one is a case study on how to analyze terabytes of click stream data to improve advertising effectiveness. Since it is a database-centric talk, the focus will be on performance rather than statistics.
This webinar will present a case study of how a large online ad network uses MySQL and the Infobright storage engine to analyze web log files with the goal of improving campaign effectiveness.
The system houses up to 9 TB of data, collected every 6 months, in less than 1 TB of storage. More than 100 million rows of data are collected per day. The data includes details such as viewed ads, ads acted on, geographical location and demographic information. Analytics on this data determine the attributes of each user, with the goal of finding out which users should be presented with which ads to maximize clicks and conversions.
The queries executed on this system are extremely complex. Billions of rows of data are compared to determine what the unique attributes of a user are. This talk will examine these queries and discuss how they can be executed with excellent performance.
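As a rough sanity check on the figures quoted above, here is a small back-of-envelope sketch. It is my own arithmetic, not something from the webinar, and it assumes decimal terabytes, a 182-day half year, and that the quoted numbers (9 TB raw, 1 TB stored, 100 million rows per day) can be treated as exact even though the abstract gives them as bounds. The function name `estimate` is just for illustration.

```python
# Back-of-envelope check of the figures quoted in the webinar abstract.
# Assumptions (mine, not the webinar's): decimal terabytes, a 182-day
# half year, and the quoted bounds treated as if they were exact values.

TB = 10**12  # bytes in a decimal terabyte


def estimate(raw_bytes, stored_bytes, rows_per_day, days):
    """Return total rows, raw-to-stored size ratio, and raw bytes per row."""
    rows = rows_per_day * days
    return {
        "rows": rows,
        "size_ratio": raw_bytes / stored_bytes,
        "raw_bytes_per_row": raw_bytes / rows,
    }


result = estimate(
    raw_bytes=9 * TB,          # "up to 9 TB of data"
    stored_bytes=1 * TB,       # "in less than 1 TB"
    rows_per_day=100_000_000,  # "more than 100 million rows ... per day"
    days=182,                  # roughly six months
)

for name, value in result.items():
    print(f"{name}: {value:,.1f}")
```

Under those assumptions this works out to about 18 billion rows per six months, a raw-to-stored ratio of about 9 to 1, and roughly 500 bytes of raw data per row, which is at least consistent with the "billions of rows" mentioned in the abstract.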
The webinar is on June 21 at 10am PDT. Register for it here.