Data Strategy

June 18, 2007

A case study: Analyzing terabytes of click stream data

Filed under: Uncategorized — chucklam @ 12:56 pm

While fixing a broken link in my previous post, I found another interesting webinar hosted by MySQL. This one is a case study on how to analyze terabytes of click stream data to improve advertising effectiveness. Since it is a database-centric talk, the focus will be on performance rather than statistics.

This webinar will present a case study of how a large online ad network uses MySQL and the Infobright storage engine to analyze web log files with the goal of improving campaign effectiveness.

In less than 1 TB the system houses up to 9 TB of data collected every 6 months. More than 100 million rows of data are collected per day. The data includes details such as viewed ads, ads acted on, geographical location and demographic information. Analytics run on this information determine the attributes of a user, with the goal of finding out which users should be presented with which ads to maximize clicks and conversions.
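To get a feel for the scale, here is a quick back-of-the-envelope calculation of my own (it is not part of the webinar abstract). It takes the quoted figures at face value and assumes six months is about 182 days and that 1 TB is 10^12 bytes. Since the abstract gives "less than 1 TB", "up to 9 TB" and "more than 100 million rows", the results are only rough bounds: a compression ratio of about 9:1, about 18 billion rows per six-month window, and somewhat under 500 bytes of raw data per row.

```python
# Back-of-the-envelope figures from the webinar abstract.
RAW_TB = 9                   # "up to 9 TB" of data per six months
STORED_TB = 1                # housed in "less than 1 TB" (upper bound)
ROWS_PER_DAY = 100_000_000   # "more than 100 million rows" per day
DAYS = 182                   # assumption: six months is about 182 days
BYTES_PER_TB = 10**12        # assumption: decimal terabytes


def compression_ratio(raw_tb, stored_tb):
    """Raw size divided by stored size."""
    return raw_tb / stored_tb


def rows_collected(rows_per_day, days):
    """Total rows accumulated over the given number of days."""
    return rows_per_day * days


def raw_bytes_per_row(raw_tb, total_rows):
    """Average raw size of one row, in bytes."""
    return raw_tb * BYTES_PER_TB / total_rows


if __name__ == "__main__":
    total_rows = rows_collected(ROWS_PER_DAY, DAYS)
    print("compression ratio:", compression_ratio(RAW_TB, STORED_TB))
    print("rows per six months:", total_rows)
    print("raw bytes per row:", round(raw_bytes_per_row(RAW_TB, total_rows)))
```

The row count is consistent with the "billions of rows" that the abstract says the queries compare.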

The queries executed on this system are extremely complex. Billions of rows of data are compared to determine what the unique attributes of a user are. This talk will examine these queries and discuss how they can be executed with excellent performance.

The webinar is on June 21 at 10am PDT. Register for it here.
