Data Strategy

June 12, 2007

Forthcoming book “Introduction to Information Retrieval”

Filed under: Information Retrieval, Search — chucklam @ 4:57 pm

It is written by Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze, all noted experts in statistical natural language processing and information retrieval. (Chris and Prabhakar are both professors at Stanford. Prabhakar is also Head of Yahoo! Research.) The book will be published by Cambridge University Press sometime in 2008. Fortunately, for those of us who can’t wait, advance draft of the book is available at

The book will be a welcome introduction to building today’s information retrieval system. It’s the first coherent textbook that incorporates in one place techniques that tend to be taught in different areas. It covers basic topics in classical IR, such as indexing, vector space model, and relevance feedback. It also covers techniques of machine learning and statistical analysis, such as Naive Bayes, support vector machines, clustering, and latent semantic indexing. Finally, it includes web-specific techniques such as crawling and link analysis. It’s intended to be a texbook for advanced undergraduates and should be a useful addition to a search engine practitioner’s library as well.


Leave a Comment »

No comments yet.

RSS feed for comments on this post. TrackBack URI

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )


Connecting to %s

Blog at

%d bloggers like this: