BeyeBLOGS | BeyeBLOGS Home | Get Your Own Blog

« A Short History of Bad Decisions | Main | Using business rules to write maintainable code »

December 27, 2006

Book Review: Data Mining Techniques

I am getting caught up on book reviews over the break. Today's is Data Mining Techniques by Berry and Linoff. This is one of the classic works on data mining and well worth the read.I really liked the book both because it is well written and because, although it drilled into a fair amount of detail about some of the techniques, it started each new section off at a high level. This allows someone without a statistical background, such as me, to read as far as I can in each section and then skip ahead to the next technique. This is a nice change from books that simply get more and more detailed as page follows page, preventing you from gaining an overview of the subject. The book introduces data mining and a methodology for applying it, talks about some of the applications in "Marketing, Sales, and Customer Relationship Management" (as the subtitle puts it), walks through some statistical techniques and then spends the bulk of the book on various data mining techniques. It wraps up with a nice summary of how data mining plays with other technologies and with some practical advice on getting started.

One of the best summaries of where data mining, and indeed EDM, fits is given early in the book where an enterprise is encouraged to:

The authors point out that Data Mining is focused on the "Learn" stage or, as they put it data mining suggests but businesses decide. EDM, of course, is concerned not only with learning but also with acting, most particularly acting by automating decisions in front-line systems. Merely finding patterns is not enough - you must respond to the patterns and act on them, ultimately turning data into information, information into action and action into value.

The methodology section, and the subsequent notes that relate to applying these techniques in real life, talked about the feedback loops between steps in data mining - there is not a linear "waterfall" sequence of steps but constant iteration and learning. They also emphasized the importance of finding the right business problem at the beginning - start as someone once said, with the end in mind. This was reiterated when they quote Voltaire who said "Le mieux est l'ennemi du bien" ("The best is the enemy of good"). In other words, don't get hung up on trying to find the perfect algorithm, perfect answer. Instead build something that is good, that works, and learn and improve over time.

The authors made a big point out of the value of data mining for "mass intimacy", where you want to treat customers differently and there is a business reason to do so but where customers are too numerous to be assigned to staff. One of the issues they pointed out was that staff must be trained in customer interaction skills while also using all the data you have. This can be a real challenge and is one of the reasons I prefer an EDM approach, where the decisions those staff need to make are automated, to other approaches. By giving them the decisions they need you free them to work on the relationship (as I have discussed before). The value of data mining, and EDM, in building a customer-centric organization cannot be overestimated.

Some random snippets of useful stuff from the book:

You can buy the book here and it should definitely be on your bookshelf.

Technorati Tags: , , , , , , , , , , , , , ,

Share: del.icio.us Digg Furl ma.gnolia Netscape Newsvine reddit StumbleUpon Yahoo MyWeb  

Posted by James Taylor at December 27, 2006 10:42 PM

Comments

Post a comment




Remember Me?

(you may use HTML tags for style)