In case you missed it #20

  • Data Mining in R is a free, work-in-progress e-book from Luís Torgo that starts with the very basics of R and shows different data mining methods for prediction and visualization. Definitely worth a peek.
  • Google Tech Talks - Statistical Aspects of Data Mining is a series of lectures on data mining, using R no less. Great material and very easy to follow. Course website is here.
  • Intro to Data Mining offers three free chapters as PDFs. I've read the cluster analysis chapter and it is excellent.
  • Corey Haines' performance kata, where he creates a StringTemplater in full BDD style using RSpec. I found it curious that he would not use "shoulds" on his "its" in RSpec. Why? Wait for the end.
  • How to determine missing indexes in PostgreSQL has some good pointers for finding the obvious missing primary keys or unique indexes. The comment on the post also contains a view for finding tables that are hit by full table scans rather than index scans, which may help identify missing or wrong indexes.
  • Ruby Version Manager is an awesome little gem that makes using different versions of Ruby on Linux or Mac extremely simple.
Tagged as: in case you missed it, bdd, ruby, postgresql, r, data mining