Bin Packing Too Many Features
My girlfriend has been struggling with an interesting little problem lately. She was asked to determine the optimal distribution of medicine boxes and bottles over a set of adaptable cabinets; under...
View ArticleSnake Oil and Tiger Repellant
The Wall Street Journal has an interesting article explaining how companies are starting to use (big) data to support their recruiting efforts. It provides a good example of the more general trend in...
View ArticleActionable Predictive Analytics with Oracle Data Mining
Oracle Data Mining (ODM) provides powerful data mining functionality as native SQL functions within the Oracle Database. This Oracle By Example Tutorial gives a good overview of the GUI. While being...
View ArticleA/B Testing XXL
[I’ve tweeted about this before.] If fashion stores believed in A/B testing, they would probably only sell white XXL shirts. Most customers would fit tent-sized garments; most colours go well with...
View ArticleEvidence-Based Everything
I’m not really interested in an exposition of your facts. I don’t very much care to learn about your reasons. First, show me your evidence. Once we’ve established what you think you’ve seen, we can...
View ArticleRestaurant Reviews and the Availability Heuristic
You could say fine dining is a bit of a hobby of mine; and as I’ve mentioned before, I’ve composed quite a few restaurant reviews over the years. I enjoy writing about food almost as much as I love...
View ArticleA New Kind Of Science?
I am no longer a Corporate Ninja. As of a few weeks ago I can now call myself “Data Scientist at Booking.com“. Although I am really excited about the new challenges and opportunities that await me in...
View ArticleData Science: for Fun and for Profit
In the next few weeks I’ll be giving two talks on the topic of Data Science at Xebicon and another event affiliated with Xebia. There is an abstract of my spiel available on the Xebicon site. Data...
View ArticleSimulating Repeated Significance Testing
My colleague Mats has an excellent piece on the topic of repeated significance testing on his blog. To demonstrate how much [repeated significance testing] matters, I’ve ran a simulation of how much...
View ArticlePredictive Analytics World London
In October I’ll speak at Predictive Analytics World in London. Once again, I’ll be talking about Data Science. You can register for the event on the site. Slides are already available online.
View Article