There are several technology and business forces in-play that are going to derive and drive new sources of customer, product and operational value. As a set up for this ...
PREFACE Previously, I tackled the Gambler’s Ruin problem using conditional probability and difference equations as well as visualising the simulations of the proble...
If there was an AI winter, we are clearly in the peak of its summer. I do not know if we will ever build something as Skynet, but we are going to build much simpler thing...
In this 5 Minute Analysis we’ll preprocess, map, and explore complicated sales data for liquor stores in Iowa. Then we’ll extract the relevant latitude and longit...
This is the first article in what will be a three-part series: “How to make your mark on the world as a talented, socially conscious data scientist.” In thi...
Many of the following statistical tests are rarely discussed in textbooks or in college classes, much less in data camps. Yet they help answer a lot of different and inte...
An Introduction to Bayesian Reasoning You might be using Bayesian techniques in your data science without knowing it! And if you’re not, then it could enhance the p...
The original article is no longer available. Similar (and more comprehensive) material is available below. Example of underfitted, well-fitted and overfitted models Con...
Nowadays, industries are privileged by the opportunity to apply data science to reach new heights in their efficiency, productiveness, and overall success. The range of t...
I have used synthetic data sets many times for simulation purposes, most recently in my articles Six degrees of Separations between any two Datasets and How to Lie wit...