The problem has to do with sampling, random numbers and probability distributions, so it is of interest to our community. As Scott Aaronson describes it in his blog, her...
This document is an attempt to provide a summary of the mathematical background needed for an introductory class in machine learning, which at UC Berkeley is known as CS ...
Correlation coefficients enable you to find relationships between a wide variety of data. However, the sheer number of options can be overwhelming. This picture sums up t...
This article was written by Cassie Kozyrkov. You’ve probably heard of machine learning and artificial intelligence, but are you sure you know what they are? If you...
I recently came across an interesting account by a practical data scientist on how to munge 25 TB of data. What caught my eye at first was the article’s title: ...
Explanations of logistic regression as a generalized linear model, and of its use as a classifier, are often confusing. In this article, I try to explain this idea from first pr...
Data science is becoming the pulse of the industry, as it provides the minute details needed to speed up marketing campaigns, enhance business opportunities, drive excellence an...
I am presenting at the upcoming NISS (National Institute of Statistical Sciences) webinar on September 27. NISS was my first employer in the US, back in 1996. I was then comp...
I grew up in a small manufacturing town in Northeast Iowa. The factory in my hometown made tractors (no surprise given that it was Iowa), but eventually the economics o...
Guest blog post by Nabanita Roy. Introduction: The art and science of discriminating between writing styles of authors by identifying the characteristics of the persona ...