Introduction : Web scraping or crawling is the process of extracting data from any website. The data does not necessarily have to be in the form of text, it could be imag...
What makes a great CEO? First of all, the answer to that question depends on who raises it: An investor will likely come up with a different answer than an employee. What...
This article is a solid introduction to statistical testing, for beginners, as well as a reference for practitioners. It includes numerous examples as well as illustratio...
In practice, the Data Scientist wants to know which formula they will write in their Excel sheet when they enter all the data available into it: Bayes’ or usual? The an...
PREFACE Previously, I tackled the Gambler’s Ruin problem using conditional probability and difference equations as well as visualising the simulations of the proble...
In this 5 Minute Analysis we’ll preprocess, map, and explore complicated sales data for liquor stores in Iowa. Then we’ll extract the relevant latitude and longit...
The original article is no longer available. Similar (and more comprehensive) material is available below. Example of underfitted, well-fitted and overfitted models Con...
Probably the worst error is thinking there is a correlation when that correlation is purely artificial. Take a data set with 100,000 variables, say with 10 observations. ...
Many statistics, such as correlations or R-squared, depend on the sample size, making it difficult to compare values computed on two data sets of different sizes. Here, w...
Artificial intelligence (AI) seemingly has been discussed everywhere over the last few years, and now it’s made its way into the commercial insurance industry. Organiza...