Free Book: Probability and Statistics Cookbook
The format is very similar to a BIG cheat sheet. This cookbook integrates a variety of topics in probability theory and statistics. It is based… Read More »Free Book: Probability and Statistics Cookbook
Author and Publisher at MLtechniques.com. Machine learning scientist, mathematician, book author (Wiley), patent owner, former post-doc at Cambridge University, former VC-funded executive, with 20+ years of corporate experience including CNET, NBC, Visa, Wells Fargo, Microsoft, eBay. Vincent also founded and co-founded a few start-ups, including one with a successful exit (Data Science Central acquired by Tech Target).
The format is very similar to a BIG cheat sheet. This cookbook integrates a variety of topics in probability theory and statistics. It is based… Read More »Free Book: Probability and Statistics Cookbook
This article has been updated, and now includes two new sections at the bottom (sections 4 and 5), featuring interesting results, more accurate approximations, and… Read More »Little Proof of the Prime Number Theorem
Or of any celestial body. Here I discuss a solution that can be explained to high school students, to get them interested in mathematics, statistics… Read More »Math Challenge: Computing the Average Rotational Speed of Earth
This picture originally posted here covers the following topics: Basic stack Integrated platforms Visualization Data formats Large & out-of-memory data Hadoop Glue As backend GPU in-database… Read More »R for Big Data in One Picture
In this article, I clarify the various roles of the data scientist, and how data science compares and overlaps with related fields such as machine… Read More »Difference between Machine Learning, Data Science, AI, Deep Learning, and Statistics
Today Trump met with leaders of pharmaceutical companies, to discuss “astronomical” drug prices and reduce regulations, so that drug companies can still make hefty profits while… Read More »Will Trump Kill Statistician's Jobs
This is part of a new series of articles: once or twice a month, we post previous articles that were very popular when first published.… Read More »20 Great Blogs Posted in the last 12 Months
In this article, we discuss a general framework to drastically reduce the influence of outliers in most contexts. It applies to problems such as clustering… Read More »Tutorial: Neutralizing Outliers in Any Dimension
The Art and Science of Encrypting, Embedding and Hiding Messages in Pictures and Videos. This is related to data encryption and security. Imagine that you… Read More »Interesting Data Science Application: Steganography
By David Robinson. David Robinson is a data scientist at Stack Overflow. His article (parts of it) was re-posted in the Washington Post, here. This is… Read More »Data Science Reveals Trump Tweets are Written by Two People