Azure Data Studio in One Picture
I was invited at the recent Microsoft’s Azure Data and AI Tech Immersion event (March 2019) and we played a bit with various tools, including Azure… Read More »Azure Data Studio in One Picture
Author and Publisher at MLtechniques.com. Machine learning scientist, mathematician, book author (Wiley), patent owner, former post-doc at Cambridge University, former VC-funded executive, with 20+ years of corporate experience including CNET, NBC, Visa, Wells Fargo, Microsoft, eBay. Vincent also founded and co-founded a few start-ups, including one with a successful exit (Data Science Central acquired by Tech Target).
I was invited at the recent Microsoft’s Azure Data and AI Tech Immersion event (March 2019) and we played a bit with various tools, including Azure… Read More »Azure Data Studio in One Picture
This is another interesting problem, off-the-beaten-path. It ends up with a formula to compute the integral of a function, based on its derivatives solely. For… Read More »From Infinite Matrices to New Integration Formula
The following articles were hand-picked, and curated by one of our interns. They cover dozens of topics of interest to data scientists. Precision vs significance… Read More »12 Great Curated Blogs About Data Science
This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, correlation,… Read More »19 Great Articles About Natural Language Processing (NLP)
Here we describe a simple methodology to produce predictive scores that are consistent over time and compatible across various clients, to allow for meaningful comparisons… Read More »How to Stabilize Data Systems, to Avoid Decay in Model Performance
It all depends on the classes that you attended. Some are worth listing, some are best not to mention. Here I review of few of… Read More »Should you Add your Coursera, Udacity, or DataCamp Training to your Resume?
Many of the following statistical tests are rarely discussed in textbooks or in college classes, much less in data camps. Yet they help answer a… Read More »A Plethora of Original, Not Well-Known Statistical Tests
The original article is no longer available. Similar (and more comprehensive) material is available below. Example of underfitted, well-fitted and overfitted models Content Regression What… Read More »Linear regression in Python: Using numpy, scipy, and statsmodels
I have used synthetic data sets many times for simulation purposes, most recently in my articles Six degrees of Separations between any two Datasets and How to Lie… Read More »Surprising Uses of Synthetic Random Data Sets
I have used synthetic data sets many times for simulation purposes, most recently in my articles Six degrees of Separations between any two Datasets and… Read More »Surprising Uses of Synthetic Random Data Sets