Understanding basics of data science
In this blog I will be covering the basic understanding of a data science problem. To begin with I think before starting with any data… Read More »Understanding basics of data science
In this blog I will be covering the basic understanding of a data science problem. To begin with I think before starting with any data… Read More »Understanding basics of data science
Reading the academic literature Text Analytics seems difficult. However, applying it in practice has shown us that Text Classification is much easier than it looks.… Read More »The Naive Bayes Classifier explained
In the last post, we talked about how the open sourcing of machine learning algorithms and hardware architecture gives rise to the latest phenomenon of… Read More »Data Science & Technology Monthly: Feb 2016
Originally written by Nigel Higgs on LinkedIn Pulse. We who have been in the data sphere a while and in and around Data Governance will… Read More »Outside In Data Governance – a value driven approach
One of the hot topics on Machine Learning is, with no doubts, feature engineering. In fact, it comes before the buzz on this topic, simple… Read More »Feature engineering? Start here!
Feature selection is one of the core topics in machine learning. In statistical science, it is called variable reduction or selection. Our scientist published a… Read More »An Introduction to Variable and Feature Selection
This is the first in a series about cross format data modeling principles. Names are Simple, Right? In the data modeling realm, there is perhaps… Read More »The Art of Modeling Names
This article focuses on cases such as Facebook and protein interaction networks. The article was written by By Paul Scherer (paulmorio) and submitted as a research… Read More »Detecting and Visualising Clusterings Interaction Networks (And a few other cool things like Facebook)
This article was posted by Adrian Sampson on his own blog. Adrian is an assistant professor in the Department of Computer Science at Cornell University, where here is also… Read More »Statistical Mistakes and How to Avoid Them
Reddit is now at the center of this attack that impacts millions of top domains (most of the Internet) since November 30. While Reddit appears… Read More »Massive Internet Attack Floods the World with Fake Data