Python and R are the two most commonly used languages for data science today. They are both fully open source products and completely free to use and modify as required ...
In the twentieth century, oil was the most valuable resource – but not anymore. In today’s digital age data is the new oil. It will play a similar, perhaps bigger rol...
After my last blog on the use of relational databases PostgreSQL and MonetDB to help compensate for R’s RAM limitations, I received an email from a reader who ask...
When the first release of Spark became available in 2014, Hadoop had already enjoyed several years of growth since 2009 onwards in the commercial space. Although Hadoop s...
A few days ago, while discussing with colleagues why many organizations were still fumbling at what’s typically called the “last mile” in data science, the conversa...
Knowledge is power With IDC estimating that the data mountain has now reached five zettabytes, it is not a case of a business not having enough data to make business deci...
We all know that Data does not lie. However, when we create visualizations we may at times “stretch the truth.” This is often done to help others realize the true sto...
Over the past few years, AI and big data have powered numerous technologies that have changed the way we live, from autonomous cars to conversational systems to personali...
This article was written by Emma Walker. Emma is a data scientist at Qriously. When I started to learn about data science and consider it as a career choice, there was a...
As we move into 2018, Analytical Data Infrastructure (ADI) is becoming a significant topic in business intelligence and analytics. Where Big Data was once an over-hyped, ...