Introduction to Apache Spark Streaming A data stream is an unbounded sequence of data arriving continuously. Streaming divides continuously flowing input data into discre...
Step 0: Should I try to Scrape this? So you’re excited about a great idea, you’ve found a great site that looks easy to scrape, time to jump right in and star...
Summary: Is everyone a ‘data scientist’? What about ‘data engineers’ and the junior versus senior, or skill level distinctions? We do seem to need some agre...
It speaks volumes of the world we live in today when headlines such as “The world’s most valuable resource is no longer oil, but data” and “Why Data May Be More V...
You must have heard about the global cyberattack of WannaCry ransomware in over 200 countries. It encrypted all the files on the machine and asked for payment. Ransomware...
You must have heard about the global cyberattack of WannaCry ransomware in over 200 countries. It encrypted all the files on the machine and asked for payment. Ransomware...
The advancement of new technologies and the development of big data require professionals with skills in many fields: computer science, mathematics, statistics and busine...
Big Data and Data Science are two of the most exciting areas in the business today. While most of the decision makers understand the true potential of both the fields, co...
The challenge Recently a colleague asked me to help her with a data problem, that seemed very straightforward at a glance. She had purchased a small set of data from th...
This article was posted by S. Richter-Walsh. A Brief Introduction: Linear regression is a classic supervised statistical technique for predictive modelling which is ...