Business Insider analyzed tech skills posted in job ads found on Dice.com. Below are the 6 data science skills found in the top 10, in terms of salary. This is in accordance with a previous study, performed using Glassdoor data (instead of Dice) to conclude that data scientist is the most promising job in 2016.
- MapReduce is a programming framework used for analyzing data to get meaning out of it. It’s most commonly associated with Hadoop, a system for storing data across a bunch of low-cost servers.
- Pig is another hot skill associated with Hadoop. It’s a programming language that lets you extract information from Hadoop find answers to questions or otherwise use the data.
- PaaS is “platform as a service,” which is a cloud system that lets developers build entire applications and host them in the cloud. Microsoft’s Azure is the most prominent example.
- Cloudera makes a commercial version of Hadoop. Although Hadoop is a free and open-source project for storing large amounts of data on inexpensive computer servers, the free version of Hadoop is not easy to use. Several companies have created friendlier versions of Hadoop, and Cloudera is among the most popular.
- Cassandra is a free and open source “noSQL” database, which can handle and store data of different types and sizes. It’s increasingly the go-to database for mobile and cloud applications. Apple uses Cassandra in a big way to store over 10 petabytes of data. Netflix uses it, too, among many others.
- HANA is SAP’s database, the company’s Oracle competitor. HANA is a part of a new wave of databases, known as an in-memory database. It runs entirely in a computer’s memory instead of on storage disks. It can crunch large amounts of data nearly instantly.
Four skills are related to cloud computing and software deployment automation (DevOps), and are not listed here. PaaS is borderline IT / Data Science. To read the original article and check out the salary attached to these 10 skills, click here.
Source for picture: click here
Which other skills would you add? Not sure why Hadoop, Spark and Hive are not listed. I would add deep learning, though it is not considered as a skill but rather, a body of knowledge.
DSC Resources
- Career: Training | Books | Cheat Sheet | Apprenticeship | Certification | Salary Surveys | Jobs
- Knowledge: Research | Competitions | Webinars | Our Book | Members Only | Search DSC
- Buzz: Business News | Announcements | Events | RSS Feeds
- Misc: Top Links | Code Snippets | External Resources | Best Blogs | Subscribe | For Bloggers
Additional Reading
- What statisticians think about data scientists
- Data Science Compared to 16 Analytic Disciplines
- 10 types of data scientists
- 91 job interview questions for data scientists
- 50 Questions to Test True Data Science Knowledge
- 24 Uses of Statistical Modeling
- 21 data science systems used by Amazon to operate its business
- Top 20 Big Data Experts to Follow (Includes Scoring Algorithm)
- 5 Data Science Leaders Share their Predictions for 2016 and Beyond
- 50 Articles about Hadoop and Related Topics
- 10 Modern Statistical Concepts Discovered by Data Scientists
- Top data science keywords on DSC
- 4 easy steps to becoming a data scientist
- 22 tips for better data science
- How to detect spurious correlations, and how to find the real ones
- 17 short tutorials all data scientists should read (and practice)
- High versus low-level data science
Follow us on Twitter: @DataScienceCtrl | @AnalyticBridge