Guest blog by David Stephenson, Ph.D. David is a data science and big data analytics speaker and thought leader. For over 15 years, David has been delivering analytic and risk management tools that have guided $10+ billion in business decisions. Prior to returning to consulting, David led global analytics for eBay Classifieds Group, reaching 30 countries operating under a dozen consumer facing brands and spread over six continents. David is a frequent keynote conference speaker on big data, analytics, and privacy, with patented work in data science.
I myself am a data scientist and not a recruiter, but lately a number of my clients have asked me to help them scope and recruit for the role of Chief Data Scientist (a.k.a. ‘Head of Data Science’, ‘Lead Data Scientist’, ‘Head of Analytics’, etc.).
The roles are typically opened by the client company for one of two reasons:
- The company is looking to launch a new data science department, or
- The company has already tried to launch a data science department leveraging existing management and has realized the hard way the need to bring in fresh, specialized leadership.
Over the past 20 years I’ve conducted probably several hundred interviews for analytic roles (after screening many, many more C.V.s). The candidates with whom I’ve spoken have come from all over the world, many having completed world-class technical graduate programs or MBAs from schools such as Wharton, Chicago Booth or Oxford. It’s been a real privilege to find and hire many excellent people over the years.
Filling a Chief Data Scientist role, however, is particularly challenging because of the complex requirements which the successful candidate must satisfy:
Possession of three unrelated skill sets
The Chief Data Scientist role requires a unique blend of technical, business and communication skills; skills which often correlate negatively with one another. That is to say, individuals who excel in technical areas often have proportionately less interest in mastering the art of communication with non-technical business colleagues. They often do not even want to confine their efforts to tasks with demonstrable business value. A Chief Data Scientist, however, will not succeed unless he or she possesses both strong communication skills and strong business acumen.
Breadth and depth of technical skills
From an analytic perspective, the role requires both familiarity with a broad range of tools and techniques and an experience-based understanding of what is involved in in-depth technical implementations. There is certainly space in an organization for specialists in areas such as statistics, deep learning, NLP, or integer programming, but for the lead role, the right candidate must have an overview of the entire analytic tool chest, so that he or she can choose the analytic techniques that best address the business problems at hand.
Even more, the candidate must be familiar with analytic tooling, including database technologies (e.g. SQL, NoSQL, graph databases), programming frameworks (e.g. Spark, MapReduce), development languages (e.g. Python, R, MATLAB, Java), and prototyping tools (e.g. RapidMiner, Enterprise Miner, KNIME). The technology space is already quite broad and is developing very rapidly. Properly leveraging existing technologies can easily save months or years of in-house development.
Ability to deliver results
I love Joel Spolsky’s Guerrilla Guide to Interviewing. At the risk of oversimplifying, Joel looks for only two things in a technical candidate: 1) intelligence and 2) the ability to get things done. In the case of a Chief Data Scientist, initiatives will almost certainly not ‘get done’ if the leader is not able to:
- Understand tangible business drivers and KPIs
- Identify appropriate data science techniques, tools, and applications, typically drawn from cross-industry studies
- Execute the analytics projects in a lean manner
- Communicate vision and win buy-in from peers
I’ve spoken to quite a number of smart candidates with great backgrounds who didn’t have the track record to convince me that they would deliver amazing results in a business environment. When looking to fill a lead role, I always need to pass on these candidates.
Three phases of the hiring process
There are three main phases through which I typically progress alongside the company seeking to recruit for this role.
1. Working with the recruitment team
My favorite part of the process. The men and women in recruitment are always a lot of fun to work with and are very enthusiastic to learn to recruit new profiles. The Chief Data Scientist role is almost always completely new to them in terms of skill sets, technologies, background and business experience, and so I find it necessary to work very closely with the internal recruiter(s) over multiple sessions in order to scope the role properly, to identify appropriate distribution channels, and to sketch the profiles of candidates to whom they should reach out.
I also work with both recruitment and with the hiring executive to develop a salary indication. It’s important to think about salary early, as many companies do not realize the high premium that this role commands in the job market these days. A company will lose qualified candidates if it takes too long to bring its salary expectations up to a market-conforming level.
I find it helpful to continue working very closely with the recruiter and the hiring executive throughout the hiring process.
2. Finding strong candidates
The most challenging part. We are looking for an individual who can take complete ownership of the analytic program within a company, and, depending on the company’s organization, quite possibly take responsibility for data governance. The techniques for finding and screening the candidates would require another very long article.
3. Landing the candidate
The top candidates for this role will have many job options. It’s important to offer a competitive salary and to follow up closely with the candidate to quickly address any ancillary concerns. As with most positions, geography often plays an important role in a candidate’s decision.
For lead data science roles, my experience is that the really strong candidates will be drawn most by the opportunity to work with interesting and abundant data and by the opportunity to contribute to growing an interesting business in creative and meaningful ways. These top candidates are people who love applying complex analytics to large data sets using innovative tooling in order to bring tangible results to real-world business problems. In essence, these are the candidates who want nothing more than to bring real growth to the business.
These candidates are out there. They may be difficult to find, and even more difficult to screen, but they’re out there.
How has your experience been in this area, either as a recruiter, as an executive or as a data scientist? I’d like to hear your thoughts and suggestions in the comments below.
This article, alongside others which occasionally appear, is posted on my website blog. You can also stay connected with me via LinkedIn or Twitter.