Was that a unicorn? No, it was a perfect data scientist. You won't find that person, but you can find a great hire โ if they don't suffer from these maladies.
Iโve said it before and Iโll say again: Data science is a team sport.
The gold rush has started and no one will question the wisdom of buying a random acre of land with a stream and searching for your very own gold nuggetโor in this case, a data scientist. Gosh, there are a lot of articles on what makes a good data scientist.
Enough of that. Iโd rather focus on what makes a bad data scientist who has the potential to harm rather than help your organization. Here are eight signs.
Weak mathematical background
With very few exceptions, data scientists are,ย at their core, math geeks. They fall on a spectrum, from total math types who write terrible Python (and ) to folks who can pop machine learning algorithms off the top of their head. You may need both depending on what youโre doing. But a data scientist with a weak mathematical background probably isnโt a real data scientist. Maybe theyโre a data architect or data engineer, but theyโre more likely a consultant from a staffing firm. This person wonโt help you. A weak mathematical background can hurt in a lot of waysโparticularly in judging whether the results youโre getting are useful.
Weak computing background
Data scientists who are mathematicians but donโt really understand computers arenโt terribly useful (in the same way an executive assistant who uses a typewriter isnโt terribly useful in the modern world). In plenty of circumstances, the way youโd calculate something on paper isnโt the same as how youโd calculate it using a distributed platform like Spark. Your data scientist needs to understand this.
Too good to be true
At the same time, donโt expect to find a data scientistย who is a mathematician, statistician, and distributed computing developer, with an MBA and actual experience as a mathematician, distributed computing developer, business person, and so on. In the words of a friend: โHow old are theyโ80?โ This is why you need a team. When you see a data scientist who meets the โunicornโ definition, remember this simple rule: Unicorns do not exist!ย
Effete academics
Just like there are coders who donโt code and architects with no actual technical expertise, there are data scientists with limited experience with actual, you know, data. Moreover, they donโt want to get their hands dirty by digging in the code. Weโre talking practical application, not theory. Youโre not running a university.
Poor communication skills
Fundamentally, a data scientist is there to bring clarity to data. While you as a technology pro or business expert might not understand all of the math or be able to implement it yourself, to trust in the decision-making process, you should understand it at a high level at least conceptually. Whether itโs a clustering algorithm, probability calculations, or NLP, this stuff isnโt hard to convey. If your data scientist isnโt making that happen, your data scientist is doing a bad job. Your data scientist needs to be approachable and make the process approachable. Also, the ability to communicate clearly with multiple groups at an organization to get adjunct information, data, or access to dataโand details on how the data was developedโmakes the work go much smoother.
No understanding of business problems
You really canโt hire a person who reiterates all of the business in math or statistical terms to your data scientist, who then โsolvesโ problems. Why? Because if person A knows how to do all that, he or she probably knows enough to describe that in an algorithm to a computer. Why do you need person B?
No familiarity with the tools of the trade
There is SAS. There is R. There is Scala. There is Python. There is Matlab and a bevy of other tools. If you donโt see those on the resume, then that person probably isnโt a data scientist.
The SAS-only syndrome
With all due respect for my friends in the Containment Area for Relocated Yankees (Cary, N.C.), it seems like all two-bit SAS developers have rebranded themselves โdata scientists.โ But that doesnโt mean they know anything about data science (aka, knowing how to read the data) except how to write SAS code.
What sort of person do you need? You need an individual with the specific skills to address the problem and augment the existing technology team: a mathematician with programming and analytics experience, business sense, and the ability to talk to CEOs and techs alike. Now, donโt go chasing unicornsโbut donโt settle for chumps, either.
Good luck; youโre going to need it. How competitive is it to snag someone good? Try this experiment: Add โdata scientistโ to your LinkedIn profile and watch a million recruiters shower you with offers of riches.


