Andrew C. Oliver
Contributing Writer

8 telltale signs of a bad data scientist

analysis
Mar 3, 20165 mins

Was that a unicorn? No, it was a perfect data scientist. You won't find that person, but you can find a great hire โ€” if they don't suffer from these maladies.

young boy raising hand in front of chalkboard
Credit: Thinkstock

Iโ€™ve said it before and Iโ€™ll say again: Data science is a team sport.

The gold rush has started and no one will question the wisdom of buying a random acre of land with a stream and searching for your very own gold nuggetโ€”or in this case, a data scientist. Gosh, there are a lot of articles on what makes a good data scientist.

Enough of that. Iโ€™d rather focus on what makes a bad data scientist who has the potential to harm rather than help your organization. Here are eight signs.

Weak mathematical background

With very few exceptions, data scientists are,ย at their core, math geeks. They fall on a spectrum, from total math types who write terrible Python (and ) to folks who can pop machine learning algorithms off the top of their head. You may need both depending on what youโ€™re doing. But a data scientist with a weak mathematical background probably isnโ€™t a real data scientist. Maybe theyโ€™re a data architect or data engineer, but theyโ€™re more likely a consultant from a staffing firm. This person wonโ€™t help you. A weak mathematical background can hurt in a lot of waysโ€”particularly in judging whether the results youโ€™re getting are useful.

Weak computing background

Data scientists who are mathematicians but donโ€™t really understand computers arenโ€™t terribly useful (in the same way an executive assistant who uses a typewriter isnโ€™t terribly useful in the modern world). In plenty of circumstances, the way youโ€™d calculate something on paper isnโ€™t the same as how youโ€™d calculate it using a distributed platform like Spark. Your data scientist needs to understand this.

Too good to be true

At the same time, donโ€™t expect to find a data scientistย who is a mathematician, statistician, and distributed computing developer, with an MBA and actual experience as a mathematician, distributed computing developer, business person, and so on. In the words of a friend: โ€œHow old are theyโ€”80?โ€ This is why you need a team. When you see a data scientist who meets the โ€œunicornโ€ definition, remember this simple rule: Unicorns do not exist!ย 

Effete academics

Just like there are coders who donโ€™t code and architects with no actual technical expertise, there are data scientists with limited experience with actual, you know, data. Moreover, they donโ€™t want to get their hands dirty by digging in the code. Weโ€™re talking practical application, not theory. Youโ€™re not running a university.

Poor communication skills

Fundamentally, a data scientist is there to bring clarity to data. While you as a technology pro or business expert might not understand all of the math or be able to implement it yourself, to trust in the decision-making process, you should understand it at a high level at least conceptually. Whether itโ€™s a clustering algorithm, probability calculations, or NLP, this stuff isnโ€™t hard to convey. If your data scientist isnโ€™t making that happen, your data scientist is doing a bad job. Your data scientist needs to be approachable and make the process approachable. Also, the ability to communicate clearly with multiple groups at an organization to get adjunct information, data, or access to dataโ€”and details on how the data was developedโ€”makes the work go much smoother.

No understanding of business problems

You really canโ€™t hire a person who reiterates all of the business in math or statistical terms to your data scientist, who then โ€œsolvesโ€ problems. Why? Because if person A knows how to do all that, he or she probably knows enough to describe that in an algorithm to a computer. Why do you need person B?

No familiarity with the tools of the trade

There is SAS. There is R. There is Scala. There is Python. There is Matlab and a bevy of other tools. If you donโ€™t see those on the resume, then that person probably isnโ€™t a data scientist.

The SAS-only syndrome

With all due respect for my friends in the Containment Area for Relocated Yankees (Cary, N.C.), it seems like all two-bit SAS developers have rebranded themselves โ€œdata scientists.โ€ But that doesnโ€™t mean they know anything about data science (aka, knowing how to read the data) except how to write SAS code.

What sort of person do you need? You need an individual with the specific skills to address the problem and augment the existing technology team: a mathematician with programming and analytics experience, business sense, and the ability to talk to CEOs and techs alike. Now, donโ€™t go chasing unicornsโ€”but donโ€™t settle for chumps, either.

Good luck; youโ€™re going to need it. How competitive is it to snag someone good? Try this experiment: Add โ€œdata scientistโ€ to your LinkedIn profile and watch a million recruiters shower you with offers of riches.