Evaluate Yourself As a Data Scientist

What skills do you need to be a data scientist?

I read an interesting data-driven approach to answering this question in the book Doing Data Science: Straight Talk from the Frontline.

In this post I summarize this self-assessment approach that you can use to evaluate your strengths as a data scientist and where you might fit into an amazing data science team.

You can use applied machine learning practitioner as a synonym for data scientist if you like.

Data Science Unicorns

Reviewing jobs for data scientists, the authors of Doing Data Science see that employers are looking for unicorns.

Job ads seek employees that do not exist with strengths in computer science, statistics, communication, data visualization, and domain expertise.

This is not surprising given how the term “data scientist” is ill defined, employers don’t even know what they need or even what problems they need solved.

Skill Histogram

Cleverly, the authors make a list of common required skills of data scientists from job ads.

They use this list and suggest that you rank yourself on a relative scale (0-100) against each skill.

Finally they suggest that you present the results as a bar graph or histogram.

A single person won’t have all the skills, but a well designed data science team will.

The skills in this self assessment are as follows:

  • Computer science
  • Math
  • Statistics
  • Machine learning
  • Domain expertise
  • Communication and presentation skills
  • Data visualization

An example of a completed assessment for Rachel from page 11 of the book is as follows:

Data Science Profile

Data Science Profile
Taken from page 11 of Doing Data Science

I think this is a useful tool to help you focus on your strengths and acknowledged your weaknesses that team members can help you cover.

Ensemble of Skill Histograms

Good data science results require a team.

An individual may have a speciality and be generally weak a other areas. It is when individuals with diverse strengths are brought together into a team that you are able to do great data science.

The authors demonstrate this pictorially as follows (taken from page 12 of the book)

Profile of a Data Science Team

Profile of a Data Science Team
Taken from Page 12 of Doing Data Science

Assess Yourself

What is your strength from the 7 listed above?

Are you able to give yourself a subjective score between 1-100 on each of these skills?

See below for my attempt at a self-assessment.

Jasons Subjective Skill Assessment

Jason’s Subjective Skill Assessment

It’s hard. I believe my strengths are perhaps in computer science, machine learning and communication. The graph above suggests that my visualization skills are not awesome.

I think it is very easy to inflate your skills. How good is good and how do you compare one skill to another? Being good at discrete math in computer science does not help your math score if your calculus is rubbish. Stats is math right? So on. Nevertheless, you have to start somewhere.

The key learning here is to identify and double down on your strengths. You cannot master all the skills. Bring your strongest skill to the table.

Post your results below, I think it would be a fascinating way to group people together on small projects or kaggle competitions.

Is there a skill missing from above?

3 Responses to Evaluate Yourself As a Data Scientist

  1. Avatar
    Brandon M. Burroughs March 3, 2015 at 2:18 am #

    Hey Jason,

    After reading “Doing Data Science”, I latched onto the idea of the Data Science profile. It’s a cool idea that illustrates your skillset well.

    Here’s a link to my Data Science profile.

    https://github.com/brandonmburroughs/Data_Science_Profile

    I’ve also posted my code to generate the plot and written a few thoughts about it. Let me know what you think!

  2. Avatar
    Dalila June 1, 2022 at 4:20 am #

    Hi Jason,
    Thanks for the article it is really interesting
    However I think you data science profile links are dead

Leave a Reply