Dataconomy

The Data Science Skills Network

by Ferris Jumah
August 21, 2015
in Data Science

Ferris is a full stack data scientist at LinkedIn who enjoys building products at the forefront of intelligent technology. He understands that the next generation won’t be concerned with how to use technology to do things, but will expect technology to do and adapt for them.


As a data scientist, I am usually heads down in numbers, patterns, and code, but as crazy as it sounds, one of the hardest parts of my job is actually describing what I do. There are plenty of resources that offer descriptions and guides on the career of a data scientist. I’ve heard them described as those at the intersection of statistics, hacking abilities, and domain expertise. Or, as data analysts who live in San Francisco.

Rather than add a new definition to the collection, I thought I’d take a data-centric approach towards defining the role. I looked at what skills people with the title “Data Scientist” have listed on their LinkedIn profiles and aggregated the top ten by occurrence*.

[Figure: Most Popular Data Science Skills]

*Corrected using a measure called TF-IDF (term frequency-inverse document frequency)
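The post does not say exactly how the TF-IDF correction was applied, so the following is only a minimal sketch of one plausible reading: treat the pooled skills of each job title as a "document" and down-weight skills that show up under every title. The job titles, the skill lists, and the `tfidf_scores` helper are all hypothetical and are not LinkedIn data.

```python
import math
from collections import Counter

# Hypothetical toy data: skills pooled per job title (NOT real LinkedIn data).
profiles_by_title = {
    "Data Scientist": ["Machine Learning", "Python", "Microsoft Office",
                       "Machine Learning", "R"],
    "Accountant": ["Microsoft Office", "Auditing", "Microsoft Office"],
    "Web Developer": ["JavaScript", "Python", "Microsoft Office"],
}


def tfidf_scores(docs, title):
    """Score each skill listed under `title` as term frequency times
    inverse document frequency, where each title is one document."""
    counts = Counter(docs[title])
    total = sum(counts.values())
    n_docs = len(docs)
    scores = {}
    for skill, count in counts.items():
        tf = count / total                       # share of this title's skill mentions
        df = sum(1 for skills in docs.values() if skill in skills)
        idf = math.log(n_docs / df)              # zero when every title lists the skill
        scores[skill] = tf * idf
    return scores


scores = tfidf_scores(profiles_by_title, "Data Scientist")
ranked = sorted(scores, key=scores.get, reverse=True)
print(ranked)
```

In this toy example, "Microsoft Office" is listed under every title, so its inverse document frequency is zero and it falls to the bottom of the ranking even though it is mentioned as often as "Python" or "R".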




While this list sheds some light on which skills appear most often on the profiles of data scientists, a static ranking makes it hard to see how they relate to each other. To dig a bit deeper, I explored the relationships among these skills by representing and visualizing them as a network. Voilà, the Data Science Skills Network:

[Figure: Data Science Skills]

In the network, each node is a skill. Skills are connected when both are listed together in a profile, with the connection growing stronger the more often they are listed together. Since the goal was to visualize the relationships between skills, I clustered similar skills together, represented by colors. Next, skills were sized depending on how well connected they were, and to what extent they influenced other skills in the network, using a measure called network centrality. While there are plenty of conclusions to be drawn, both figures highlight a few key themes. Namely, that today’s data scientists typically:

Approach data with a mathematical mindset

  • We see that machine learning, data mining, data analysis, and statistics are all high-ranking skills in the network. This indicates that being able to understand and represent data mathematically, with statistical intuition, is a key skill for data scientists.

Use a common language to access, explore and model data

  • Python, R, and MATLAB are the three most popular languages for visualization and model development, and SQL is the most common for data access. Extracting and exploring data and testing hypotheses are a big part of the job, so it’s no surprise to see these skills rising to the top.

Develop strong computer science and software engineering backgrounds

  • We also see computer science and software engineering skillsets, with Java, C++, Algorithms, and Hadoop all having notable real estate on the Network visualization. These are skills that are primarily used to leverage data to architect systems.

In my experience, most data scientists will not be experts in all of these categories (math, tools, and software development) but will instead specialize in one or two of them. Taken together, the categories are therefore better read as a holistic view of the skills represented within a typical data science team.
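The network construction described earlier (skills as nodes, connections that strengthen each time two skills are listed together, node size driven by centrality) can be sketched in a few lines. This is a minimal illustration under stated assumptions: the profiles are hypothetical toy data, the helper names are invented for this example, and weighted degree is used as the centrality measure only because the post does not say which measure was used. The clustering step that produced the colors is not specified either, so it is left out.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy profiles: each set is the skills listed on one profile.
profiles = [
    {"Python", "Machine Learning", "SQL"},
    {"Python", "Machine Learning", "Statistics"},
    {"Java", "Hadoop", "Python"},
    {"R", "Statistics", "Machine Learning"},
]


def co_occurrence_edges(profiles):
    """Count how often each pair of skills is listed on the same profile.
    Keys are alphabetically ordered (skill_a, skill_b) tuples."""
    edges = Counter()
    for skills in profiles:
        for a, b in combinations(sorted(skills), 2):
            edges[(a, b)] += 1
    return edges


def weighted_degree(edges):
    """Sum the edge weights touching each skill. This is an assumed
    stand-in for the unspecified 'network centrality' in the post."""
    strength = Counter()
    for (a, b), weight in edges.items():
        strength[a] += weight
        strength[b] += weight
    return strength


edges = co_occurrence_edges(profiles)
strength = weighted_degree(edges)

for skill, score in strength.most_common():
    print(f"{skill}: {score}")
```

Other centrality measures, such as eigenvector centrality or PageRank, also reward being connected to well-connected skills, which is closer to the "influence" the post mentions. Weighted degree is simply the easiest to verify by hand.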

I hope this helped to shed some light on what a data scientist is, and what skills are required to become one. These analyses are all pulled from the skills you list on your LinkedIn profile so hopefully it is also a reminder for you to keep your profile up to date.

Thank you, and I’d be interested in hearing your thoughts below.

(This post was originally published on LinkedIn.)

Tags: data science skills

Comments 11

  1. josephwnorman says:
    8 years ago

    Cool, thanks for this! How did you measure similarity for coloring?

  2. Shan says:
    7 years ago

    I have most of the skills required for DS, so will taking a data science course online help?


COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.
