Compilation of Advice for New and Aspiring Data Scientists

Condensing 30+ pieces of advice into 6 minutes

Conor Dewey, Apr 2

Maybe it’s just me, but I’ve noticed more and more posts on Medium and elsewhere in the data science community that offer advice to newcomers in the field.

I think this is awesome. This type of content helped me immensely on my journey to becoming a data scientist, and it still helps me today as I continue to learn and grow.

But as the number of these posts grows, it takes more work on the reader’s part to seek out, sift through, and process all of the available information.

This post is designed to make it a little easier for aspiring data scientists to find all of the excellent advice out there from experts in the field.

The majority of the ideas below are condensed from the following 6 posts that I found especially helpful:

- Advice for New and Junior Data Scientists
- 12 Things I Wish I Knew Before Starting as a Data Scientist
- Advice for New Data Scientists
- 6 Recommendations for Aspiring Data Scientists
- 16 Useful Advices for Aspiring Data Scientists
- Aspiring Data Scientists Master These Fundamentals

First, I went through each article and plucked out each individual insight or piece of advice.

Then I looked over the list of ideas and made note of any common themes among the different resources, seen below.

Later on in this post, I include all of the other pieces of advice that stood alone.

Let’s get to it.

Common themes

As mentioned previously, here are the ideas that were repeated among several of the articles linked above.

For each point, I’ll include a bit of commentary from myself to go along with it.

Master the art of communication

This was probably the most popular theme of them all.

The importance of communication in data science is often harped on, and for good reason.

Sifting through data to find insights is useless if you can’t communicate those insights and drive impact in some way.

Like anything else, effective communication is a skill that you can practice and improve on with time.

Build a solid statistics foundation

Whether it’s data analytics, machine learning, running experiments, or something more esoteric, you can’t avoid the use of statistics.

Taking the time to build a solid foundation and master fundamental statistics concepts will pay off over and over again.

Be skeptical: question your assumptions constantly

As data scientists, we make assumptions constantly, whether we know it or not.

These assumptions might be related to the data we’re working with or the problem that we’re trying to tackle, but they need to be questioned.

By keeping some level of paranoia about our outputs, we ensure that we’re on the right track.

This skill is often associated with exploratory, research-oriented work, but it applies far more broadly than you might think.

Curiosity will take you a long way: ask lots of questions

Similar to the last point, it also pays to be curious.

Curiosity can lead you to interesting insights that you would never have found otherwise.

It drives you to adopt a growth mindset and ask questions constantly — questions that will help you learn and grow as much as the work itself.

Put your work out into the world (GitHub, blogging, etc.)

This is a big one for me.

Early on in my career, I learned about the benefits of putting your work out into the world.

Whether it’s through blog posts, projects, tweets, or something else — it doesn’t really matter.

What matters is that you are putting something out there.

The tweet below pretty much sums up my stance on this:

[Embedded tweet]

Build learning projects with real data that interest you

When working on learning projects, make sure you’re interested in the topic.

This seems pretty straightforward, but plenty of aspiring data scientists get caught up trying to produce the project that seems the most complex or impressive to would-be employers and colleagues.

Stick to what you enjoy and use real-world data instead of super-clean Kaggle or UCI datasets.

For bonus points, collect some data and build your own dataset.

You’ll never know everything, and that’s okay

It’s clear that data science is a broad, complex field.

You could spend your whole working life practicing it and still only scratch the surface.

There’s always going to be another technique to master, another tool to learn, and another paper to read.

This is why imposter syndrome is so relevant in the field.

I find this to be frustrating and exciting all at once.

Pick the right tools for the problem and master them

Along the same lines, just because you can’t master every tool out there doesn’t mean that you shouldn’t master some of them.

There will probably be a couple of building blocks that you spend most of your day working with.

That might be R, SQL, Vim, Airflow, Scikit-learn, anything really.

It doesn’t matter which, as long as you home in on your critical tools and learn them well.

More key points

These are the ideas that I didn’t find in more than one of the linked posts.

You’ll find equally useful and interesting information here, some more specialized than the common themes from before.

- Prioritize effectively
- Learn to properly estimate how long tasks will take
- Think about your critical path
- Partner with experienced data scientists
- Teach and evangelize data science
- Learn domain knowledge, not just methods
- The most important skill is critical thinking
- Go to events: hackathons, conferences, meetups
- Learn relevant skills, not just technical ones
- Be flexible with how you enter the field
- Get some hands-on experience with cloud computing
- Get used to gluing things together and standing up services
- Write a white paper
- Always make sure you understand your data before diving in
- A mix of algorithms will usually beat just one
- Take as many math and physics courses as possible
- Invest in your software engineering skills
- Trust yourself and follow your passion
- Try out different roles within data science
- When communicating analysis, tell a story
- Distribution of a variable is usually more important than its location
- Sampling is hard and won’t always be perfect
- Become a confident command line user to boost productivity
- Be learning constantly
- Start a blog and build out a portfolio to display your skills
- Look for companies that leverage data science for their strategy
- The size of the company will affect your role
- Don’t demand perfection out of your first job
- Learn how to sell your ideas

Additional resources

Along with the primary articles that I used to compile this list, there are a million other great blog posts for aspiring data scientists to utilize.

These posts below also helped inform some of the key points listed above.

- Advice Applying to Data Science Jobs
- Advice on Building Data Portfolio Projects
- Advice to Aspiring Data Scientists: Start a Blog
- The Two Sides of Getting a Job as a Data Scientist
- Doing Data Science at Twitter

For more information on any of the bulleted points above, be sure to explore the awesome resources that I linked to throughout this post.

The journey to data scientist isn’t an easy one.

Starting out as a data scientist is no different.

But the beauty of information sharing makes things that much easier.

It lets us learn from those that came before us.

I think that’s a pretty cool thing.

Pass it on and enjoy the ride.

Thanks for reading! Feel free to check out some of my similar essays below and subscribe to my newsletter to receive any new content.

- 5 Resources to Inspire Your Next Data Science Project
- The Big List of DS/ML Interview Resources
- Python for Data Science: 8 Concepts You May Have Forgotten

You can follow me on Medium for more posts like this and find me on Twitter as well.

For more on me and what I’m up to, check out my website.

