Working at SharpestMinds, I’ve read a lot of data science resumes.
Because the platform is built to be a giant feedback machine, we constantly hear back from companies who post to it — not only about whether they want to interview or hire candidates but also about the reasons why they chose to interview or hire the ones they did.
That visibility into the decision-making processes of hundreds of companies has taught us a lot about what good resumes look like, and — just as importantly — what they don’t look like.
Caveat: every company looks for something different.
What gets you hired at Google may or may not work at other companies (and may even be meaningless).
So building the “perfect” all-purpose data science resume is all but impossible.
Having said that, there are a number of definite mistakes that we’ve seen pretty well guarantee that your application won’t be considered.
It’s hard to think of a faster way to have your resume thrown into the “definite no” pile than featuring work you did on trivial proof-of-concept datasets among your highlighted personal projects.
When in doubt, here are some projects that hurt you more than they help you:

Why it hurts you

Space on a resume is limited.
Candidates know it, and so do recruiters.
So if “training wheel datasets” like MNIST are taking up some of that precious space, it can raise questions in recruiters’ minds about how far along you could really be in your data science journey.

What to do about it

If your resume features these kinds of projects, and you don’t have other, more challenging and substantial ones to replace them with, it’s a strong indication that you need to put some serious time into building up your portfolio.
Of course, if you do have other, more interesting projects to showcase, then you’ll definitely want to swap them in.

Exceptions

It’s entirely possible to build sophisticated projects using well-studied datasets like MNIST or the Titanic Dataset.
If you’re playing with a new kind of GAN you’ve invented, or you’re replicating an interesting capsule network paper, then go for it.
But keep in mind that most recruiters are nontechnical, and they often work simply by looking for keywords, so you’ll have to make it very clear that your MNIST project involves more than just the trivial digit classification task.
Massive open online courses (MOOCs) like Udacity, Coursera, and deeplearning.ai are fantastic ways to dive into deep learning and data science.
Having said that, many companies look at candidates with skepticism if that’s all they have to show for themselves.
Some things to avoid:

Why it hurts you

There’s a fair bit of ego that goes into hiring.
Companies want to be able to say that they only hire “very special people,” or “the top 1% of applicants,” etc.
And because so many people now have MOOC certifications, it’s easy to be typecast as just another Udacity grad, which makes you sound less special.
Recruiters are now familiar enough with many MOOCs to instantly recognize projects that are part of a standard nanodegree or MOOC (e.g., Udacity’s traffic sign classification task, which is nontrivial and even somewhat interesting, is featured on a disproportionate number of resumes).
To sound special, you need to focus on problems that aren’t as well-explored.
***I want to be clear about something here: Udacity, Coursera, and deeplearning.ai are awesome programs.
They’re also legitimately correlated with hireability and technical ability (according to our data).
But putting them at the top of your list of advertised data science experience, with few other projects or experience, sets off red flags for recruiters who are looking for “someone really special.”

What to do about it

As soon as you finish your course or nanodegree, you’ll want to strike out on your own, and take part in Kaggle competitions, or replicate the results of interesting papers in the data science literature.
This is important because it: 1) makes you look much more unique, 2) gives you cutting-edge work to showcase and discuss during interviews, and 3) shows that you’re capable of self-directed (and unsupported!) study.

Exceptions

One exception to this rule is capstone projects you may have completed as part of your MOOC, provided that they’re unique.
By unique, I mean that you had the freedom to choose the dataset you wanted and to solve the problem end-to-end largely on your own.
Because this is effectively no different from going rogue and working on a side-project by yourself, it’s less likely to hurt you.
Not including skills that indicate that you know how the data science sausage is made can be fatal.
Here are some must-haves:

Why it hurts you

What gets people excited about data science tends to be the algorithms.
It’s fun to think up a neural network or boosted-tree architecture that could solve your problem.
And as a result, that’s where most people invest their time.
The problem is that designing models is not the same thing as production-level deep learning or data science.
The not-so-fun parts of data science (setting up servers, cleaning data) are virtually guaranteed to make up most of your day-to-day as a data scientist, and the reality is that it’s no longer enough to be good at Python/sklearn/TensorFlow/Keras/PyTorch.
Not including these bread-and-butter skills is also a great way to get skipped over by the median keyword-seeking recruiter, who is more interested in finding a reason to say “no” than one to say “maybe.”

No GitHub? No mongo? No thanks.

What to do about it

If you have the skills but they’re not listed on your resume, list them.
If you don’t have any experience with version control, DevOps, and databasing tools, you need to get some.
And not just because it looks good on a resume, but because it’s a necessary part of your bag of tricks as a data scientist.

Exceptions

Listing these skills can be less important if you’re applying for more senior positions, where your experience with data science tools will be assumed.
Include a project on your resume, and there’s a good chance you’ll be asked about it if you’re interviewed.
If the interviewer asks what you learned by working on a given project, “not much, really,” is not the right answer.

Why it hurts you

Since you’ve taken the time to work on a project, questions about what you learned from it tell the interviewer a lot about your communication skills and about how deeply you like to think about your problems.
It’s unlikely that even a fairly straightforward dataset won’t have something to teach you, so walking away none the wiser from a project can raise serious red flags.

What to do about it

If you have a project listed on your resume, and you land an interview with a company, be prepared with some insights that might surprise someone who hasn’t worked with the dataset you tackled in that project.

Exceptions

None, really.
If you include a project on your resume, you really should have learned from building it.
OK, this one’s not data science-specific, but we were legitimately shocked to find how strongly typos correlated with interview performance.
Consistently, people whose resumes feature typos (whether misspellings or even sloppy formatting) performed more poorly than those whose resumes didn’t.
Typos are a great example of a totally unforced error that you really shouldn’t be making, regardless of your level of experience.
And they lead to objectively lower application-to-interview transition rates.

Why it hurts you

Unsurprisingly, attention to detail on your resume is, and is perceived to be, correlated with attention to detail on projects and technical development.

What to do about it

Have your resume reviewed by a friend who is a) a native English speaker, and b) attentive to detail.
If you have to, promise them $50 if you get hired, just to motivate them that much more to catch any small details that are out of place.

Exceptions

There should be no exceptions.
Original.
Reposted with permission.