I basically did what you’re talking about. Masters in physics, then went into se...

jeff76 · on Jan 1, 2019

Great reply. My question based on this comment,

>My advice is assuming you’d like to be a person that trains/deploys ML models to solve problems in industry. This is much different than an ML Engineer, who’s implementing algorithms in low level languages and squeezing out efficiency. Obviously that would require a much deeper understanding of SWE. And a totally different person is an academic researcher that’s developing theory or technique. It’ll be hard to do that without a PhD

Can one not only train/deploy ML models, but in addition to that be able to implement the algorithms in low level languages and also be able to develop theory?

I’d imagine these are all skill sets that someone in PhD program could pick up.

If they could do all three, what kind of job should they be looking for?

agentofoblivion · on Jan 1, 2019

I think it’s unlikely to become expert in all of those things. If you do, it’s over the course of an entire career, not to get started. I guess it comes down to how much expertise is “enough” for you. Naturally, if you split your time across 3 domains you won’t be as expert as someone who dedicated all their time to going deep in one.

In the context of a big company, I think it makes sense to have a specialized workforce. Why look for the one in a billion person that can publish top quality theoretic papers and then implement them on distributed gpus in an optimal way while also building simple Random Forest models for your business? I’d rather that person do more of the most valuable thing, and then hire someone else to do the rest.

jeff76 · on Jan 2, 2019

This answer makes sense.

I suppose my question is more along the lines of, if someone is specializing in deep learning in a PhD program then shouldn’t they at the very least be able to implement models and also know optimization tricks?

In other words shouldn’t they be able to develop enough skills to go deep in one area but also know enough to be dangerous in the other three domains?

agentofoblivion · on Jan 2, 2019

I think I agree with you with the caveat that it would depend on what they're researching. If they're researching new model architectures, I don't think it makes sense for them to try to implement the algorithms from scratch in C++/CUDA to do distributed GPU training--why not just use TensorFlow? But if you're researching distributed tensor computation, then that's your bread and butter.

scatter · on Jan 2, 2019

Great reply. Just out of curiosity, did you end up giving your semiconductor job before joining the start up with a data scientist title ?

I am deep into semiconductors, and am facing the dilemma of giving up my expertise so far, to join a startup as an entry level engineer.

I have done a couple of MOOC specializations and am trying to find projects within my industry to gain some credibility. Also trying to stay active on Kaggle to build some basic data analysis portfolio.

agentofoblivion · on Jan 2, 2019

In my particular case, I did quit my job before I got my "real" Data Science position. I can't recall exact timelines, but I think I had already lined up the relationship with the startup.

The reason I did this is because there just wasn't enough hours in the day, and my job was taking ~10 hours a day with commuting...etc. It was a risk, but the idea was that I would be able to transition much more quickly if I worked full-time towards it. I also had the financial savings to support myself for 6-9 months and was willing to get a part time job if necessary. Once it became clear that my job's only purpose was to pay the bills in the context of my goals, and I had enough to pay the bills for the near future, it was clear that quitting was the easiest way to free up a lot of time.

This turned out to be the best decision of my career, but YMMV. I doubled my salary in less than 2 years. It's also nice to be part of an industry that isn't so cost-sensitive. I also have a skill set that's in much higher demand, so you can live almost anywhere and there's a ton of companies that want/need it. With semiconductors, you're much more limited.

It's true that you're giving up some expertise and will start in a less senior position in a different field than if you stayed in semi. Sticking around because you have experience is classic Sunk Cost Fallacy. Think 5 years down the line. If you leave now, you'll have 5 years experience in ML. You'll definitely be giving something up if you leave, but there's huge opportunity cost if you don't leave.

scatter · on Jan 2, 2019

Would you be open to have a brief conversation on the phone for advice? My email is knariks@gmail.com.

ultrasounder · on Jan 2, 2019

Hi OP here. I don't think you need to give up Your Experience to join a startup as a Data scientist. But what won't be 100% transferrable would be your Semiconductor specific skills(think spice, process technology etc). If you have been coding your current job that is a skill that is transferrable.Your PhD is a massive foot in the door. Have you considered something like Data incubator or Insight data science fellowship which require a minimum P.hD to transition.