A New Batch of Deep Learning Startups to Watch for in 2017

karambahh · on Jan 2, 2017

Sorry to be negative, but can we stop touting companies solely on the mathematical concepts they rely on?

Deep learning techniques are indeed very efficient at solving a very large spectrum of problems, but it does not mean that every company using DL is remarkable. That's remarkable is the goal they achieve or aim to achieve.

If they use decades old technologies in a meaningful way for their target industry, does it make their company less likely to suceed? No it does not, for all that matters is solving the customers' issues...

My company use DL in several fields, but what we sell is simply a solution to customers needs...

plafl · on Jan 2, 2017

I agree with you, but actually two of the companies listed target computer vision and the other three seem to be working on deep learning related hardware which looks like a very valid business although not consumer facing. I suppose most of them will fail, like most startups do, deep learning or not.

deepnotderp · on Jan 2, 2017

Agreed, silly deepmind! /s

The fact of the matter is, if you are smart enough, you can get acquihired for a lot of money. DeepMind, Geometric Intelligence, Deep Blue Labs, Vision Factory, TupleJump, Turi, etc. are all examples.

karambahh · on Jan 2, 2017

DeepMind delivered value on several fields.

They happen to rely (partly) on DL to deliver this value, but they got acqui-hired not because "we are doing deep learning" but because "we have technology that can lower your costs/increase your profits"...

iandanforth · on Jan 2, 2017

They did? My impression was that until the server farm optimization they were a cost center for Google.

karambahh · on Jan 2, 2017

The server farm optimization is massive indeed.

They also provided value before but nothing has been made public about it yet, unfortunately.

darksaints · on Jan 2, 2017

It appears that your company is trying to make a profit. That isn't the point of "deep learning startups"...they're just trying to get a big money acquisition from large tech companies.

karambahh · on Jan 2, 2017

I confirm we aim to (and do :-) ) make a profit. I just can't wrap my mind around the idea that drafting a deck of slides full of buzzwords is enough to get a fat check from VCs...

pryelluw · on Jan 2, 2017

You must be new to the startup "scene". ;)

karambahh · on Jan 2, 2017

Unfortunately I am not. I've seen the 2000 dot com boom and subsequent bust...

pryelluw · on Jan 2, 2017

It was worse back then. But that does not mean stupid money is not running around these days.

kozikow · on Jan 2, 2017

I think "deep learning startup" may be more exciting than company using "decades old technology" :

- Applied deep learning is still not easy. Not many people have know-how about training models. There are no easy answers in hardware space.

- Translate academia success to businesses. After DL achieved remarkable things in Academia, people expect that same success will be translated to business. Business success mostly only came from Google.

karambahh · on Jan 2, 2017

Why would that be?

I built a company on translating academia research to business so I can totally relate to that feeling. I consider that a company exists to provide value to its consumers. If a simple linear regression provides value, why go the DL route?

I am absolutely not saying that DL has no use-case, we use it internally and on customers projects, I am just arguing that if decades old stuff does the job, don't bother with DL, in particular because applied DL is indeed very hard.

To me, it is like an extension of the "if it aint broke, don't fix it" philosophy: "if it can be fixed with a simple solution, dont over-engineer".

I often hear young software engineers laugh about banks still using COBOL for their core business. COBOL is lame, it's monolothic, etc. It may be all fashioned (and the little COBOL I've done I've hated it :-) ) but it goddamn works, why touch it and risk breaking stuff?

If a new technology efficiently solves a use-case then fine, go for it, but I hate the "hype" phenomenon. For instance, about 5 years, the hype was RoR, 2 years ago Scala, now it's DL and Pony, etc... I do some pretty nice DL, it's useful. I really like the concepts of Pony. But I'm not going to promote a deep learning framework in Pony to replace SWIFT money transfer, even though I'd be ticking many boxes of hype with that...

kozikow · on Jan 3, 2017

Previous approaches to AI couldn't beat game of go. There is not as much qualitive difference between pre-decesors of RoR, Pony and Scala. This makes me think that DL hype may be warranted.

Anyway, I am working on deep learning and it enable us to solve problems of our users better than previous computer vision approaches.

zxcvvcxz · on Jan 2, 2017

I'd just like to interject for moment. What you're refering to as Deep Learning, is in fact, non-convex optimization, or as I've recently taken to calling it, generalized function approximation. Deep Learning is not a technology unto itself, but rather a specific type of mathematical optimization that has been known about for decades, is based on stochastic gradient descent, and only now is being featured more prominently due to enhaced processing power.

Many computer users run some form of mathematical optimization everyday, without realizing it. Through a peculiar turn of events, the optimizations which are widely used today are often called Machine Learning, and many of its users are not aware that it is basically non-convex optimization, or high dimensional function approximation.

There really is Deep Learning, and these people are using it, but it is just a part of the optimization mathematics they use. Deep Learning is the kernel: the program in the system that allocates the machine's resources to the other programs that you run. The kernel is an essential part of an operating system, but useless by itself; it can only function in the context of a complete rigorous mathematical framework. Deep Learning is normally used in combination with other ideas in classical mathematical optimization: the whole system is basically stochastic gradient descent with GPUs added, or large-scale numerical optimization. All the so-called Deep Learning algorithms are really algorithms of generalized function approximation!

brandonb · on Jan 2, 2017

This seems like a sincere comment, so in case it's helpful, here's my interpretation of why others may be downvoting it.

Deep learning practitioners are aware that they're using non-convex optimization, stochastic gradient descent, and so on. In fact, those topics are core to modern deep learning research and explicitly acknowledged in most published research: LSTMs were invented to solve the vanishing gradient problem, Restricted Boltzmann Machines were used as a pre-training step to avoid local minima, and optimizers like ADAM have explicit guarantees about things like convergence.

You may know all this stuff already--not sure, based on your comment above.

If not, here are a few example papers from mainstream AI researchers explicitly talking about deep learning as an optimization problem or function approximator: [1] Why Does Unsupervised Pre-training Help Deep Learning?http://www.jmlr.org/papers/volume11/erhan10a/erhan10a.pdf [2] Multilayer feedforward networks are universal approximators: http://www.sciencedirect.com/science/article/pii/08936080899... [3] Deep Learning, Nature. http://www.cs.toronto.edu/~hinton/absps/NatureDeepReview.pdf [4] Adam: A Method for Stochastic Optimization. https://arxiv.org/abs/1412.6980

Guyag · on Jan 2, 2017

I believe it's a version of an old copypasta [0].

[0] https://wiki.installgentoo.com/index.php/Interjection

brandonb · on Jan 2, 2017

Well, I guess my sincerity detector is officially broken, huh?

amelius · on Jan 2, 2017

> Deep learning practitioners are aware that they're using non-convex optimization

It really depends. It's now perfectly possible to use deep learning as a black box, and I've already seen many people use it that way.

Just like you don't have to know how a web-browser works to be a web-programmer, you don't have to know how deep learning works to (e.g.) train a classifier.