Can someone elaborate the difference between reinforcement learning and unsuperv...

chrisfosterelli · on March 14, 2016

Not an expert, but my understanding is that humans can learn many things through reinforcement learning, but most of our intelligent decisions are a result of unsupervised learning.

For instance, if the stove element was red hot and you touched it, you'd receive a feeling of pain. Reinforcement learning would suggest that you shouldn't do this again, and you might learn to not touch stove elements when they are red hot anymore.

However, a human is likely to additionally realize that touching a red hot marshmallow stick would burn them as well. And at the same time, a human can also tell that a red ball is safe to touch. This sort of behaviour (internally labeling things and deciding what it is that made the stove element dangerous so you can apply that to other things) would be unsupervised learning.

markism · on March 14, 2016

Interesting, so it's the ability to generalize attributes selectively? That sounds like it would make for the difference between the specialized deep learning we have and a more general intelligence.

Could that be accomplished if NN problems are broken into features, and those features are individually tested against new information? Though you'd need a layer for feature selection, and it still lacks the ability to pick features without training.

chrisfosterelli · on March 14, 2016

> Interesting, so it's the ability to generalize attributes selectively?

In technical terms it is the ability to generate its own features and classifications for making decisions, where in supervised learning a human provides the features and classifications.

> Could that be accomplished if NN problems are broken into features, and those features are individually tested against new information?

If I understood your example correctly, that would be an example of supervised learning, as you pointed out it lacks the ability to pick features without being told what they are. There are types of unsupervised learning that exist and work quite well, for example cluster analysis [1].

[1] https://en.wikipedia.org/wiki/Cluster_analysis

dilemma · on March 14, 2016

Computers follow instructions written for them. Humans write their own instructions (and can even disregard instructions).

micro_cam · on March 14, 2016

Unsupervised learning in a technical sense means things like clustering algorithms where the goal is to discover structure in a data set without known labels (or numeric response).

If you show a human a single unlabeled picture of a platypus (or whatever) they will know instantly with a high degree of certainty that it is a new category of thing and be able to recognize additional images of it as belonging to that class even without being told a label for it.

Our best image classification algorithms can't do that, even with labels they need a lot of example images to learn to identify a new class.

sakopov · on March 14, 2016

Exactly what i was going to ask! Is there such a thing as unsupervised learning? A human mind is constantly fed a stream of learning data. And while some things may be predispositioned by our DNA, I think for the most part we learn directly from the rules and guidelines dictated by our parents first and our society later.

hughperkins · on March 14, 2016

So, its like, when I learned salsa dancing, i couldnt hear the 'beat' of the music, and was always out of time. So what I did was just listen to salsa music whilst cycling and walking for a couple of weeks. Now i could hear the 'beat', and lots more structure in the music to boot.

sakopov · on March 15, 2016

Ah, yes. This's a great example. I see what you're saying.