A Neural Network in 11 Lines of Python (2015)

ocharles · on Oct 19, 2017

3blue1brown is currently doing a terrific series on how neural networks work, which nicely compliments this blog post. https://www.youtube.com/watch?v=aircAruvnKk&list=PLZHQObOWTQ...

amelius · on Oct 18, 2017

I'm wondering, given a random truth-table with N binary variables and 1 binary output, what is (worst case) the smallest network that can learn it? (In terms of number of parameters).

fooker · on Oct 18, 2017

For a 100% success rate, it would have to be of the size of the minimal BDD. If you can allow for some errors, this becomes an interesting problem.

amelius · on Oct 18, 2017

Yes, but that's probably the theoretical minimum.

What I'm talking about is the size that is required so that the neural net can learn it. This may be different.

HalcyonicStorm · on Oct 18, 2017

Can you define BDD, please?

Someone · on Oct 18, 2017

https://en.wikipedia.org/wiki/Binary_decision_diagram

fnbr · on Oct 18, 2017

This is technically true, but I wonder how close you could get to 100% with minimal size. I would expect that you could get somewhere around >98% with a network that's a few megabytes in size.

e19293001 · on Oct 18, 2017

The author is currently writing this book:

https://www.manning.com/books/grokking-deep-learning

dahart · on Oct 18, 2017

Having gone through this tutorial (which is great!) and several others, I'm curious what is a good second step for the casual neural network learner?

It sounds like there is a growing bag of tricks neural network researchers are discovering to make training practical and stable for large data sets.

One example would be using relu activation - whenever I play with it in a simple tutorial like this one, training seems to explode and fail much more frequently, so I'm guessing either I'm missing another step people use, or there are some extra constraints on initial conditions?

Using a Gaussian for activation in my tutorials has tended to be more stable and converge much faster, but I assume there is a huge downside lurking somewhere to having a non-monotonically increasing function?

What are the tricks of the trade that a weekend warrior should investigate?

mholt · on Oct 18, 2017

I co-authored a blog post with my lab that has practical advice for debugging DNNs - some of these tips might be helpful to you? https://pcc.cs.byu.edu/2017/10/02/practical-advice-for-build...

maurits · on Oct 19, 2017

Stanford CS231n: Convolutional Neural Networks for Visual Recognition [1]

The assignments are excellent and will let you implement a deephish network from practically scratch, including backprop, optimizers, tuning etc.

[1]: http://cs231n.stanford.edu/index.html

jackthetab · on Oct 18, 2017

You may want to check out Andrew Ng's Deep Learning Specialization over on Coursera. [1] One of the courses is specifically about hyperparameter tuning and another about structuring your project. There is a lot of practical information scattered across all the courses.

Yes, I'm taking the specialization and having a blast with it. :-)

[1] https://www.coursera.org/learn/neural-networks-deep-learning

T_D_K · on Oct 18, 2017

Based on the limited amount of information, I'm assuming that by "training explodes" you mean that your gradient descent never reaches a local minimum. Try lowering your learning rate? You may be "stepping over" the minimum.

ghosthamlet · on Oct 19, 2017

this is my APL and J implemention: https://github.com/ghosthamlet/ann.apl

zakoud · on Oct 19, 2017

This is my Q implmentation: https://github.com/zakoud/mlq/blob/master/neuralnets/terser_...

craigching · on Oct 18, 2017

If this blog post interests you, you may be interested in his book that is forthcoming: https://www.manning.com/books/grokking-deep-learning

yujinyuz · on Oct 19, 2017

Nice discussion :)

carlosgg · on Oct 18, 2017

This one has been posted before.

https://news.ycombinator.com/item?id=9886555

https://news.ycombinator.com/item?id=11378022

KennyCason · on Oct 18, 2017

Quite a while ago, not sure if that's a problem. It's a pretty great write up and worthy of posting again. For me, this was the first time seeing it.

SeanDav · on Oct 18, 2017

You are both right. It is worthwhile knowing that something has been posted before, in order to review old comments if interested and good content is always worth repeating.

In any case, reposting is allowed, for several good reasons, which have been discussed in the past.

KennyCason · on Oct 18, 2017

Indeed, I had not considered that. I may have been used to people saying that links shouldn't be posted twice. The extra context is in fact nice!

traverseda · on Oct 18, 2017

It's nice to have the context of past discussions.

KennyCason · on Oct 18, 2017

Ah, very good point. I hadn't considered that. :)

dang · on Oct 18, 2017

After a year or so it's ok: https://news.ycombinator.com/newsfaq.html. Or if the story hasn't had major attention yet.

SloopJon · on Oct 18, 2017

Perhaps a 2015 label would be appropriate?

dang · on Oct 18, 2017

Doh yes added thanks!