A few comments on some of these projects: Keras is pretty much the best way to d...

cedricblack · on Aug 11, 2016

I ported ConvNet JS to C# in order to really understand what's going on: https://github.com/cbovar/ConvNetSharp

redcalx · on Aug 11, 2016

Brilliant, I've been looking for projects like this. I'm currently working through a couple of RBM C# projects but will add this to my list of reference code.

Top tip: If you use the matrix and vector classes in math.net then you can optionally configure it to use optimised version of e.g. matrix multiplication, that map through to one of the providers, such as Intel Math Kernel Lib, OpenBLAS, and I think there's a CUDA provider too.

I tried the Intel MKL one and the dense matrix multiplication was about 60x faster than a plain C# version.

https://github.com/mathnet/mathnet-numerics

https://www.nuget.org/packages/MathNet.Numerics/

cedricblack · on Aug 12, 2016

Thanks for the tip. I'll see where I can apply it.

Most of the time is usually spent in the convolution layers. Convolution is not a matrix multiplication in the current implementation. I guess it would be a matrix multiplication in frequency domain or by using a Toeplitz matrix.

I've implemented a CPU Parallel version and gave a try at GPU implementation. But I'm not satisfied at all by the GPU version :)

https://github.com/cbovar/ConvNetSharp/blob/master/src/ConvN...

https://github.com/cbovar/ConvNetSharp/blob/Gpu/src/ConvNetS...

Pull requests more than welcome!

redcalx · on Aug 12, 2016

> Convolution is not a matrix multiplication in the current implementation

I figure there's a code re-organisation task since propagating node activations through a layer of weights is essentially a matrix multiplication (fully connected => fully dense matrix).

The optimised routines make use of vectorised CPU instructions and the FMA instruction (fused multiply and add), all of which are perfect fits for [dense] matrix multiplcation. Not so great for sparse matrices, but they help, usually unless it's very sparse it's faster to use a dense matrix format with zeros for the missing weights.

> Pull requests more than welcome!

Duly noted :)

tchalla · on Aug 11, 2016

> Keras is pretty much the best way to do almost anything these days

What makes Keras take the advantage?

nl · on Aug 11, 2016

It's a well designed API for using deep neural networks rather than an API for doing optimized mathematical operations.

Compare how you build some vaguely comparable models in Keras[1] and raw TensorFlow[2]. Keras uses TensorFlow (or Theano) underneath, so there is no performance penalty.

It's like in Python machine learning, most people use Scikit instead of implementing things in numpy.

[1] https://github.com/fchollet/keras/blob/master/examples/mnist...

[2] https://github.com/tensorflow/tensorflow/blob/master/tensorf...

tchalla · on Aug 11, 2016

Thanks! Your example to explain the differences makes me more leaning towards Keras. It feels like Scikit for Deep Learning.

nl · on Aug 11, 2016

Yes, that's a decent analogy. If you are already familiar with Scikit then TFLearn is worth looking at too.

chestervonwinch · on Aug 11, 2016

Theano itself is more like a language, not a deep learning framework. There is no NeuralNetworkClassifier class, for example. Although, you could write a neural network library / framework using Theano, and it would have all the benefits of Theano (code compiled for the GPU, various common neural net ops available, etc.), which is what it looks like the Keras folks have done. I took a stab at this a while ago (1), but I didn't keep up on it. I haven't used Keras much, but it looks like it fills a much needed gap, which I'm glad for.

(1): https://github.com/notmatthancock/neural_network

stephanheijl · on Aug 11, 2016

Very extensible API, accessible and widely used programming language (Python), the ability to use both Theano and Tensorflow as a backend and easy to implement non-linear neural networks (where data is split and merged at will) all contribute to this. Using Keras means you will almost never need to implement some custom layer or function, whilst sacrificing very little performance-wise.

pilooch · on Aug 11, 2016

> Also, DeepDetect! I keep trying to find that and never can remember the name.

Ah, thanks for the reference, DD author here, the funny fact about the name is that translated to my mother tongue (French), it sounds like porn ;)

IshKebab · on Aug 11, 2016

CNTK is actually pretty damn good. It just lacks a good scripting interface. They're adding one though.

The network description language (now 'BrainScript') is a far nicer way to specify networks than the approaches used by any other network. Especially for recurrent networks. In CNTK you can just say `X = FutureValue(Y)` or `X = PastValue(Y)`. It's so convoluted in TensorFlow I actually never worked out how to do it.

It also has their fancy 1-bit SGD stuff, but I doubt many people use that, and it has a more restrictive license anyway.

Eridrus · on Aug 11, 2016

Keras support for recurrent models leaves a bit to be desired at this point, so it's great if it has what you want, but otherwise you have to start peeking under the hood, which may be harder than just learning the underlying framework.

shostack · on Aug 11, 2016

When you say starting out, do you mean a true beginner to machine learning, or someone new to these frameworks?