Serious question: what is an AI coprocessor, technically? Some machine-learned models burned onto a chip? Or some kind of neural net with updatable weights?
There are many potential designs for these things, but the first-gen TPU is one that works, is in production, and has been described in a paper. You have to distinguish, though, whether you mean an inference engine or something that can also do training. For HoloLens it's probably going to be an inference unit, which means it'll likely look something like a TPU, perhaps with more specific hardware support for convolutions (which are very important for visual-processing DNNs these days), much as NVIDIA's tensor units have.
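To make the inference/training distinction concrete, here's a toy NumPy sketch (made-up layer sizes, not any real accelerator's API) of what an inference-only unit has to compute: a forward pass over frozen weights, with no gradients or weight updates anywhere.

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((256, 784))  # pre-trained weights, frozen at deploy time
b = rng.standard_normal(256)
x = rng.standard_normal(784)         # one input, e.g. a flattened image

# Inference is just multiply-accumulate plus a cheap nonlinearity -- no
# backward pass, no optimizer state -- which is why a fixed-function MAC
# array (like the TPU's systolic matrix unit) is enough to cover it.
y = np.maximum(W @ x + b, 0.0)       # ReLU(Wx + b)
print(y.shape)                       # (256,)
```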
It is not well documented by anyone. However, the expectation is that it is a matrix or convolution coprocessor, as these are the dominant operations in deep neural networks (for both inference and training). For instance, NVIDIA's tensor units perform 4x4 matrix multiply-accumulate operations.
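For a sense of why a matrix unit covers convolutions too, here's a rough NumPy sketch of the standard im2col trick (shapes made up for illustration): it lowers a 2-D convolution to a single matrix product, which a tiled multiply-accumulate unit can then stream through.

```python
import numpy as np

def conv2d_via_matmul(x, w):
    """x: (H, W) input, w: (k, k) filter; 'valid' convolution via im2col."""
    H, W_ = x.shape
    k = w.shape[0]
    out_h, out_w = H - k + 1, W_ - k + 1
    # Gather every kxk patch into a row -> (out_h*out_w, k*k) matrix.
    cols = np.array([x[i:i + k, j:j + k].ravel()
                     for i in range(out_h) for j in range(out_w)])
    # The whole convolution is now a single matrix product.
    return (cols @ w.ravel()).reshape(out_h, out_w)

x = np.arange(36, dtype=float).reshape(6, 6)
w = np.ones((3, 3)) / 9.0            # 3x3 box filter
print(conv2d_via_matmul(x, w).shape) # (4, 4)
```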
I was in the audience at CVPR when it was presented. They were doing semantic segmentation using ResNet-18, so I'm guessing it speeds up convolutions and some linear algebra during inference; I doubt it will be used for training.
According to the linked article, this coprocessor seems particularly focused on deep neural networks (DNNs), so it does sound like an evaluator for neural networks with updatable weights.