Keep in mind that what you linked refers to TPUv1, which is built for quantized ...

vomjom on Feb 12, 2018 | parent | context | favorite | on: Cloud TPUs in Beta

Keep in mind that what you linked refers to TPUv1, which is built for quantized 8-bit inference. The TPUv2, which was announced in this blog post, is for general purpose training and uses 32-bit weights, activations, and gradients.

It will have very different performance characteristics.

bloudermilk on Feb 12, 2018 [–]

Thanks for pointing that out!