First generation TPU[edit] The first-generation TPU is an 8-bit matrix multiplication engine, driven with CISC instructions by the host processor across a PCIe 3.0 bus. It is manufactured on a 28 nm process with a die size ≤ 331 mm2. The clock speed is 700 MHz and it has a thermal design power of 28–40 W. It has 28 MiB of on chip memory, and 4 MiB of 32-bit accumulators taking the results of a 256
![Tensor Processing Unit - Wikipedia](https://cdn-ak-scissors.b.st-hatena.com/image/square/26d317990e14420f6d0f95987ff0ab8f3ee1e219/height=288;version=1;width=512/https%3A%2F%2Fupload.wikimedia.org%2Fwikipedia%2Fcommons%2Fthumb%2Fb%2Fbe%2FTensor_Processing_Unit_3.0.jpg%2F1200px-Tensor_Processing_Unit_3.0.jpg)