jensen-nvidia-gtc16-tesla-p100_01In 2012, NVIDIA started to make promising inroads in Cloud computing environments, thanks to its Kepler architecture. Fast forward to April 2016, more developers are using GPUs for non-graphics purposes than ever, 2015 was a corner year for A.I, with computers winning at the game of go for the first time, and today NVIDIA announces the Tesla P100 chip based on the NVIDIA Pascal architecture.

The Tesla P100 processor succeeds to the Tesla K40 and the Tesla K80 (basically two K40 working together) and packs 3584 computing cores (vs. 2880 for the K40). Each core uses a better architecture, which yields higher performance and better compute density.

To feed data to this monster, a 4096-bit bus to cutting-edge HBM2 memory modules can provide up to 720GB/sec of bandwidth, which is like moving the content of 28 Blu-Ray movies every second. The new HBM2 technology is a key factor in the increased bandwidth.

The computing power tops 21.2 TFLOPS (teraFLOPS) in half-precision calculations and 5.3 TFLOPS in double precision. That is ~5X faster than the Tesla K40 (single-chip). Interestingly, the K40 was designed for 235W, while the P100 chip is designed for 300W – you can see how the thermals have been kept in check, despite the amazing increase in computing performance.

jensen-nvidia-gtc16-tesla-p100_03The NVIDIA Tesla P100 contains a massive 15.3B transistors, which is mind-boggling for a chip. NVIDIA Plans to use it to power scientific applications, including artificial intelligence (A.I) that are based on deep-learning.

The massive size of the internal register file(s) will further accelerate application that can keep data in the chip, without having to communicate with the memory, which is inherently slower. Previous NVIDIA single-chip solutions like the K40 have 3840KB of registers. P100 has 14336KB!

With general purpose computing demand for more performance showing no signs of letting down, NVIDIA can afford to engage in building those amazingly huge and powerful chips.

Filed in Computers. Read more about AI (Artificial Intelligence), NVIDIA, processors.

User Comments