ethomaz
Banned
I can't find a thread about that.
Yesterday nVidia announced the first Volta GPU aimed at very high end of the compute market. While gamer GPU based on Volta will probably come in 2018 only it is good to have idia of the Pascal's sucessor.
I did a summary myself but you can read the article linked after that.
More info here: http://www.anandtech.com/show/11367...v100-gpu-and-tesla-v100-accelerator-announced
nVidia showed Kingsglaive: Final Fantasy XV demo running on Volta GPU.
https://www.youtube.com/watch?v=TIgQQz5SNxs
nVidia PR.
http://nvidianews.nvidia.com/news/n...next-era-of-ai-and-high-performance-computing
Yesterday nVidia announced the first Volta GPU aimed at very high end of the compute market. While gamer GPU based on Volta will probably come in 2018 only it is good to have idia of the Pascal's sucessor.
I did a summary myself but you can read the article linked after that.
- The first Volta GPUs are focused on business, HPC, and deep learning.
- The first chip is codenamed GV100 (sucessor of GP100).
- GV100 has 84SMs with 64 CUDA Cores each one (5376 CUDA Cores total).
- There are a new processing unit called Tensor Cores, 8 per SM, 672 Tensor Cores total.
- FP16 2:1, FP64 1:2 (compared with FP32 performance).
- 1455MHz Boost Clock, 30TFs FP16, 15TFs FP32, 7.5TFs FP64.
- 336 TMUs.
- 16GB HBM2, 900GB/s bandwidth.
- 21.1B transistors, 815mm2 die size in TSMC 12nm FFN (it is a improved 16nmFF+).
- 128KB of L1 data cache/shared memory (split between both can be configurable now).
- 300W TDP.
- Launch in Q3 2017.
More info here: http://www.anandtech.com/show/11367...v100-gpu-and-tesla-v100-accelerator-announced
nVidia showed Kingsglaive: Final Fantasy XV demo running on Volta GPU.
https://www.youtube.com/watch?v=TIgQQz5SNxs
nVidia PR.
http://nvidianews.nvidia.com/news/n...next-era-of-ai-and-high-performance-computing
The Tesla V100 GPU leapfrogs previous generations of NVIDIA GPUs with groundbreaking technologies that enable it to shatter the 100 teraflops barrier of deep learning performance. They include:
- Tensor Cores designed to speed AI workloads. Equipped with 640 Tensor Cores, V100 delivers 120 teraflops of deep learning performance, equivalent to the performance of 100 CPUs.
- New GPU architecture with over 21 billion transistors. It pairs CUDA cores and Tensor Cores within a unified architecture, providing the performance of an AI supercomputer in a single GPU.
- NVLink™ provides the next generation of high-speed interconnect linking GPUs, and GPUs to CPUs, with up to 2x the throughput of the prior generation NVLink.
- 900 GB/sec HBM2 DRAM, developed in collaboration with Samsung, achieves 50 percent more memory bandwidth than previous generation GPUs, essential to support the extraordinary computing throughput of Volta.
- Volta-optimized software, including CUDA, cuDNN and TensorRT™ software, which leading frameworks and applications can easily tap into to accelerate AI and research.