Andodalf
Banned
On RTX GPUs, both Tensor and CUDA cores are limited by the external video memory and texture cache bandwidth.
RX 6800's DirectML and normal Shader workloads have access to very fast 128 MB Infinity Cache (Level L3 cache) in addition to the texture cache.
Its worth noting that RTX 3000 is using higher bandwidth GDRR6X memory, which AMD doesn’t have access to. In many ways infinity cache seems to be a response to this.