NVIDIA Quadro RTX 8000 vs NVIDIA A40 PCIe
What is the difference between the NVIDIA Quadro RTX 8000 and the NVIDIA A40 PCIe? Find out which graphics card has better performance.
Graphics Processor (GPU)
NVIDIA Quadro RTX 8000 | Specification | NVIDIA A40 PCIe |
TU102 | GPU Name | GA102 |
Turing | Architecture | Ampere |
TSMC | Foundry | Samsung |
12 nm | Process Size | 8 nm |
18,600 million | Transistors | 28,300 million |
754 mm² | Die Size | 628 mm² |
Graphics Card
Aug 13th, 2018 | Release Date | Oct 5th, 2020 |
Quadro (Tx000) | Family | Tesla (Axx) |
Active | Production | Active |
PCIe 3.0 x16 | Bus Interface | PCIe 4.0 x16 |
Memory
48 GB | Memory Size | 48 GB |
GDDR6 | Memory Type | GDDR6 |
384 bit | Memory Bus | 384 bit |
672.0 GB/s | Bandwidth | 695.8 GB/s |
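The bandwidth row can be reproduced from the memory bus width and the effective per-pin data rate. The sketch below is illustrative only: the function name `memory_bandwidth_gbs` is made up for this example, and it assumes the effective data rate is 8x the listed memory clock, which matches the 1750 MHz / 14 Gbps pairing shown under Clock Speeds.

```python
def memory_bandwidth_gbs(bus_width_bits: int, data_rate_gbps: float) -> float:
    """Peak bandwidth in GB/s: bytes per transfer times transfers per second."""
    return bus_width_bits / 8 * data_rate_gbps


# Quadro RTX 8000: 384-bit bus at 14 Gbps effective.
rtx8000_bw = memory_bandwidth_gbs(384, 14.0)

# A40 PCIe: 1812 MHz memory clock, assumed 8x multiplier -> 14.496 Gbps,
# which the table rounds to 14.5 Gbps.
a40_bw = memory_bandwidth_gbs(384, 1812 * 8 / 1000)

print(f"{rtx8000_bw:.1f} GB/s, {a40_bw:.1f} GB/s")  # 672.0 GB/s, 695.8 GB/s
```

Using the unrounded 14.496 Gbps is what yields 695.8 GB/s; the rounded 14.5 Gbps figure would give 696.0 GB/s.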
Performance
169.9 GPixel/s | Pixel fillrate | 194.9 GPixel/s |
509.8 GTexel/s | Texture fillrate | 584.6 GTexel/s |
32.62 TFLOPS (2:1) | FP16 (half) performance | 37.42 TFLOPS (1:1) |
16.31 TFLOPS | FP32 (float) performance | 37.42 TFLOPS |
509.8 GFLOPS (1:32) | FP64 (double) performance | 584.6 GFLOPS (1:64) |
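The fillrate and FP32 figures above are theoretical peaks derived from the boost clock and the unit counts listed under Clock Speeds and Render Config. A minimal sketch of that arithmetic follows; the `Gpu` class and its method names are hypothetical, and the FP32 formula assumes each shading unit retires one fused multiply-add (2 FLOPs) per clock.

```python
from dataclasses import dataclass


@dataclass
class Gpu:
    name: str
    boost_mhz: int
    shaders: int
    tmus: int
    rops: int

    def pixel_fillrate(self) -> float:
        """GPixel/s: one pixel per ROP per clock."""
        return self.rops * self.boost_mhz / 1000

    def texture_fillrate(self) -> float:
        """GTexel/s: one texel per TMU per clock."""
        return self.tmus * self.boost_mhz / 1000

    def fp32_tflops(self) -> float:
        """TFLOPS: 2 FLOPs (one FMA) per shading unit per clock."""
        return self.shaders * 2 * self.boost_mhz / 1e6


cards = [
    Gpu("Quadro RTX 8000", boost_mhz=1770, shaders=4608, tmus=288, rops=96),
    Gpu("A40 PCIe", boost_mhz=1740, shaders=10752, tmus=336, rops=112),
]

for gpu in cards:
    print(
        f"{gpu.name}: {gpu.pixel_fillrate():.1f} GPixel/s, "
        f"{gpu.texture_fillrate():.1f} GTexel/s, "
        f"{gpu.fp32_tflops():.2f} TFLOPS FP32"
    )
```

The FP16 and FP64 rows are fixed ratios of the FP32 figure, as noted in parentheses in the table. None of these peaks account for sustained clocks, memory limits, or workload mix.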
Clock Speeds
1395 MHz | Base Clock | 1305 MHz |
1770 MHz | Boost Clock | 1740 MHz |
1750 MHz (14 Gbps effective) | Memory Clock | 1812 MHz (14.5 Gbps effective) |
Render Config
4608 | Shading Units | 10752 |
288 | TMUs | 336 |
96 | ROPs | 112 |
72 | RT Cores | 84 |
64 KB (per SM) | L1 Cache | 128 KB (per SM) |
6 MB | L2 Cache | 6 MB |
72 | SM Count | 84 |
576 | Tensor Cores | 336 |
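Dividing each unit count by the SM count shows how the two architectures lay out an SM, which helps explain why the A40 has more shading units but fewer tensor cores. The snippet below is a rough sketch using only the numbers in this table; the dictionary keys and the `per_sm` helper are names chosen for this example.

```python
configs = {
    "Quadro RTX 8000": {"sm": 72, "shaders": 4608, "tmus": 288, "tensor": 576, "rt": 72},
    "A40 PCIe": {"sm": 84, "shaders": 10752, "tmus": 336, "tensor": 336, "rt": 84},
}


def per_sm(cfg: dict) -> dict:
    """Return each unit count divided by the number of SMs."""
    return {unit: count // cfg["sm"] for unit, count in cfg.items() if unit != "sm"}


for name, cfg in configs.items():
    print(name, per_sm(cfg))
# Quadro RTX 8000 {'shaders': 64, 'tmus': 4, 'tensor': 8, 'rt': 1}
# A40 PCIe {'shaders': 128, 'tmus': 4, 'tensor': 4, 'rt': 1}
```

Raw tensor core and RT core counts are not directly comparable across Turing and Ampere, since the units themselves differ between generations.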
Board Design
Dual-slot | Slot Width | Dual-slot |
267 mm (10.5 inches) | Length | 267 mm (10.5 inches) |
111 mm (4.4 inches) | Width | 112 mm (4.4 inches) |
260 W | Thermal design power (TDP) | 300 W |
600 W | Suggested PSU | 700 W |
4x DisplayPort, 1x USB Type-C | Display Connectors | 3x DisplayPort |
1x 6-pin + 1x 8-pin | Power Connectors | 8-pin EPS |
API support
12 Ultimate (12_2) | DirectX | 12 Ultimate (12_2) |
4.6 | OpenGL | 4.6 |
3.0 | OpenCL | 3.0 |
1.2 | Vulkan | 1.2 |
6.6 | Shader Model | 6.6 |
7.5 | CUDA | 8.6 |