NVIDIA Quadro RTX 8000 vs NVIDIA A40 PCIe

What is the difference between NVIDIA Quadro RTX 8000 and NVIDIA A40 PCIe. Find out which graphics card has better performance.

Graphics Processor (GPU)

TU102GPU NameGA102
TuringArchitectureAmpere
TSMCFoundrySamsung
12 nmProcess Size8 nm
18,600 millionTransistors28,300 million
754 mm²Die Size628 mm²

Graphics Card

Aug 13th, 2018Release DateOct 5th, 2020
Quadro
(Tx000)
FamilyTesla
(Axx)
ActiveProductionActive
PCIe 3.0 x16Bus InterfacePCIe 4.0 x16

Memory

48 GBMemory Size48 GB
GDDR6Memory TypeGDDR6
384 bitMemory Bus384 bit
672.0 GB/sBandwidth695.8 GB/s

Performance

169.9 GPixel/sPixel fillrate194.9 GPixel/s
509.8 GTexel/sTexture fillrate584.6 GTexel/s
32.62 TFLOPS (2:1)FP16 (half) performance37.42 TFLOPS (1:1)
16.31 TFLOPSFP32 (float) performance37.42 TFLOPS
509.8 GFLOPS (1:32)FP64 (double) performance1,169 GFLOPS (1:32)

Clock Speeds

1395 MHzBase Clock1305 MHz
1770 MHzBoost Clock1740 MHz
1750 MHz
14 Gbps effective
Memory Clock1812 MHz
14.5 Gbps effective

Render Config

4608Shading Units10752
288TMUs336
96ROPs112
72RT Cores84
64 KB (per SM)L1 Cache128 KB (per SM)
6 MBL2 Cache6 MB
72SM Count84
576Tensor Cores336

Board Design

Dual-slotSlot WidthDual-slot
267 mm
10.5 inches
Length267 mm
10.5 inches
111 mm
4.4 inches
Width112 mm
4.4 inches
260 WThermal design power (TDP)300 W
600 WSuggested PSU700 W
4x DisplayPort
1x USB Type-C
Display Connectors3x DisplayPort
1x 6-pin + 1x 8-pinPower Connectors8-pin EPS

API support

12 Ultimate (12_2)DirectX12 Ultimate (12_2)
4.6OpenGL4.6
3.0OpenCL3.0
1.2Vulkan1.2
6.6Shader Model6.6
7.5CUDA8.6