NVIDIA Quadro RTX 8000 vs NVIDIA A40 PCIe

What is the difference between the NVIDIA Quadro RTX 8000 and the NVIDIA A40 PCIe? Compare their specifications below to see which graphics card offers better performance.

Quadro RTX 8000

A40 PCIe

Graphics Processor (GPU)

TU102	GPU Name	GA102
Turing	Architecture	Ampere
TSMC	Foundry	Samsung
12 nm	Process Size	8 nm
18,600 million	Transistors	28,300 million
754 mm²	Die Size	628 mm²

Graphics Card

Aug 13th, 2018	Release Date	Oct 5th, 2020
Quadro (Tx000)	Family	Tesla (Axx)
Active	Production	Active
PCIe 3.0 x16	Bus Interface	PCIe 4.0 x16

Memory

48 GB	Memory Size	48 GB
GDDR6	Memory Type	GDDR6
384 bit	Memory Bus	384 bit
672.0 GB/s	Bandwidth	695.8 GB/s
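
The bandwidth figures follow from the memory bus width and the memory clock. A minimal sketch of that arithmetic is below. It assumes the effective per-pin data rate is 8 times the listed memory clock, which matches this page's own 1750 MHz and 14 Gbps pairing. The function name `memory_bandwidth_gbs` is illustrative, not part of any vendor tool.

```python
def memory_bandwidth_gbs(bus_width_bits: int, memory_clock_mhz: float) -> float:
    """Peak memory bandwidth in GB/s.

    Assumption: effective per-pin rate (Gbps) = 8 x memory clock (MHz) / 1000,
    as implied by the 1750 MHz -> 14 Gbps pairing in the Clock Speeds table.
    """
    data_rate_gbps = memory_clock_mhz * 8 / 1000
    # Bits per second across the whole bus, divided by 8 bits per byte.
    return bus_width_bits * data_rate_gbps / 8


# Inputs taken from the Memory and Clock Speeds tables.
rtx8000_bw = memory_bandwidth_gbs(384, 1750)
a40_bw = memory_bandwidth_gbs(384, 1812)

print(f"Quadro RTX 8000: {rtx8000_bw:.1f} GB/s")
print(f"A40 PCIe:        {a40_bw:.1f} GB/s")
```

Using the unrounded 1812 MHz clock rather than the rounded 14.5 Gbps figure is what reproduces the 695.8 GB/s listed for the A40.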

Performance

169.9 GPixel/s	Pixel fillrate	194.9 GPixel/s
509.8 GTexel/s	Texture fillrate	584.6 GTexel/s
32.62 TFLOPS (2:1)	FP16 (half) performance	37.42 TFLOPS (1:1)
16.31 TFLOPS	FP32 (float) performance	37.42 TFLOPS
509.8 GFLOPS (1:32)	FP64 (double) performance	584.6 GFLOPS (1:64)

Clock Speeds

1395 MHz	Base Clock	1305 MHz
1770 MHz	Boost Clock	1740 MHz
1750 MHz (14 Gbps effective)	Memory Clock	1812 MHz (14.5 Gbps effective)

Render Config

4608	Shading Units	10752
288	TMUs	336
96	ROPs	112
72	RT Cores	84
64 KB (per SM)	L1 Cache	128 KB (per SM)
6 MB	L2 Cache	6 MB
72	SM Count	84
576	Tensor Cores	336
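
The fillrate and FP32 figures in the Performance table can be reproduced from the boost clock and the unit counts above. The sketch below assumes the usual peak-rate conventions: one pixel per ROP per clock, one texel per TMU per clock, and two floating-point operations (one fused multiply-add) per shading unit per clock. The `GpuSpec` class and its method names are hypothetical, chosen for illustration only.

```python
from dataclasses import dataclass


@dataclass
class GpuSpec:
    name: str
    boost_mhz: float
    shading_units: int
    tmus: int
    rops: int

    def pixel_fillrate_gpixels(self) -> float:
        # Assumption: one pixel per ROP per clock.
        return self.rops * self.boost_mhz / 1000

    def texture_fillrate_gtexels(self) -> float:
        # Assumption: one texel per TMU per clock.
        return self.tmus * self.boost_mhz / 1000

    def fp32_tflops(self) -> float:
        # Assumption: one FMA (2 FLOPs) per shading unit per clock.
        return self.shading_units * 2 * self.boost_mhz / 1_000_000


# Values copied from the Clock Speeds and Render Config tables.
rtx8000 = GpuSpec("Quadro RTX 8000", 1770, 4608, 288, 96)
a40 = GpuSpec("A40 PCIe", 1740, 10752, 336, 112)

for gpu in (rtx8000, a40):
    print(
        f"{gpu.name}: "
        f"{gpu.pixel_fillrate_gpixels():.1f} GPixel/s, "
        f"{gpu.texture_fillrate_gtexels():.1f} GTexel/s, "
        f"{gpu.fp32_tflops():.2f} TFLOPS FP32"
    )
```

These are theoretical peaks at the rated boost clock. Sustained clocks under load, and therefore real throughput, can differ.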

Board Design

Dual-slot	Slot Width	Dual-slot
267 mm (10.5 inches)	Length	267 mm (10.5 inches)
111 mm (4.4 inches)	Width	112 mm (4.4 inches)
260 W	Thermal design power (TDP)	300 W
600 W	Suggested PSU	700 W
4x DisplayPort, 1x USB Type-C	Display Connectors	3x DisplayPort
1x 6-pin + 1x 8-pin	Power Connectors	8-pin EPS
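
One way to weigh the higher TDP of the A40 against its higher throughput is to divide the listed peak FP32 rate by the listed TDP. The sketch below does only that. It is a rough paper comparison built from the figures on this page, not a measured efficiency result, and the helper name `gflops_per_watt` is illustrative.

```python
def gflops_per_watt(fp32_tflops: float, tdp_watts: float) -> float:
    """Theoretical peak FP32 GFLOPS per watt of rated TDP."""
    return fp32_tflops * 1000 / tdp_watts


# FP32 figures from the Performance table, TDP from the Board Design table.
rtx8000_eff = gflops_per_watt(16.31, 260)
a40_eff = gflops_per_watt(37.42, 300)

print(f"Quadro RTX 8000: {rtx8000_eff:.1f} GFLOPS/W")
print(f"A40 PCIe:        {a40_eff:.1f} GFLOPS/W")
print(f"A40 / RTX 8000:  {a40_eff / rtx8000_eff:.2f}x")
```

Actual power draw depends on the workload, so the ratio is best read as an upper-level indication rather than a benchmark.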

API support

12 Ultimate (12_2)	DirectX	12 Ultimate (12_2)
4.6	OpenGL	4.6
3.0	OpenCL	3.0
1.2	Vulkan	1.2
6.6	Shader Model	6.6
7.5	CUDA	8.6
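
The CUDA row lists each card's compute capability, which is the value passed to `nvcc` when building code for a specific GPU. As a small sketch, the helper below (a hypothetical name, `nvcc_gencode_flag`) turns the two values from the table into `-gencode` arguments, assuming you want native code generated for both cards.

```python
def nvcc_gencode_flag(compute_capability: str) -> str:
    """Build an nvcc -gencode argument from a 'major.minor' compute capability."""
    major, minor = compute_capability.split(".")
    arch = f"{major}{minor}"
    return f"-gencode arch=compute_{arch},code=sm_{arch}"


# Compute capabilities from the CUDA row: Quadro RTX 8000 (7.5), A40 PCIe (8.6).
flags = [nvcc_gencode_flag(cc) for cc in ("7.5", "8.6")]
print(" ".join(flags))
```

Building for both targets lets one binary run natively on either card, at the cost of a larger executable.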