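Introduction to GPU FP32 Floating-Point Performance
FP32 floating-point performance refers to a graphics card's throughput when computing with 32-bit single-precision floating-point numbers. It matters for graphics rendering, scientific computing, AI inference, physics simulation, and other workloads that need fairly precise floating-point math. "FP" is short for Floating Point, and "32" means a 32-bit single-precision value (by comparison, FP16 is half precision and FP64 is double precision). It is one of a graphics card's key performance metrics.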
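To make the difference between half, single, and double precision concrete, here is a minimal sketch using only Python's standard `struct` module. The helper names `round_trip` and `describe` are hypothetical and chosen for illustration. The code runs on the CPU and only shows how each format stores a value; it says nothing about GPU speed.

```python
import struct

# struct format codes for IEEE 754 types:
# "e" = 16-bit half, "f" = 32-bit single, "d" = 64-bit double
FORMATS = {"FP16": "<e", "FP32": "<f", "FP64": "<d"}


def round_trip(value, fmt):
    """Pack a Python float into the given format and unpack it again."""
    return struct.unpack(fmt, struct.pack(fmt, value))[0]


def describe(value):
    """Return (name, bit width, stored value) for each format."""
    rows = []
    for name, fmt in FORMATS.items():
        bits = struct.calcsize(fmt) * 8
        rows.append((name, bits, round_trip(value, fmt)))
    return rows


for name, bits, stored in describe(0.1):
    print(f"{name}: {bits} bits, 0.1 is stored as {stored!r}")
```

The narrower the format, the further the stored value drifts from 0.1, which is why the precision level is always quoted alongside a throughput figure.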
GPU FP32 Floating-Point Performance Ranking (continuously updated)
- NVIDIA GeForce RTX 5090 104.8 TFLOPS
- NVIDIA GeForce RTX 5090 D 104.8 TFLOPS
- NVIDIA GeForce RTX 5090 DD 104.8 TFLOPS
- NVIDIA GeForce RTX 4090 82.58 TFLOPS
- NVIDIA GeForce RTX 4090 D 73.54 TFLOPS
- NVIDIA GeForce RTX 4080 Ti 73.50 TFLOPS
- AMD Radeon RX 7900 XTX 61.39 TFLOPS
- NVIDIA GeForce RTX 5080 SUPER 56.28 TFLOPS
- NVIDIA GeForce RTX 5080 56.28 TFLOPS
- NVIDIA GeForce RTX 4080 SUPER 52.22 TFLOPS
- AMD Radeon RX 7900 XT 51.48 TFLOPS
- NVIDIA GeForce RTX 4080 48.74 TFLOPS
- AMD Radeon RX 9070 XT 48.66 TFLOPS
- AMD Radeon RX 7900 GRE 45.98 TFLOPS
- NVIDIA GeForce RTX 4070 Ti SUPER 44.1 TFLOPS
- NVIDIA GeForce RTX 5070 Ti SUPER 43.94 TFLOPS
- NVIDIA GeForce RTX 5070 Ti 43.94 TFLOPS
- NVIDIA GeForce RTX 4080 12GB 40.09 TFLOPS
- NVIDIA GeForce RTX 4070 Ti 40.09 TFLOPS
- NVIDIA GeForce RTX 3090 Ti 40 TFLOPS
- AMD Radeon RX 7900M 38.52 TFLOPS
- AMD Radeon RX 7800 XT 37.32 TFLOPS
- AMD Radeon RX 9070 36.13 TFLOPS
- AMD Radeon RX 7800M 35.87 TFLOPS
- NVIDIA GeForce RTX 3090 35.58 TFLOPS
- NVIDIA GeForce RTX 4070 SUPER 35.48 TFLOPS
- AMD Radeon RX 7700 XT 35.17 TFLOPS
- AMD Radeon RX 9070 GRE 34.28 TFLOPS
- NVIDIA GeForce RTX 3080 Ti 20 GB 34.10 TFLOPS
- NVIDIA GeForce RTX 3080 Ti 34.1 TFLOPS
- NVIDIA GeForce RTX 4090 Mobile 32.98 TFLOPS
- NVIDIA GeForce RTX 5070 SUPER 32.15 TFLOPS
- AMD Radeon RX 7700 31.95 TFLOPS
- NVIDIA GeForce RTX 5090 Mobile 31.80 TFLOPS
- NVIDIA GeForce RTX 5070 30.87 TFLOPS
- NVIDIA GeForce RTX 3080 12GB 30.64 TFLOPS
- NVIDIA GeForce RTX 3080 29.77 TFLOPS
- NVIDIA GeForce RTX 4070 29.15 TFLOPS
- NVIDIA GeForce RTX 4090 Max-Q 28.31 TFLOPS
- AMD Radeon RX 9060 XT 8GB 25.64 TFLOPS
- AMD Radeon RX 9060 XT 16GB 25.64 TFLOPS
- NVIDIA GeForce RTX 4080 Mobile 24.72 TFLOPS
- NVIDIA GeForce RTX 5060 Ti 16GB 24.7 TFLOPS
- AMD Radeon RX 6950 XT 23.8 TFLOPS
- NVIDIA GeForce RTX 5060 Ti 23.7 TFLOPS
- AMD Radeon RX 6900 XT 23.04 TFLOPS
- NVIDIA GeForce RTX 5080 Mobile 23.04 TFLOPS
- AMD Radeon RX 7600 XT 22.57 TFLOPS
- AMD Radeon RX 7650 GRE 22.08 TFLOPS
- NVIDIA GeForce RTX 4060 Ti 16GB 22.06 TFLOPS
- NVIDIA GeForce RTX 4060 Ti 22.06 TFLOPS
- NVIDIA GeForce RTX 3070 Ti 21.75 TFLOPS
- AMD Radeon RX 7600 21.75 TFLOPS
- AMD Radeon RX 6800 XT 20.74 TFLOPS
- AMD Radeon RX 7700S 20.48 TFLOPS
- NVIDIA GeForce RTX 3070 20.31 TFLOPS
- AMD Radeon RX 7600M XT 20.23 TFLOPS
- NVIDIA GeForce RTX 4080 Max-Q 20.04 TFLOPS
- NVIDIA GeForce RTX 5060 19.18 TFLOPS
- NVIDIA GeForce RTX 3080 Mobile 18.98 TFLOPS
- NVIDIA GeForce RTX 3080 Ti Mobile 18.71 TFLOPS
- AMD PlayStation 5 Pro GPU 18.05 TFLOPS
- AMD Radeon RX 7600M 17.27 TFLOPS
- NVIDIA GeForce RTX 5070 Ti Mobile 17.04 TFLOPS
- NVIDIA GeForce RTX 3060 Ti 16.2 TFLOPS
- NVIDIA GeForce RTX 3060 Ti GDDR6X 16.20 TFLOPS
- AMD Radeon RX 6800 16.17 TFLOPS
- NVIDIA GeForce RTX 3070 Mobile 15.97 TFLOPS
- NVIDIA GeForce RTX 3070 Ti Mobile 15.88 TFLOPS
- AMD Radeon RX 7600S 15.77 TFLOPS
- NVIDIA GeForce RTX 4070 Mobile 15.62 TFLOPS
- NVIDIA GeForce RTX 3080 Max-Q 15.30 TFLOPS
- NVIDIA GeForce RTX 4060 15.11 TFLOPS
- NVIDIA GeForce RTX 4050 13.52 TFLOPS
- NVIDIA GeForce RTX 2080 Ti 13.45 TFLOPS
- AMD Radeon RX 6750 XT 13.31 TFLOPS
- AMD Radeon RX 6700 XT 13.21 TFLOPS
- AMD Radeon RX 6850M XT 13.21 TFLOPS
- NVIDIA GeForce RTX 3070 Max-Q 13.21 TFLOPS
- AMD Radeon RX 6750 GRE 12GB 13.21 TFLOPS
- NVIDIA GeForce RTX 5050 13.17 TFLOPS
- NVIDIA GeForce RTX 5070 Mobile 13.13 TFLOPS
- NVIDIA GeForce RTX 5050 Mobile 12.90 TFLOPS
- NVIDIA GeForce RTX 3060 12.74 TFLOPS
- NVIDIA GeForce RTX 3060 8GB 12.74 TFLOPS
- AMD Radeon RX 6800M 12.24 TFLOPS
- NVIDIA GeForce RTX 3070 Ti Max-Q 12.19 TFLOPS
- AMD Xbox Series X 6nm GPU 12.15 TFLOPS
- AMD Xbox Series X GPU 12.15 TFLOPS
- AMD Radeon 8060S 11.96 TFLOPS
- NVIDIA GeForce RTX 4060 Mobile 11.61 TFLOPS
- Intel Arc B570 11.52 TFLOPS
- NVIDIA GeForce GTX 1080 Ti 11.34 TFLOPS
- NVIDIA GeForce RTX 4070 Max-Q 11.34 TFLOPS
- AMD Radeon RX 6700 11.29 TFLOPS
- AMD Radeon RX 6750 GRE 10GB 11.29 TFLOPS
- NVIDIA GeForce RTX 2080 SUPER 11.15 TFLOPS
- AMD Radeon RX 6700M 11.06 TFLOPS
- NVIDIA GeForce RTX 3060 Mobile 10.94 TFLOPS
- AMD Radeon RX 6650 XT 10.79 TFLOPS
- AMD Radeon RX 6600 XT 10.6 TFLOPS
- AMD PlayStation 5 GPU 10.29 TFLOPS
- AMD Radeon RX 5700 XT 50th Anniversary 10.14 TFLOPS
- NVIDIA GeForce RTX 2080 10.07 TFLOPS
- AMD Radeon RX 6650M XT 9.896 TFLOPS
- NVIDIA GeForce RTX 3060 Max-Q 9.846 TFLOPS
- AMD Radeon RX 5700 XT 9.754 TFLOPS
- NVIDIA GeForce RTX 5060 Mobile 9.684 TFLOPS
- NVIDIA GeForce RTX 2080 Super Mobile 9.585 TFLOPS
- AMD Radeon 8050S 9.564 TFLOPS
- NVIDIA GeForce RTX 2080 Mobile 9.362 TFLOPS
- NVIDIA GeForce RTX 3050 8GB 9.098 TFLOPS
- NVIDIA GeForce RTX 2070 SUPER 9.062 TFLOPS
- NVIDIA GeForce RTX 4060 Max-Q 9.032 TFLOPS
- NVIDIA GeForce RTX 3050 OEM 8.986 TFLOPS
- NVIDIA GeForce RTX 4050 Mobile 8.986 TFLOPS
- AMD Radeon RX 6600 LE 8.942 TFLOPS
- AMD Radeon RX 6600 8.928 TFLOPS
- NVIDIA GeForce GTX 1080 Mobile 8.878 TFLOPS
- NVIDIA GeForce GTX 1080 8.873 TFLOPS
- AMD Radeon RX 6600M 8.659 TFLOPS
- AMD Radeon RX 6650M 8.659 TFLOPS
- AMD Radeon RX 6800S 8.602 TFLOPS
- AMD Ryzen Z2 8.294 TFLOPS
- AMD Radeon 780M 8.294 TFLOPS
- AMD Ryzen Z1 Extreme 8.294 TFLOPS
- NVIDIA GeForce RTX 4050 Max-Q 8.218 TFLOPS
- NVIDIA GeForce GTX 1070 Ti 8.186 TFLOPS
- AMD Radeon RX 5700 7.949 TFLOPS
- AMD Radeon RX 5700M 7.926 TFLOPS
- NVIDIA GeForce RTX 3050 6GB Mobile 7.639 TFLOPS
- NVIDIA GeForce RTX 2070 7.465 TFLOPS
- AMD Radeon RX 5600 XT 7.188 TFLOPS
- NVIDIA GeForce RTX 2060 SUPER 7.181 TFLOPS
- NVIDIA GeForce RTX 2060 12 GB 7.181 TFLOPS
- AMD Radeon RX 6700S 7.168 TFLOPS
- AMD Radeon RX 6600S 7.168 TFLOPS
- NVIDIA GeForce RTX 3050 4 GB 7.127 TFLOPS
- NVIDIA GeForce RTX 2070 Super Mobile 7.066 TFLOPS
- NVIDIA GeForce GTX 1080 Max-Q 6.994 TFLOPS
- NVIDIA GeForce RTX 2070 Mobile 6.843 TFLOPS
- NVIDIA GeForce RTX 3050 6GB 6.774 TFLOPS
- NVIDIA GeForce GTX 1070 Mobile 6.738 TFLOPS
- NVIDIA GeForce RTX 2060 SUPER Mobile 6.659 TFLOPS
- NVIDIA GeForce RTX 2080 Super Max-Q 6.636 TFLOPS
- NVIDIA GeForce GTX 1070 6.463 TFLOPS
- NVIDIA GeForce RTX 2060 6.451 TFLOPS
- NVIDIA GeForce RTX 2080 Max-Q 6.447 TFLOPS
- NVIDIA GeForce RTX 3050 4GB Mobile 6.18 TFLOPS
- AMD Radeon 890M 5.939 TFLOPS
- NVIDIA GeForce RTX 2070 Super Max-Q 5.914 TFLOPS
- AMD Radeon RX 5600M 5.829 TFLOPS
- AMD Radeon RX 6550M 5.816 TFLOPS
- AMD Radeon RX 6500 XT 5.765 TFLOPS
- NVIDIA GeForce GTX 1070 Max-Q 5.648 TFLOPS
- AMD Ryzen AI Z2 Extreme 5.530 TFLOPS
- AMD Ryzen Z2 Extreme 5.530 TFLOPS
- NVIDIA GeForce RTX 3050 Mobile 5.501 TFLOPS
- NVIDIA GeForce RTX 2070 Max-Q 5.46 TFLOPS
- NVIDIA GeForce GTX 1660 Ti 5.437 TFLOPS
- AMD Radeon 760M 5.323 TFLOPS
- NVIDIA GeForce RTX 3050 Ti Mobile 5.299 TFLOPS
- AMD Radeon RX 5500 XT 5.196 TFLOPS
- AMD Radeon RX 5500 5.196 TFLOPS
- NVIDIA GeForce GTX 1660 Super 5.027 TFLOPS
- NVIDIA GeForce GTX 1660 5.027 TFLOPS
- AMD Radeon RX 6500M 4.915 TFLOPS
- NVIDIA GeForce GTX 1660 Ti Mobile 4.884 TFLOPS
- NVIDIA GeForce RTX 3050 A Mobile 4.813 TFLOPS
- AMD Radeon RX 5500M 4.632 TFLOPS
- NVIDIA GeForce RTX 2060 Mobile 4.608 TFLOPS
- NVIDIA GeForce RTX 2060 Max-Q 4.55 TFLOPS
- AMD Radeon 880M 4.454 TFLOPS
- NVIDIA GeForce GTX 1650 SUPER 4.416 TFLOPS
- NVIDIA GeForce GTX 1060 6 GB 4.375 TFLOPS
- NVIDIA GeForce GTX 1060 5GB 4.375 TFLOPS
- NVIDIA GeForce RTX 3050 Max-Q 4.329 TFLOPS
- NVIDIA GeForce GTX 1060 Mobile 4.278 TFLOPS
- NVIDIA GeForce GTX 1660 Ti Max-Q 4.101 TFLOPS
- AMD Radeon RX 5300M 4.069 TFLOPS
- AMD Xbox Series S GPU 4.006 TFLOPS
- NVIDIA GeForce GTX 1060 3 GB 3.935 TFLOPS
- NVIDIA GeForce GTX 1060 Max-Q 3.789 TFLOPS
- AMD Radeon RX 6450M 3.779 TFLOPS
- AMD Radeon RX 6400 3.565 TFLOPS
- AMD Radeon 680M 3.379 TFLOPS
- NVIDIA GeForce GTX 1650 Mobile 3.195 TFLOPS
- AMD Radeon 860M 3.072 TFLOPS
- NVIDIA GeForce GTX 1650 Ti Mobile 3.041 TFLOPS
- NVIDIA GeForce GTX 1650 2.984 TFLOPS
- AMD Radeon 740M 2.867 TFLOPS
- NVIDIA GeForce MX550 2.703 TFLOPS
- AMD Ryzen Z1 2.560 TFLOPS
- NVIDIA GeForce GTX 1050 Ti Mobile 2.488 TFLOPS
- NVIDIA GeForce GTX 1650 Ti Max-Q 2.458 TFLOPS
- NVIDIA GeForce GTX 1050 3GB 2.332 TFLOPS
- NVIDIA GeForce GTX 1650 Max-Q 2.304 TFLOPS
- NVIDIA GeForce GTX 1050 3GB Mobile 2.215 TFLOPS
- NVIDIA GeForce GTX 1050 Ti 2.138 TFLOPS
- NVIDIA GeForce GTX 1050 Ti Max-Q 1.983 TFLOPS
- NVIDIA GeForce GTX 1050 Mobile 1.911 TFLOPS
- NVIDIA GeForce MX350 1.879 TFLOPS
- NVIDIA GeForce GTX 1050 1.862 TFLOPS
- NVIDIA GeForce GTX 1630 1.828 TFLOPS
- NVIDIA GeForce MX450 12W 1.667 TFLOPS
- AMD Steam Deck OLED GPU 1.638 TFLOPS
- AMD Steam Deck GPU 1.638 TFLOPS
- AMD Radeon 660M 1.459 TFLOPS
- NVIDIA GeForce MX230 0.783 TFLOPS
- AMD Radeon 610M 0.5632 TFLOPS
- NVIDIA GeForce GTX 1050 Max-Q 0.145 TFLOPS
- NVIDIA GeForce MX330 0.122 TFLOPS
- NVIDIA GeForce MX250 0.121 TFLOPS
- NVIDIA GeForce MX150 0.117 TFLOPS
- Intel UHD Graphics 730 N/A
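How GPU FP32 Floating-Point Performance Is Calculated
A graphics card's theoretical FP32 performance (in TFLOPS) can be estimated with the following formula:
FP32 performance (TFLOPS) = CUDA cores × boost clock (MHz) × 2 ÷ 1,000,000
The factor of 2 is there because each core can issue one FMA (fused multiply-add) instruction per clock, and an FMA counts as two floating-point operations. Dividing by 1,000,000 converts the result from MFLOPS to TFLOPS.
Take the NVIDIA RTX 3080 as an example. It has 8704 CUDA cores and a boost clock of 1.71 GHz (1710 MHz), so the formula gives 8704 × 1710 × 2 ÷ 1,000,000 ≈ 29.77 TFLOPS. In other words, the RTX 3080 can perform at most about 29.77 trillion FP32 operations per second.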
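The estimate above can be sketched as a small Python helper. The function name `fp32_tflops` and its parameter names are hypothetical, and the code assumes that the clock is given in MHz and that each core completes one FMA (2 floating-point operations) per clock. The result is a theoretical peak, not a measured figure.

```python
def fp32_tflops(shader_cores, boost_clock_mhz, flops_per_core_per_clock=2):
    """Estimate theoretical peak FP32 throughput in TFLOPS.

    Assumes each core retires one FMA per clock, counted as 2 operations.
    cores * MHz * ops gives MFLOPS; dividing by 1,000,000 gives TFLOPS.
    """
    return shader_cores * boost_clock_mhz * flops_per_core_per_clock / 1_000_000


# RTX 3080 figures quoted in the text: 8704 CUDA cores, 1.71 GHz (1710 MHz)
rtx_3080 = fp32_tflops(8704, 1710)
print(f"RTX 3080: {rtx_3080:.2f} TFLOPS")  # prints "RTX 3080: 29.77 TFLOPS"
```

Keeping the operations-per-clock factor as a parameter makes the FMA assumption explicit, since it is the one part of the formula that depends on the architecture.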
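What GPU FP32 Floating-Point Performance Means
Games depend mainly on graphics rendering capability (rasterization, shaders, and so on). FP32 performance is relevant but not decisive, although it does affect some heavy graphics workloads such as ray tracing and physics simulation.
For AI and deep learning, FP32 performance directly affects neural network inference and training speed, although mixed-precision computation in FP16 or lower precision (supported by Tensor Cores) is now the more efficient choice.
For scientific computing and industrial simulation, where high precision is required, FP32 is the baseline. Some tasks also require FP64, for which professional cards such as the NVIDIA A100 and the Quadro series are better suited.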