Fix cuDNN convolution precision on Ampere+ GPUs by joelnn · Pull Request #3127 · davisking/dlib

joelnn · 2025-12-28T17:42:19Z

On Ampere and later GPUs (SM 8.0+), cuDNN's default math mode permits TF32 Tensor Core operations which use reduced mantissa precision. This causes numerical differences when comparing CUDA vs CPU convolution results, particularly in cudnnConvolutionBackwardFilter().

Explicitly set CUDNN_FMA_MATH to force true FP32 computation for consistent numerical results across all GPU architectures.

On Ampere and later GPUs (SM 8.0+), cuDNN's default math mode permits TF32 Tensor Core operations which use reduced mantissa precision. This causes numerical differences when comparing CUDA vs CPU convolution results, particularly in cudnnConvolutionBackwardFilter(). Explicitly set CUDNN_FMA_MATH to force true FP32 computation for consistent numerical results across all GPU architectures.

davisking · 2025-12-28T20:38:11Z

Sweet, thanks for another PR :D

Cydral · 2026-02-02T12:32:30Z

@joelnn, Would it be worth considering CUDNN_TENSOR_OP_MATH_ALLOW_CONVERSION instead of CUDNN_FMA_MATH to maintain Tensor Core performance on Ampere+ GPUs? This would only require relaxing one or two test tolerances from 1e-3 to 2e-3 in test_conv(), which seems acceptable given the significant performance benefit...

joelnn · 2026-02-02T15:10:36Z

@Cydral thats alright with me, I mainly wanted tests to pass. I would've thought that reducing precision should be opt-in, but following the default policy of cuDNN would also be a reasonable policy.

davisking merged commit 07c1e73 into davisking:master Dec 28, 2025
10 of 11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix cuDNN convolution precision on Ampere+ GPUs#3127

Fix cuDNN convolution precision on Ampere+ GPUs#3127
davisking merged 1 commit intodavisking:masterfrom
joelnn:fix-cudnn9-tf32-precision

joelnn commented Dec 28, 2025

Uh oh!

davisking commented Dec 28, 2025

Uh oh!

Uh oh!

Cydral commented Feb 2, 2026

Uh oh!

joelnn commented Feb 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

joelnn commented Dec 28, 2025

Uh oh!

davisking commented Dec 28, 2025

Uh oh!

Uh oh!

Cydral commented Feb 2, 2026

Uh oh!

joelnn commented Feb 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants