Skip to content

Add FP8 block scale quantization support for FMHA forward kernel

1df0a12
Select commit
Loading
Failed to load commit list.
Merged

Fp8 block scale quantization for fmha fwd #3330

Add FP8 block scale quantization support for FMHA forward kernel
1df0a12
Select commit
Loading
Failed to load commit list.