flash_attn_generic: replace raw arith.* FP ops with FlyDSL-typed fast…#764
Open
xudoyuan wants to merge 4 commits into
Open
flash_attn_generic: replace raw arith.* FP ops with FlyDSL-typed fast…#764xudoyuan wants to merge 4 commits into
xudoyuan wants to merge 4 commits into