Actions: THUDM/slime
Showing runs from all workflows
2,500+ workflow runs
--allgather-cp silently scrambles token order in hf_attention.py CP reshuffle
Slash Command Handler #662: Issue comment #1871 (comment) created by zhuzilin