Skip to content

Supporting FIPO (Future-KL Influenced Policy Optimization)#1801

Open
SeungyounShin wants to merge 9 commits intoTHUDM:mainfrom
SeungyounShin:feature/fipo-loss
Open

Supporting FIPO (Future-KL Influenced Policy Optimization)#1801
SeungyounShin wants to merge 9 commits intoTHUDM:mainfrom
SeungyounShin:feature/fipo-loss

Commits

Commits on Apr 3, 2026