SOTAVerified

ST-MoE-L 4.1B (fine-tuned)

Papers

Showing 1 of 1 papers

Title | Status | Hype
ST-MoE: Designing Stable and Transferable Sparse Expert Models | Code | 3

No leaderboard results yet.