SOTAVerified

Question Selection

Papers

Showing 110 of 31 papers

TitleStatusHype
Training Compute-Optimal Large Language ModelsCode6
Survey of Computerized Adaptive Testing: A Machine Learning PerspectiveCode2
Scaling Language Models: Methods, Analysis & Insights from Training GopherCode2
Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language ModelsCode1
Active Task Disambiguation with LLMsCode1
Diffusion-Inspired Cold Start with Sufficient Prior in Computerized Adaptive TestingCode1
BOBCAT: Bilevel Optimization-Based Computerized Adaptive TestingCode1
ComQA:Compositional Question Answering via Hierarchical Graph Neural NetworksCode1
TestAgent: An Adaptive and Intelligent Expert for Human Assessment0
AutoJudger: An Agent-Driven Framework for Efficient Benchmarking of MLLMsCode0
Show:102550
← PrevPage 1 of 4Next →

No leaderboard results yet.