SOTAVerified

Question Selection

Papers

Showing 1–10 of 31 papers

Title | Status | Hype
Training Compute-Optimal Large Language Models | Code | 6
Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Code | 2
Survey of Computerized Adaptive Testing: A Machine Learning Perspective | Code | 2
BOBCAT: Bilevel Optimization-Based Computerized Adaptive Testing | Code | 1
Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language Models | Code | 1
ComQA: Compositional Question Answering via Hierarchical Graph Neural Networks | Code | 1
Diffusion-Inspired Cold Start with Sufficient Prior in Computerized Adaptive Testing | Code | 1
Active Task Disambiguation with LLMs | Code | 1
Adaptive political surveys and GPT-4: Tackling the cold start problem with simulated user interactions | Code | 0
Asking Clarifying Questions in Open-Domain Information-Seeking Conversations | Code | 0

No leaderboard results yet.