| Training Compute-Optimal Large Language Models | Mar 29, 2022 | AnachronismsAnalogical Similarity | CodeCode Available | 6 | 5 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract AlgebraAnachronisms | CodeCode Available | 2 | 5 |
| Survey of Computerized Adaptive Testing: A Machine Learning Perspective | Mar 31, 2024 | cognitive diagnosisQuestion Selection | CodeCode Available | 2 | 5 |
| BOBCAT: Bilevel Optimization-Based Computerized Adaptive Testing | Aug 17, 2021 | Bilevel OptimizationQuestion Selection | CodeCode Available | 1 | 5 |
| Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language Models | May 19, 2025 | BenchmarkingChatbot | CodeCode Available | 1 | 5 |
| ComQA:Compositional Question Answering via Hierarchical Graph Neural Networks | Jan 16, 2021 | Answer SelectionMachine Reading Comprehension | CodeCode Available | 1 | 5 |
| Diffusion-Inspired Cold Start with Sufficient Prior in Computerized Adaptive Testing | Nov 19, 2024 | Question Selection | CodeCode Available | 1 | 5 |
| Active Task Disambiguation with LLMs | Feb 6, 2025 | Experimental DesignQuestion Selection | CodeCode Available | 1 | 5 |
| Adaptive political surveys and GPT-4: Tackling the cold start problem with simulated user interactions | Mar 12, 2025 | Question Selection | CodeCode Available | 0 | 5 |
| Asking Clarifying Questions in Open-Domain Information-Seeking Conversations | Jul 15, 2019 | Question SelectionRetrieval | CodeCode Available | 0 | 5 |