| Training Compute-Optimal Large Language Models | Mar 29, 2022 | Anachronisms, Analogical Similarity | Code Available | 6 |
| Survey of Computerized Adaptive Testing: A Machine Learning Perspective | Mar 31, 2024 | Cognitive Diagnosis, Question Selection | Code Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract Algebra, Anachronisms | Code Available | 2 |
| BOBCAT: Bilevel Optimization-Based Computerized Adaptive Testing | Aug 17, 2021 | Bilevel Optimization, Question Selection | Code Available | 1 |
| Diffusion-Inspired Cold Start with Sufficient Prior in Computerized Adaptive Testing | Nov 19, 2024 | Question Selection | Code Available | 1 |
| Active Task Disambiguation with LLMs | Feb 6, 2025 | Experimental Design, Question Selection | Code Available | 1 |
| Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language Models | May 19, 2025 | Benchmarking, Chatbot | Code Available | 1 |
| ComQA: Compositional Question Answering via Hierarchical Graph Neural Networks | Jan 16, 2021 | Answer Selection, Machine Reading Comprehension | Code Available | 1 |
| Asking More Informative Questions for Grounded Retrieval | Nov 14, 2023 | Question Answering, Question Selection | Unverified | 0 |
| A Clarifying Question Selection System from NTES_ALONG in Convai3 Challenge | Oct 27, 2020 | Information Retrieval, Question Selection | Unverified | 0 |