| Changing Answer Order Can Decrease MMLU Accuracy | Jun 27, 2024 | MMLU, Multiple-choice | —Unverified | 0 |
| Efficient Model Development through Fine-tuning Transfer | Mar 25, 2025 | MMLU, model | —Unverified | 0 |
| Efficiently Deploying LLMs with Controlled Risk | Oct 3, 2024 | MMLU, TruthfulQA | —Unverified | 0 |
| Efficient Federated Search for Retrieval-Augmented Generation | Feb 26, 2025 | MMLU, RAG | —Unverified | 0 |
| Efficient Data Selection at Scale via Influence Distillation | May 25, 2025 | GSM8K, MMLU | —Unverified | 0 |
| ChainRank-DPO: Chain Rank Direct Preference Optimization for LLM Rankers | Dec 18, 2024 | MMLU, Reranking | —Unverified | 0 |
| Effectiveness of Zero-shot-CoT in Japanese Prompts | Mar 9, 2025 | Abstract Algebra, College Mathematics | —Unverified | 0 |
| From Threat to Tool: Leveraging Refusal-Aware Injection Attacks for Safety Alignment | Jun 7, 2025 | ARC, MMLU | —Unverified | 0 |
| Lizard: An Efficient Linearization Framework for Large Language Models | Jul 11, 2025 | Language Modeling | —Unverified | 0 |
| B-score: Detecting biases in large language models using response history | May 24, 2025 | MMLU | —Unverified | 0 |