| Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation | Feb 19, 2025 | AllModel Selection | CodeCode Available | 0 |
| Pricing is All You Need to Improve Traffic Routing | Feb 18, 2025 | All | —Unverified | 0 |
| One Size doesn't Fit All: A Personalized Conversational Tutoring Agent for Mathematics Instruction | Feb 18, 2025 | All | —Unverified | 0 |
| One for All: A General Framework of LLMs-based Multi-Criteria Decision Making on Human Expert Level | Feb 17, 2025 | AllDecision Making | —Unverified | 0 |
| OCT Data is All You Need: How Vision Transformers with and without Pre-training Benefit Imaging | Feb 17, 2025 | Allimage-classification | —Unverified | 0 |
| Human-centered explanation does not fit all: The interplay of sociotechnical, cognitive, and individual factors in the effect AI explanations in algorithmic decision-making | Feb 17, 2025 | AllDecision Making | —Unverified | 0 |
| All Models Are Miscalibrated, But Some Less So: Comparing Calibration with Conditional Mean Operators | Feb 17, 2025 | All | —Unverified | 0 |
| Eye Tracking Based Cognitive Evaluation of Automatic Readability Assessment Measures | Feb 16, 2025 | AllReading Comprehension | —Unverified | 0 |
| Distraction is All You Need for Multimodal Large Language Model Jailbreaking | Feb 15, 2025 | AllLanguage Modeling | —Unverified | 0 |
| Is Depth All You Need? An Exploration of Iterative Reasoning in LLMs | Feb 15, 2025 | AllDiversity | CodeCode Available | 0 |