| Smoothie: Label Free Language Model Routing | Dec 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Transformers Can Navigate Mazes With Multi-Step Prediction | Dec 6, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA | Dec 6, 2024 | counterfactualLanguage Model Evaluation | CodeCode Available | 1 |
| MISR: Measuring Instrumental Self-Reasoning in Frontier Models | Dec 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MIND: Effective Incorrect Assignment Detection through a Multi-Modal Structure-Enhanced Language Model | Dec 5, 2024 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Composed Image Retrieval for Training-Free Domain Conversion | Dec 4, 2024 | Image RetrievalLanguage Modeling | CodeCode Available | 1 |
| Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension | Dec 4, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 |
| Evaluating Language Models as Synthetic Data Generators | Dec 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity | Dec 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Dec 2, 2024 | cross-modal alignmentKnowledge Distillation | CodeCode Available | 1 |