| LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging | Oct 22, 2024 | Out-of-Distribution Generalization | CodeCode Available | 1 |
| Looking Inward: Language Models Can Learn About Themselves by Introspection | Oct 17, 2024 | Out-of-Distribution Generalization | CodeCode Available | 1 |
| Rejecting Hallucinated State Targets during Planning | Oct 9, 2024 | Decision MakingOut-of-Distribution Generalization | CodeCode Available | 1 |
| Collaboration! Towards Robust Neural Methods for Routing Problems | Oct 7, 2024 | Out-of-Distribution Generalization | CodeCode Available | 1 |
| Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grained Image Quality Assessment | Oct 3, 2024 | Image Quality AssessmentOut-of-Distribution Generalization | CodeCode Available | 1 |
| Positional Attention: Expressivity and Learnability of Algorithmic Computation | Oct 2, 2024 | Out-of-Distribution Generalization | CodeCode Available | 1 |
| H-ARC: A Robust Estimate of Human Performance on the Abstraction and Reasoning Corpus Benchmark | Sep 2, 2024 | ARCOut-of-Distribution Generalization | CodeCode Available | 1 |
| LCA-on-the-Line: Benchmarking Out-of-Distribution Generalization with Class Taxonomies | Jul 22, 2024 | BenchmarkingOut-of-Distribution Generalization | CodeCode Available | 1 |
| Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design | Jul 16, 2024 | Drug DiscoveryOut-of-Distribution Generalization | CodeCode Available | 1 |
| CARE: a Benchmark Suite for the Classification and Retrieval of Enzymes | Jun 21, 2024 | ClassificationOut-of-Distribution Generalization | CodeCode Available | 1 |