| Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing | Oct 22, 2024 | Attribute, Benchmarking | —Unverified | 0 |
| A Complexity-Based Theory of Compositionality | Oct 18, 2024 | Out-of-Distribution Generalization | —Unverified | 0 |
| Looking Inward: Language Models Can Learn About Themselves by Introspection | Oct 17, 2024 | Out-of-Distribution Generalization | Code Available | 1 |
| Feature-guided score diffusion for sampling conditional densities | Oct 15, 2024 | Denoising, Out-of-Distribution Generalization | —Unverified | 0 |
| FOOGD: Federated Collaboration for Both Out-of-distribution Generalization and Detection | Oct 15, 2024 | Federated Learning, Out-of-Distribution Generalization | Code Available | 0 |
| ALVIN: Active Learning Via INterpolation | Oct 11, 2024 | Active Learning, Natural Language Inference | —Unverified | 0 |
| AHA: Human-Assisted Out-of-Distribution Generalization and Detection | Oct 10, 2024 | Out-of-Distribution Generalization | Code Available | 0 |
| Visual Scratchpads: Enabling Global Reasoning in Vision | Oct 10, 2024 | Out-of-Distribution Generalization | —Unverified | 0 |
| Can Transformers Reason Logically? A Study in SAT Solving | Oct 9, 2024 | Decoder, Logical Reasoning | —Unverified | 0 |
| Rejecting Hallucinated State Targets during Planning | Oct 9, 2024 | Decision Making, Out-of-Distribution Generalization | Code Available | 1 |