| Is The Watermarking Of LLM-Generated Code Robust? | Mar 24, 2024 | ARC | CodeCode Available | 1 | 5 |
| Con Instruction: Universal Jailbreaking of Multimodal Large Language Models via Non-Textual Modalities | May 31, 2025 | ARC | CodeCode Available | 1 | 5 |
| LLMs and the Abstraction and Reasoning Corpus: Successes, Failures, and the Importance of Object-based Representations | May 26, 2023 | ARCLanguage Modelling | CodeCode Available | 1 | 5 |
| MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment | Oct 8, 2024 | ARCBelebele | CodeCode Available | 1 | 5 |
| The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale | Jun 25, 2024 | ARCLanguage Modeling | CodeCode Available | 1 | 5 |
| Global Greedy Dependency Parsing | Nov 20, 2019 | ARCDependency Parsing | CodeCode Available | 0 | 5 |
| Graph Attention-based Deep Reinforcement Learning for solving the Chinese Postman Problem with Load-dependent costs | Oct 24, 2023 | ARCDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Fast(er) Exact Decoding and Global Training for Transition-Based Dependency Parsing via a Minimal Feature Set | Aug 30, 2017 | ARCDependency Parsing | CodeCode Available | 0 | 5 |
| A Novel Generalised Meta-Heuristic Framework for Dynamic Capacitated Arc Routing Problems | Apr 14, 2021 | ARC | CodeCode Available | 0 | 5 |
| Exploiting Reasoning Chains for Multi-hop Science Question Answering | Sep 7, 2021 | Abstract Meaning RepresentationARC | CodeCode Available | 0 | 5 |