| metabench -- A Sparse Benchmark to Measure General Ability in Large Language Models | Jul 4, 2024 | ARCGSM8K | CodeCode Available | 0 |
| LLM-ARC: Enhancing LLMs with an Automated Reasoning Critic | Jun 25, 2024 | ARCLogical Reasoning | —Unverified | 0 |
| VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation | Jun 25, 2024 | ARCBenchmarking | CodeCode Available | 0 |
| The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale | Jun 25, 2024 | ARCLanguage Modeling | CodeCode Available | 1 |
| PORT: Preference Optimization on Reasoning Traces | Jun 23, 2024 | ARCGSM8K | —Unverified | 0 |
| AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models | Jun 19, 2024 | ARCMixture-of-Experts | CodeCode Available | 1 |
| Circular transformation of the European steel industry renders scrap metal a strategic resource | Jun 17, 2024 | ARC | —Unverified | 0 |
| Promises, Outlooks and Challenges of Diffusion Language Modeling | Jun 17, 2024 | ARCHellaSwag | —Unverified | 0 |
| Cross-Modal Learning for Anomaly Detection in Complex Industrial Process: Methodology and Benchmark | Jun 13, 2024 | Anomaly DetectionARC | CodeCode Available | 1 |
| Regularizing Numerical Extremals Along Singular Arcs: A Lie-Theoretic Approach | Jun 11, 2024 | ARC | —Unverified | 0 |
| Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal Examples | Jun 9, 2024 | ARCDiversity | CodeCode Available | 2 |
| Yuan 2.0-M32: Mixture of Experts with Attention Router | May 28, 2024 | ARCMath | CodeCode Available | 2 |
| A One-Layer Decoder-Only Transformer is a Two-Layer RNN: With an Application to Certified Robustness | May 27, 2024 | ARCDecoder | —Unverified | 0 |
| ARC: A Generalist Graph Anomaly Detector with In-Context Learning | May 27, 2024 | Anomaly DetectionARC | CodeCode Available | 1 |
| Adaptive Gradient Clipping for Robust Federated Learning | May 23, 2024 | ARCFederated Learning | —Unverified | 0 |
| Adaptive Retention & Correction: Test-Time Training for Continual Learning | May 23, 2024 | ARCContinual Learning | —Unverified | 0 |
| Modeling citation worthiness by using attention-based bidirectional long short-term memory networks and interpretable models | May 20, 2024 | ARCCitation worhtiness | CodeCode Available | 0 |
| Localized Adaptive Risk Control | May 13, 2024 | ARCFairness | CodeCode Available | 0 |
| Program Synthesis using Inductive Logic Programming for the Abstraction and Reasoning Corpus | May 10, 2024 | ARCInductive logic programming | —Unverified | 0 |
| Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning | May 1, 2024 | ARCGSM8K | CodeCode Available | 3 |
| Iterative Reasoning Preference Optimization | Apr 30, 2024 | ARCGSM8K | —Unverified | 0 |
| Student Data Paradox and Curious Case of Single Student-Tutor Model: Regressive Side Effects of Training LLMs for Personalized Learning | Apr 23, 2024 | ARCCommon Sense Reasoning | —Unverified | 0 |
| Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models | Apr 21, 2024 | ARCImage Generation | —Unverified | 0 |
| FisheyeDetNet: 360° Surround view Fisheye Camera based Object Detection System for Autonomous Driving | Apr 20, 2024 | ARCAutonomous Driving | —Unverified | 0 |
| Frenet-Serret Frame-based Decomposition for Part Segmentation of 3D Curvilinear Structures | Apr 19, 2024 | ARCSegmentation | CodeCode Available | 1 |