| VeriMind: Agentic LLM for Automated Verilog Generation with a Novel Evaluation Metric | Mar 15, 2025 | ARCCode Generation | —Unverified | 0 |
| Evaluation of Alignment-Regularity Characteristics in Deformable Image Registration | Mar 10, 2025 | ARCImage Registration | —Unverified | 0 |
| Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance | Mar 7, 2025 | ARCLanguage Modeling | —Unverified | 0 |
| State-of-the-Art Stroke Lesion Segmentation at 1/1000th of Parameters | Mar 7, 2025 | ARCDecoder | —Unverified | 0 |
| ARC-Flow : Articulated, Resolution-Agnostic, Correspondence-Free Matching and Interpolation of 3D Shapes Under Flow Fields | Mar 4, 2025 | ARC | —Unverified | 0 |
| Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System | Feb 27, 2025 | ARCAutonomous Navigation | —Unverified | 0 |
| Say Less, Mean More: Leveraging Pragmatics in Retrieval-Augmented Generation | Feb 25, 2025 | ARCPassage Retrieval | —Unverified | 0 |
| Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks | Feb 24, 2025 | 2kARC | —Unverified | 0 |
| Detecting Benchmark Contamination Through Watermarking | Feb 24, 2025 | ARCMMLU | —Unverified | 0 |
| An Autonomous Network Orchestration Framework Integrating Large Language Models with Continual Reinforcement Learning | Feb 22, 2025 | ARCContinual Learning | —Unverified | 0 |
| Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs | Feb 20, 2025 | ARCIn-Context Learning | CodeCode Available | 0 |
| Diverse Inference and Verification for Advanced Reasoning | Feb 14, 2025 | ARCHumanity's Last Exam | —Unverified | 0 |
| ORI: O Routing Intelligence | Feb 14, 2025 | ARCMMLU | —Unverified | 0 |
| MixMin: Finding Data Mixtures via Convex Minimization | Feb 14, 2025 | ARCLanguage Modeling | —Unverified | 0 |
| Safe platooning control of connected and autonomous vehicles on curved multi-lane roads | Feb 14, 2025 | ARCAutonomous Vehicles | —Unverified | 0 |
| Task Generalization With AutoRegressive Compositional Structure: Can Learning From Tasks Generalize to ^T Tasks? | Feb 13, 2025 | ARCIn-Context Learning | —Unverified | 0 |
| Understanding LLMs' Fluid Intelligence Deficiency: An Analysis of the ARC Task | Feb 11, 2025 | ARC | CodeCode Available | 0 |
| Enhanced Rapid Detection of High-impedance Arc Faults in Medium Voltage Electrical Distribution Networks | Feb 9, 2025 | ARCFault Detection | —Unverified | 0 |
| Vision-Ultrasound Robotic System based on Deep Learning for Gas and Arc Hazard Detection in Manufacturing | Feb 8, 2025 | ARCFairness | —Unverified | 0 |
| Limitations of Large Language Models in Clinical Problem-Solving Arising from Inflexible Reasoning | Feb 5, 2025 | ARC | —Unverified | 0 |
| A Beam's Eye View to Fluence Maps 3D Network for Ultra Fast VMAT Radiotherapy Planning | Feb 5, 2025 | ARCSSIM | —Unverified | 0 |
| Efficient Implementation of the Global Cardinality Constraint with Costs | Feb 4, 2025 | ARC | —Unverified | 0 |
| The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles | Feb 3, 2025 | ARCMultimodal Reasoning | CodeCode Available | 2 |
| Pheromone-based Learning of Optimal Reasoning Paths | Jan 31, 2025 | ARCGSM8K | —Unverified | 0 |
| State Stream Transformer (SST) : Emergent Metacognitive Behaviours Through Latent State Persistence | Jan 30, 2025 | 8kARC | —Unverified | 0 |
| FAAGC: Feature Augmentation on Adaptive Geodesic Curve Based on the shape space theory | Jan 25, 2025 | ARC | —Unverified | 0 |
| Unified 3D MRI Representations via Sequence-Invariant Contrastive Learning | Jan 21, 2025 | AnatomyARC | CodeCode Available | 0 |
| Towards A Litmus Test for Common Sense | Jan 17, 2025 | ARCCommon Sense Reasoning | —Unverified | 0 |
| Random Subspace Cubic-Regularization Methods, with Applications to Low-Rank Functions | Jan 16, 2025 | ARC | —Unverified | 0 |
| Scaling Graph-Based Dependency Parsing with Arc Vectorization and Attention-Based Refinement | Jan 16, 2025 | ARCDependency Parsing | —Unverified | 0 |
| Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI | Jan 13, 2025 | ARCBenchmarking | —Unverified | 0 |
| The Utility of Hyperplane Angle Metric in Detecting Financial Concept Drift | Jan 12, 2025 | ARCDrift Detection | CodeCode Available | 0 |
| Common Sense Is All You Need | Jan 11, 2025 | AllARC | —Unverified | 0 |
| Semantic Exploration with Adaptive Gating for Efficient Problem Solving with Language Models | Jan 10, 2025 | ARCDiversity | —Unverified | 0 |
| Cluster & Disperse: a general air conflict resolution heuristic using unsupervised learning | Jan 8, 2025 | ARC | —Unverified | 0 |
| NSA: Neuro-symbolic ARC Challenge | Jan 8, 2025 | ARC | CodeCode Available | 0 |
| Hybridising Reinforcement Learning and Heuristics for Hierarchical Directed Arc Routing Problems | Jan 1, 2025 | ARCreinforcement-learning | CodeCode Available | 0 |
| In Case You Missed It: ARC 'Challenge' Is Not That Challenging | Dec 23, 2024 | ARCMultiple-choice | —Unverified | 0 |
| SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs | Dec 11, 2024 | ARCGSM8K | —Unverified | 0 |
| ARCEAK: An Automated Rule Checking Framework Enhanced with Architectural Knowledge | Dec 10, 2024 | ARCCode Generation | —Unverified | 0 |
| Minimum Weighted Feedback Arc Sets for Ranking from Pairwise Comparisons | Dec 10, 2024 | ARC | CodeCode Available | 0 |
| ConceptSearch: Towards Efficient Program Search Using LLMs for Abstraction and Reasoning Corpus (ARC) | Dec 10, 2024 | ARCFew-Shot Learning | CodeCode Available | 0 |
| ARC Prize 2024: Technical Report | Dec 5, 2024 | ARCProgram Synthesis | CodeCode Available | 3 |
| Asymptotic enumeration of normal and hybridization networks via tree decoration | Dec 4, 2024 | ARC | —Unverified | 0 |
| Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset | Dec 3, 2024 | ARCMMLU | —Unverified | 0 |
| Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African Languages | Dec 1, 2024 | ARCMultiple-choice | —Unverified | 0 |
| Abductive Symbolic Solver on Abstraction and Reasoning Corpus | Nov 27, 2024 | ARCVisual Reasoning | —Unverified | 0 |
| An Attempt to Develop a Neural Parser based on Simplified Head-Driven Phrase Structure Grammar on Vietnamese | Nov 26, 2024 | ARCConstituency Parsing | —Unverified | 0 |
| Lower Dimensional Spherical Representation of Medium Voltage Load Profiles for Visualization, Outlier Detection, and Generative Modelling | Nov 21, 2024 | ARCClustering | —Unverified | 0 |
| Capturing Sparks of Abstraction for the ARC Challenge | Nov 17, 2024 | ARC | CodeCode Available | 0 |