| VeriMind: Agentic LLM for Automated Verilog Generation with a Novel Evaluation Metric | Mar 15, 2025 | ARCCode Generation | —Unverified | 0 |
| Evaluation of Alignment-Regularity Characteristics in Deformable Image Registration | Mar 10, 2025 | ARCImage Registration | —Unverified | 0 |
| Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance | Mar 7, 2025 | ARCLanguage Modeling | —Unverified | 0 |
| State-of-the-Art Stroke Lesion Segmentation at 1/1000th of Parameters | Mar 7, 2025 | ARCDecoder | —Unverified | 0 |
| ARC-Flow : Articulated, Resolution-Agnostic, Correspondence-Free Matching and Interpolation of 3D Shapes Under Flow Fields | Mar 4, 2025 | ARC | —Unverified | 0 |
| Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System | Feb 27, 2025 | ARCAutonomous Navigation | —Unverified | 0 |
| Say Less, Mean More: Leveraging Pragmatics in Retrieval-Augmented Generation | Feb 25, 2025 | ARCPassage Retrieval | —Unverified | 0 |
| Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks | Feb 24, 2025 | 2kARC | —Unverified | 0 |
| Detecting Benchmark Contamination Through Watermarking | Feb 24, 2025 | ARCMMLU | —Unverified | 0 |
| An Autonomous Network Orchestration Framework Integrating Large Language Models with Continual Reinforcement Learning | Feb 22, 2025 | ARCContinual Learning | —Unverified | 0 |
| Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs | Feb 20, 2025 | ARCIn-Context Learning | CodeCode Available | 0 |
| MixMin: Finding Data Mixtures via Convex Minimization | Feb 14, 2025 | ARCLanguage Modeling | —Unverified | 0 |
| Diverse Inference and Verification for Advanced Reasoning | Feb 14, 2025 | ARCHumanity's Last Exam | —Unverified | 0 |
| ORI: O Routing Intelligence | Feb 14, 2025 | ARCMMLU | —Unverified | 0 |
| Safe platooning control of connected and autonomous vehicles on curved multi-lane roads | Feb 14, 2025 | ARCAutonomous Vehicles | —Unverified | 0 |
| Task Generalization With AutoRegressive Compositional Structure: Can Learning From Tasks Generalize to ^T Tasks? | Feb 13, 2025 | ARCIn-Context Learning | —Unverified | 0 |
| Understanding LLMs' Fluid Intelligence Deficiency: An Analysis of the ARC Task | Feb 11, 2025 | ARC | CodeCode Available | 0 |
| Enhanced Rapid Detection of High-impedance Arc Faults in Medium Voltage Electrical Distribution Networks | Feb 9, 2025 | ARCFault Detection | —Unverified | 0 |
| Vision-Ultrasound Robotic System based on Deep Learning for Gas and Arc Hazard Detection in Manufacturing | Feb 8, 2025 | ARCFairness | —Unverified | 0 |
| Limitations of Large Language Models in Clinical Problem-Solving Arising from Inflexible Reasoning | Feb 5, 2025 | ARC | —Unverified | 0 |
| A Beam's Eye View to Fluence Maps 3D Network for Ultra Fast VMAT Radiotherapy Planning | Feb 5, 2025 | ARCSSIM | —Unverified | 0 |
| Efficient Implementation of the Global Cardinality Constraint with Costs | Feb 4, 2025 | ARC | —Unverified | 0 |
| The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles | Feb 3, 2025 | ARCMultimodal Reasoning | CodeCode Available | 2 |
| Pheromone-based Learning of Optimal Reasoning Paths | Jan 31, 2025 | ARCGSM8K | —Unverified | 0 |
| State Stream Transformer (SST) : Emergent Metacognitive Behaviours Through Latent State Persistence | Jan 30, 2025 | 8kARC | —Unverified | 0 |