SOTAVerified

ARC

Papers

Showing 5175 of 554 papers

TitleStatusHype
VeriMind: Agentic LLM for Automated Verilog Generation with a Novel Evaluation Metric0
Evaluation of Alignment-Regularity Characteristics in Deformable Image Registration0
Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance0
State-of-the-Art Stroke Lesion Segmentation at 1/1000th of Parameters0
ARC-Flow : Articulated, Resolution-Agnostic, Correspondence-Free Matching and Interpolation of 3D Shapes Under Flow Fields0
Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System0
Say Less, Mean More: Leveraging Pragmatics in Retrieval-Augmented Generation0
Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks0
Detecting Benchmark Contamination Through Watermarking0
An Autonomous Network Orchestration Framework Integrating Large Language Models with Continual Reinforcement Learning0
Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMsCode0
MixMin: Finding Data Mixtures via Convex Minimization0
Diverse Inference and Verification for Advanced Reasoning0
ORI: O Routing Intelligence0
Safe platooning control of connected and autonomous vehicles on curved multi-lane roads0
Task Generalization With AutoRegressive Compositional Structure: Can Learning From Tasks Generalize to ^T Tasks?0
Understanding LLMs' Fluid Intelligence Deficiency: An Analysis of the ARC TaskCode0
Enhanced Rapid Detection of High-impedance Arc Faults in Medium Voltage Electrical Distribution Networks0
Vision-Ultrasound Robotic System based on Deep Learning for Gas and Arc Hazard Detection in Manufacturing0
Limitations of Large Language Models in Clinical Problem-Solving Arising from Inflexible Reasoning0
A Beam's Eye View to Fluence Maps 3D Network for Ultra Fast VMAT Radiotherapy Planning0
Efficient Implementation of the Global Cardinality Constraint with Costs0
The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal PuzzlesCode2
Pheromone-based Learning of Optimal Reasoning Paths0
State Stream Transformer (SST) : Emergent Metacognitive Behaviours Through Latent State Persistence0
Show:102550
← PrevPage 3 of 23Next →

No leaderboard results yet.