SOTAVerified

ARC

Papers

Showing 51-100 of 554 papers

Title | Status | Hype
VeriMind: Agentic LLM for Automated Verilog Generation with a Novel Evaluation Metric | - | 0
Evaluation of Alignment-Regularity Characteristics in Deformable Image Registration | - | 0
Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance | - | 0
State-of-the-Art Stroke Lesion Segmentation at 1/1000th of Parameters | - | 0
ARC-Flow: Articulated, Resolution-Agnostic, Correspondence-Free Matching and Interpolation of 3D Shapes Under Flow Fields | - | 0
Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System | - | 0
Say Less, Mean More: Leveraging Pragmatics in Retrieval-Augmented Generation | - | 0
Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks | - | 0
Detecting Benchmark Contamination Through Watermarking | - | 0
An Autonomous Network Orchestration Framework Integrating Large Language Models with Continual Reinforcement Learning | - | 0
Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs | Code | 0
Diverse Inference and Verification for Advanced Reasoning | - | 0
ORI: O Routing Intelligence | - | 0
MixMin: Finding Data Mixtures via Convex Minimization | - | 0
Safe platooning control of connected and autonomous vehicles on curved multi-lane roads | - | 0
Task Generalization With AutoRegressive Compositional Structure: Can Learning From D Tasks Generalize to D^T Tasks? | - | 0
Understanding LLMs' Fluid Intelligence Deficiency: An Analysis of the ARC Task | Code | 0
Enhanced Rapid Detection of High-impedance Arc Faults in Medium Voltage Electrical Distribution Networks | - | 0
Vision-Ultrasound Robotic System based on Deep Learning for Gas and Arc Hazard Detection in Manufacturing | - | 0
Limitations of Large Language Models in Clinical Problem-Solving Arising from Inflexible Reasoning | - | 0
A Beam's Eye View to Fluence Maps 3D Network for Ultra Fast VMAT Radiotherapy Planning | - | 0
Efficient Implementation of the Global Cardinality Constraint with Costs | - | 0
The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles | Code | 2
Pheromone-based Learning of Optimal Reasoning Paths | - | 0
State Stream Transformer (SST): Emergent Metacognitive Behaviours Through Latent State Persistence | - | 0
FAAGC: Feature Augmentation on Adaptive Geodesic Curve Based on the shape space theory | - | 0
Unified 3D MRI Representations via Sequence-Invariant Contrastive Learning | Code | 0
Towards A Litmus Test for Common Sense | - | 0
Random Subspace Cubic-Regularization Methods, with Applications to Low-Rank Functions | - | 0
Scaling Graph-Based Dependency Parsing with Arc Vectorization and Attention-Based Refinement | - | 0
Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI | - | 0
The Utility of Hyperplane Angle Metric in Detecting Financial Concept Drift | Code | 0
Common Sense Is All You Need | - | 0
Semantic Exploration with Adaptive Gating for Efficient Problem Solving with Language Models | - | 0
Cluster & Disperse: a general air conflict resolution heuristic using unsupervised learning | - | 0
NSA: Neuro-symbolic ARC Challenge | Code | 0
Hybridising Reinforcement Learning and Heuristics for Hierarchical Directed Arc Routing Problems | Code | 0
In Case You Missed It: ARC 'Challenge' Is Not That Challenging | - | 0
SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs | - | 0
ARCEAK: An Automated Rule Checking Framework Enhanced with Architectural Knowledge | - | 0
Minimum Weighted Feedback Arc Sets for Ranking from Pairwise Comparisons | Code | 0
ConceptSearch: Towards Efficient Program Search Using LLMs for Abstraction and Reasoning Corpus (ARC) | Code | 0
ARC Prize 2024: Technical Report | Code | 3
Asymptotic enumeration of normal and hybridization networks via tree decoration | - | 0
Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset | - | 0
Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African Languages | - | 0
Abductive Symbolic Solver on Abstraction and Reasoning Corpus | - | 0
An Attempt to Develop a Neural Parser based on Simplified Head-Driven Phrase Structure Grammar on Vietnamese | - | 0
Lower Dimensional Spherical Representation of Medium Voltage Load Profiles for Visualization, Outlier Detection, and Generative Modelling | - | 0
Capturing Sparks of Abstraction for the ARC Challenge | Code | 0

No leaderboard results yet.