SOTAVerified

ARC

Papers

Showing 150 of 554 papers

TitleStatusHype
Fast Text-to-Audio Generation with Adversarial Post-TrainingCode7
ARC Prize 2024: Technical ReportCode3
ST-MoE: Designing Stable and Transferable Sparse Expert ModelsCode3
CAX: Cellular Automata Accelerated in JAXCode3
DataDecide: How to Predict Best Pretraining Data with Small ExperimentsCode3
The Surprising Effectiveness of Test-Time Training for Few-Shot LearningCode3
Neural networks for abstraction and reasoning: Towards broad generalization in machinesCode3
Finetuned Language Models Are Zero-Shot LearnersCode3
Digitizing Touch with an Artificial Multimodal FingertipCode3
Addressing the Abstraction and Reasoning Corpus via Procedural Example GenerationCode3
A Parallelizable Lattice Rescoring Strategy with Neural Language ModelsCode3
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference LearningCode3
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-RewardingCode2
Searching Latent Program SpacesCode2
The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal PuzzlesCode2
Yuan 2.0-M32: Mixture of Experts with Attention RouterCode2
ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement LearningCode2
Combining Induction and Transduction for Abstract ReasoningCode2
Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal ExamplesCode2
Scaling Laws with Vocabulary: Larger Models Deserve Larger VocabulariesCode2
IterGen: Iterative Semantic-aware Structured LLM Generation with BacktrackingCode1
H-ARC: A Robust Estimate of Human Performance on the Abstraction and Reasoning Corpus BenchmarkCode1
HELM: Hyperbolic Large Language Models via Mixture-of-Curvature ExpertsCode1
Generalized Planning for the Abstraction and Reasoning CorpusCode1
Adaptive Consistency Regularization for Semi-Supervised Transfer LearningCode1
Evaluating Emotion Arcs Across Languages: Bridging the Global Divide in Sentiment AnalysisCode1
FreeLB: Enhanced Adversarial Training for Natural Language UnderstandingCode1
How to Enhance Causal Discrimination of Utterances: A Case on Affective ReasoningCode1
Hypothesis Search: Inductive Reasoning with Language ModelsCode1
Is The Watermarking Of LLM-Generated Code Robust?Code1
First Steps of an Approach to the ARC Challenge based on Descriptive Grid Models and the Minimum Description Length PrincipleCode1
Frenet-Serret Frame-based Decomposition for Part Segmentation of 3D Curvilinear StructuresCode1
Graphs, Constraints, and Search for the Abstraction and Reasoning CorpusCode1
Large Language Model (LLM) as a System of Multiple Expert Agents: An Approach to solve the Abstraction and Reasoning Corpus (ARC) ChallengeCode1
Efficient Second-Order TreeCRF for Neural Dependency ParsingCode1
Dynamic Semantic Graph Construction and Reasoning for Explainable Multi-hop Science Question AnsweringCode1
Evaluating Factuality in Generation with Dependency-level EntailmentCode1
DecoupleNet: A Lightweight Backbone Network With Efficient Feature Decoupling for Remote Sensing Visual TasksCode1
An Approach to Solving the Abstraction and Reasoning Corpus (ARC) ChallengeCode1
Cross-Modal Learning for Anomaly Detection in Complex Industrial Process: Methodology and BenchmarkCode1
FactGraph: Evaluating Factuality in Summarization with Semantic Graph RepresentationsCode1
Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language ModelsCode1
DIPN: Deep Interaction Prediction Network with Application to Clutter RemovalCode1
DRACO: Differentiable Reconstruction for Arbitrary CBCT OrbitsCode1
Adversarial Vulnerability of Randomized EnsemblesCode1
Efficient Adaptation of Large Vision Transformer via Adapter Re-ComposingCode1
AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language ModelsCode1
Combining (second-order) graph-based and headed-span-based projective dependency parsingCode1
Chaos is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation OverlapCode1
A Comparison of Supervised Learning to Match Methods for Product SearchCode1
Show:102550
← PrevPage 1 of 12Next →

No leaderboard results yet.