| metabench -- A Sparse Benchmark to Measure General Ability in Large Language Models | Jul 4, 2024 | ARCGSM8K | CodeCode Available | 0 |
| VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation | Jun 25, 2024 | ARCBenchmarking | CodeCode Available | 0 |
| The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale | Jun 25, 2024 | ARCLanguage Modeling | CodeCode Available | 1 |
| LLM-ARC: Enhancing LLMs with an Automated Reasoning Critic | Jun 25, 2024 | ARCLogical Reasoning | —Unverified | 0 |
| PORT: Preference Optimization on Reasoning Traces | Jun 23, 2024 | ARCGSM8K | —Unverified | 0 |
| AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models | Jun 19, 2024 | ARCMixture-of-Experts | CodeCode Available | 1 |
| Circular transformation of the European steel industry renders scrap metal a strategic resource | Jun 17, 2024 | ARC | —Unverified | 0 |
| Promises, Outlooks and Challenges of Diffusion Language Modeling | Jun 17, 2024 | ARCHellaSwag | —Unverified | 0 |
| Cross-Modal Learning for Anomaly Detection in Complex Industrial Process: Methodology and Benchmark | Jun 13, 2024 | Anomaly DetectionARC | CodeCode Available | 1 |
| Regularizing Numerical Extremals Along Singular Arcs: A Lie-Theoretic Approach | Jun 11, 2024 | ARC | —Unverified | 0 |
| Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal Examples | Jun 9, 2024 | ARCDiversity | CodeCode Available | 2 |
| Yuan 2.0-M32: Mixture of Experts with Attention Router | May 28, 2024 | ARCMath | CodeCode Available | 2 |
| A One-Layer Decoder-Only Transformer is a Two-Layer RNN: With an Application to Certified Robustness | May 27, 2024 | ARCDecoder | —Unverified | 0 |
| ARC: A Generalist Graph Anomaly Detector with In-Context Learning | May 27, 2024 | Anomaly DetectionARC | CodeCode Available | 1 |
| Adaptive Gradient Clipping for Robust Federated Learning | May 23, 2024 | ARCFederated Learning | —Unverified | 0 |
| Adaptive Retention & Correction: Test-Time Training for Continual Learning | May 23, 2024 | ARCContinual Learning | —Unverified | 0 |
| Modeling citation worthiness by using attention-based bidirectional long short-term memory networks and interpretable models | May 20, 2024 | ARCCitation worhtiness | CodeCode Available | 0 |
| Localized Adaptive Risk Control | May 13, 2024 | ARCFairness | CodeCode Available | 0 |
| Program Synthesis using Inductive Logic Programming for the Abstraction and Reasoning Corpus | May 10, 2024 | ARCInductive logic programming | —Unverified | 0 |
| Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning | May 1, 2024 | ARCGSM8K | CodeCode Available | 3 |
| Iterative Reasoning Preference Optimization | Apr 30, 2024 | ARCGSM8K | —Unverified | 0 |
| Student Data Paradox and Curious Case of Single Student-Tutor Model: Regressive Side Effects of Training LLMs for Personalized Learning | Apr 23, 2024 | ARCCommon Sense Reasoning | —Unverified | 0 |
| Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models | Apr 21, 2024 | ARCImage Generation | —Unverified | 0 |
| FisheyeDetNet: 360° Surround view Fisheye Camera based Object Detection System for Autonomous Driving | Apr 20, 2024 | ARCAutonomous Driving | —Unverified | 0 |
| Frenet-Serret Frame-based Decomposition for Part Segmentation of 3D Curvilinear Structures | Apr 19, 2024 | ARCSegmentation | CodeCode Available | 1 |
| Addressing the Abstraction and Reasoning Corpus via Procedural Example Generation | Apr 10, 2024 | ARCDiversity | CodeCode Available | 3 |
| Is The Watermarking Of LLM-Generated Code Robust? | Mar 24, 2024 | ARC | CodeCode Available | 1 |
| Reasoning Abilities of Large Language Models: In-Depth Analysis on the Abstraction and Reasoning Corpus | Mar 18, 2024 | ARC | CodeCode Available | 0 |
| Do Large Language Models Solve ARC Visual Analogies Like People Do? | Mar 13, 2024 | ARCLanguage Modeling | CodeCode Available | 0 |
| An Efficient Learning-based Solver Comparable to Metaheuristics for the Capacitated Arc Routing Problem | Mar 11, 2024 | ARCCombinatorial Optimization | —Unverified | 0 |
| The Emotion Dynamics of Literary Novels | Mar 4, 2024 | ARC | CodeCode Available | 0 |
| Metamorpheus: Interactive, Affective, and Creative Dream Narration Through Metaphorical Visual Storytelling | Mar 1, 2024 | ARCVisual Storytelling | —Unverified | 0 |
| CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay | Feb 7, 2024 | ARCData Augmentation | CodeCode Available | 1 |
| Neural networks for abstraction and reasoning: Towards broad generalization in machines | Feb 5, 2024 | ARCVisual Reasoning | CodeCode Available | 3 |
| A Truly Joint Neural Architecture for Segmentation and Parsing | Feb 4, 2024 | ARCSegmentation | —Unverified | 0 |
| Extending the kinematic theory of rapid movements with new primitives | Jan 29, 2024 | ARC | —Unverified | 0 |
| Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models | Jan 22, 2024 | ARCProperty Prediction | CodeCode Available | 0 |
| Generalized Planning for the Abstraction and Reasoning Corpus | Jan 15, 2024 | ARCvalid | CodeCode Available | 1 |
| Robot-Assisted Deep Venous Thrombosis Ultrasound Examination using Virtual Fixture | Jan 4, 2024 | ARCPosition | CodeCode Available | 0 |
| Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation | Dec 19, 2023 | ARCImage Segmentation | CodeCode Available | 1 |
| A Customizable Generator for Comic-Style Visual Narrative | Dec 14, 2023 | ARCDecision Making | —Unverified | 0 |
| Advancements in Arc Fault Detection for Electrical Distribution Systems: A Comprehensive Review from Artificial Intelligence Perspective | Nov 28, 2023 | ARCFault Detection | —Unverified | 0 |
| Token-Level Adaptation of LoRA Adapters for Downstream Task Generalization | Nov 17, 2023 | ARCGSM8K | CodeCode Available | 1 |
| Solving ARC visual analogies with neural embeddings and vector arithmetic: A generalized method | Nov 14, 2023 | ARCDimensionality Reduction | CodeCode Available | 0 |
| Cut-set and Stability Constrained Optimal Power Flow for Resilient Operation During Wildfires | Nov 9, 2023 | ARC | —Unverified | 0 |
| Tackling the Abstraction and Reasoning Corpus (ARC) with Object-centric Models and the MDL Principle | Nov 1, 2023 | ARC | CodeCode Available | 1 |
| Improving Denoising Diffusion Models via Simultaneous Estimation of Image and Noise | Oct 26, 2023 | ARCDenoising | —Unverified | 0 |
| Online Two-stage Thermal History Prediction Method for Metal Additive Manufacturing of Thin Walls | Oct 24, 2023 | ARCComputational Efficiency | —Unverified | 0 |
| Graph Attention-based Deep Reinforcement Learning for solving the Chinese Postman Problem with Load-dependent costs | Oct 24, 2023 | ARCDeep Reinforcement Learning | CodeCode Available | 0 |
| 4 and 7-bit Labeling for Projective and Non-Projective Dependency Trees | Oct 22, 2023 | ARC | —Unverified | 0 |