| Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African Languages | Dec 1, 2024 | ARC, Multiple-choice | —Unverified | 0 |
| Frequency Dynamic Convolutions for Sound Event Detection | Jun 15, 2025 | ARC, Event Detection | —Unverified | 0 |
| From Threat to Tool: Leveraging Refusal-Aware Injection Attacks for Safety Alignment | Jun 7, 2025 | ARC, MMLU | —Unverified | 0 |
| From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments | Nov 26, 2022 | ARC, Classification | —Unverified | 0 |
| From Generation to Generalization: Emergent Few-Shot Learning in Video Diffusion Models | Jun 8, 2025 | ARC, Few-Shot Learning | —Unverified | 0 |
| Frustratingly Easy Sentiment Analysis of Text Streams: Generating High-Quality Emotion Arcs Using Emotion Lexicons | Oct 13, 2022 | ARC, Sentiment Analysis | —Unverified | 0 |
| Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI | Jan 13, 2025 | ARC, Benchmarking | —Unverified | 0 |
| Generalized Support and Formal Development of Constraint Propagators | Apr 22, 2015 | ARC | —Unverified | 0 |
| Generalized Totalizer Encoding for Pseudo-Boolean Constraints | Jul 21, 2015 | ARC | —Unverified | 0 |
| Understanding Enthymemes in Argument Maps: Bridging Argument Mining and Logic-based Argumentation | Aug 16, 2024 | ARC, Argument Mining | —Unverified | 0 |
| Genetic Programming Hyper-Heuristics with Vehicle Collaboration for Uncertain Capacitated Arc Routing Problems | Nov 20, 2019 | ARC, Meter Reading | —Unverified | 0 |
| Geometry of epithelial cells provides a robust method for image based inference of stress within tissues | Dec 11, 2018 | ARC | —Unverified | 0 |
| Graphical Requirements for Multistationarity in Reaction Networks and their Verification in BioModels | Sep 24, 2018 | ARC | —Unverified | 0 |
| Uniform Subdivision of Omnidirectional Camera Space for Efficient Spherical Stereo Matching | Jan 1, 2022 | ARC, Stereo Matching | —Unverified | 0 |
| Heuristic solutions to robust variants of the minimum-cost integer flow problem | Jul 21, 2019 | ARC | —Unverified | 0 |
| Hierarchical Multi-resolution Mesh Networks for Brain Decoding | Jul 12, 2016 | ARC, Brain Decoding | —Unverified | 0 |
| How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning | May 30, 2025 | ARC, Reinforcement Learning (RL) | —Unverified | 0 |
| IBM Research at the CoNLL 2018 Shared Task on Multilingual Parsing | Oct 1, 2018 | ARC, Dependency Parsing | —Unverified | 0 |
| IEOPF: An Active Contour Model for Image Segmentation with Inhomogeneities Estimated by Orthogonal Primary Functions | Dec 5, 2017 | ARC, Clustering | —Unverified | 0 |
| Improved Acyclicity Reasoning for Bayesian Network Structure Learning with Constraint Programming | Jun 23, 2021 | ARC | —Unverified | 0 |
| Improved RNA pseudoknots prediction and classification using a new topological invariant | May 16, 2016 | ARC | —Unverified | 0 |
| Improving Denoising Diffusion Models via Simultaneous Estimation of Image and Noise | Oct 26, 2023 | ARC, Denoising | —Unverified | 0 |
| In Case You Missed It: ARC 'Challenge' Is Not That Challenging | Dec 23, 2024 | ARC, Multiple-choice | —Unverified | 0 |
| Infusing Lattice Symmetry Priors in Attention Mechanisms for Sample-Efficient Abstract Geometric Reasoning | Jun 5, 2023 | ARC | —Unverified | 0 |
| In-situ monitoring additive manufacturing process with AI edge computing | Jan 2, 2023 | ARC, Edge-computing | —Unverified | 0 |