| Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African Languages | Dec 1, 2024 | ARC, Multiple-choice | Unverified | 0 |
| Frequency Dynamic Convolutions for Sound Event Detection | Jun 15, 2025 | ARC, Event Detection | Unverified | 0 |
| From Threat to Tool: Leveraging Refusal-Aware Injection Attacks for Safety Alignment | Jun 7, 2025 | ARC, MMLU | Unverified | 0 |
| From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments | Nov 26, 2022 | ARC, Classification | Unverified | 0 |
| From Generation to Generalization: Emergent Few-Shot Learning in Video Diffusion Models | Jun 8, 2025 | ARC, Few-Shot Learning | Unverified | 0 |
| Frustratingly Easy Sentiment Analysis of Text Streams: Generating High-Quality Emotion Arcs Using Emotion Lexicons | Oct 13, 2022 | ARC, Sentiment Analysis | Unverified | 0 |
| Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI | Jan 13, 2025 | ARC, Benchmarking | Unverified | 0 |
| Generalized Support and Formal Development of Constraint Propagators | Apr 22, 2015 | ARC | Unverified | 0 |
| Generalized Totalizer Encoding for Pseudo-Boolean Constraints | Jul 21, 2015 | ARC | Unverified | 0 |
| Understanding Enthymemes in Argument Maps: Bridging Argument Mining and Logic-based Argumentation | Aug 16, 2024 | ARC, Argument Mining | Unverified | 0 |