SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 651675 of 659983 papers

TitleStatusHype
LineMVGNN: Anti-Money Laundering with Line-Graph-Assisted Multi-View Graph Neural Networks0
LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset0
LLMORPH: Automated Metamorphic Testing of Large Language Models0
LLMLOOP: Improving LLM-Generated Code and Tests through Automated Iterative Feedback Loops0
M3T: Discrete Multi-Modal Motion Tokens for Sign Language Production0
Swiss-Bench SBP-002: A Frontier Model Comparison on Swiss Legal and Regulatory Tasks0
λSplit: Self-Supervised Content-Aware Spectral Unmixing for Fluorescence Microscopy0
Foundation Model Embeddings Meet Blended Emotions: A Multimodal Fusion Approach for the BLEMORE Challenge0
Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages0
Boost Like a (Var)Pro: Trust-Region Gradient Boosting via Variable Projection0
Probing Ethical Framework Representations in Large Language Models: Structure, Entanglement, and Methodological Challenges0
GTO Wizard Benchmark0
Echoes: A semantically-aligned music deepfake detection dataset0
Energy Efficient Software Hardware CoDesign for Machine Learning: From TinyML to Large Language Models0
Grounding Vision and Language to 3D Masks for Long-Horizon Box Rearrangement0
Prototype Fusion: A Training-Free Multi-Layer Approach to OOD Detection0
PLACID: Privacy-preserving Large language models for Acronym Clinical Inference and Disambiguation0
Learning What Can Be Picked: Active Reachability Estimation for Efficient Robotic Fruit Harvesting0
Assessment Design in the AI Era: A Method for Identifying Items Functioning Differentially for Humans and Chatbots0
MoCHA: Denoising Caption Supervision for Motion-Text Retrieval0
Dual-Gated Epistemic Time-Dilation: Autonomous Compute Modulation in Asynchronous MARL0
Autoregressive Guidance of Deep Spatially Selective Filters using Bayesian Tracking for Efficient Extraction of Moving Speakers0
Bi-CRCL: Bidirectional Conservative-Radical Complementary Learning with Pre-trained Foundation Models for Class-incremental Medical Image Analysis0
An Adapter-free Fine-tuning Approach for Tuning 3D Foundation Models0
Wasserstein Parallel Transport for Predicting the Dynamics of Statistical Systems0
Show:102550
← PrevPage 27 of 26400Next →