SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 62016225 of 474278 papers

TitleStatusHype
Sequential Testing for Descriptor-Agnostic LiDAR Loop Closure in Repetitive EnvironmentsCode0
Benchmarking Real-World Medical Image Classification with Noisy Labels: Challenges, Practice, and OutlookCode0
Gradient-Guided Learning Network for Infrared Small Target DetectionCode0
IF-Bench: Benchmarking and Enhancing MLLMs for Infrared Images with Generative Visual PromptingCode0
Training One Model to Master Cross-Level Agentic Actions via Reinforcement LearningCode0
Benchmarking Document Parsers on Mathematical Formula Extraction from PDFsCode0
Diffusion Is Your Friend in Show, Suggest and TellCode0
DB2-TransF: All You Need Is Learnable Daubechies Wavelets for Time Series ForecastingCode0
SoMe: A Realistic Benchmark for LLM-based Social Media AgentsCode0
Generalization vs. Specialization: Evaluating Segment Anything Model (SAM3) Zero-Shot Segmentation Against Fine-Tuned YOLO DetectorsCode0
MolSculpt: Sculpting 3D Molecular Geometries from Chemical SyntaxCode0
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform0
Decentralized Trust for Space AI: Blockchain-Based Federated Learning Across Multi-Vendor LEO Satellite NetworksCode0
WonderZoom: Multi-Scale 3D World Generation0
SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass0
AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models0
Arbitrage: Efficient Reasoning via Advantage-Aware Speculation0
Direct transfer of optimized controllers to similar systems using dimensionless MPCCode0
SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing ImagesCode0
SimpleFold: Folding Proteins is Simpler than You Think0
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance0
LLM Collaboration With Multi-Agent Reinforcement LearningCode0
Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical SegmentationCode0
OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic WorkflowsCode0
MVP: Multiple View Prediction Improves GUI GroundingCode0
Show:102550
← PrevPage 249 of 18972Next →