SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 86518675 of 177340 papers

TitleStatusHype
SegVol: Universal and Interactive Volumetric Medical Image SegmentationCode2
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character DesignCode2
OccWorld: Learning a 3D Occupancy World Model for Autonomous DrivingCode2
Adapter is All You Need for Tuning Visual TasksCode2
Photo-SLAM: Real-time Simultaneous Localization and Photorealistic Mapping for Monocular, Stereo, and RGB-D CamerasCode2
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language ModelsCode2
Achieving Cross Modal Generalization with Multimodal Unified RepresentationCode2
M^4: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities and ModelsCode2
Language Models can Solve Computer TasksCode2
Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous DrivingCode2
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent RepresentationCode2
Spike-driven TransformerCode2
StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style AdapterCode2
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video UnderstandingCode2
Aligning and Prompting Everything All at Once for Universal Visual PerceptionCode2
Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor GenerationCode2
GauHuman: Articulated Gaussian Splatting from Monocular Human VideosCode2
DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-SolvingCode2
Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion modelsCode2
Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement LearningCode2
AnimateZero: Video Diffusion Models are Zero-Shot Image AnimatorsCode2
Mind2Web: Towards a Generalist Agent for the WebCode2
ClimateLearn: Benchmarking Machine Learning for Weather and Climate ModelingCode2
When Do Transformers Shine in RL? Decoupling Memory from Credit AssignmentCode2
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion ModelsCode2
Show:102550
← PrevPage 347 of 7094Next →