SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 35013550 of 659983 papers

TitleStatusHype
Improving Dictionary Learning with Gated Sparse AutoencodersCode3
Open3D: A Modern Library for 3D Data ProcessingCode3
ATPrompt: Textual Prompt Learning with Embedded AttributesCode3
N-BEATS: Neural basis expansion analysis for interpretable time series forecastingCode3
Mip-Splatting: Alias-free 3D Gaussian SplattingCode3
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual RepresentationsCode3
A Vision-Language Foundation Model to Enhance Efficiency of Chest X-ray InterpretationCode3
Scaling Rectified Flow Transformers for High-Resolution Image SynthesisCode3
MegaPairs: Massive Data Synthesis For Universal Multimodal RetrievalCode3
BasicVSR: The Search for Essential Components in Video Super-Resolution and BeyondCode3
Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMsCode3
WHAM: Reconstructing World-grounded Humans with Accurate 3D MotionCode3
Block-NeRF: Scalable Large Scene Neural View SynthesisCode3
SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General SoundCode3
Vision Transformers for Dense PredictionCode3
RepViT: Revisiting Mobile CNN From ViT PerspectiveCode3
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion ModelCode3
CRAG -- Comprehensive RAG BenchmarkCode3
Major TOM: Expandable Datasets for Earth ObservationCode3
Uni-QSAR: an Auto-ML Tool for Molecular Property PredictionCode3
Optimal Variable Speed Limit Control Strategy on Freeway Segments under Fog ConditionsCode3
Towards General-purpose Infrastructure for Protecting Scientific Data Under StudyCode3
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement LearningCode3
Genie: Generative Interactive EnvironmentsCode3
Exploring Regional Clues in CLIP for Zero-Shot Semantic SegmentationCode3
Efficiently Serving LLM Reasoning Programs with CertaindexCode3
SPO: Sequential Monte Carlo Policy OptimisationCode3
AgentStudio: A Toolkit for Building General Virtual AgentsCode3
Is Value Learning Really the Main Bottleneck in Offline RL?Code3
DANA: Domain-Aware Neurosymbolic Agents for Consistency and AccuracyCode3
Compact 3D Gaussian Splatting for Static and Dynamic Radiance FieldsCode3
MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAMCode3
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2Code3
DPLM-2: A Multimodal Diffusion Protein Language ModelCode3
Automated Formulaic Alpha Generation for Quantitative Investing using Evolutionary AlgorithmsCode3
The False Promise of Imitating Proprietary LLMsCode3
Visual Geometry Grounded Deep Structure From MotionCode3
A Foundation Model for the Earth SystemCode3
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement LearningCode3
Human-level play in the game of Diplomacy by combining language models with strategic reasoningCode3
Improving Text Embeddings with Large Language ModelsCode3
Performance Analysis of Open Source Machine Learning Frameworks for Various Parameters in Single-Threaded and Multi-Threaded ModesCode3
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation ModelsCode3
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal ControlCode3
Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action ModelsCode3
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon GenerationCode3
Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse AutoencodersCode3
DataDecide: How to Predict Best Pretraining Data with Small ExperimentsCode3
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax MimicryCode3
Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object DetectionCode3
Show:102550
← PrevPage 71 of 13200Next →