SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 43514400 of 661570 papers

TitleStatusHype
Is Value Learning Really the Main Bottleneck in Offline RL?Code3
DANA: Domain-Aware Neurosymbolic Agents for Consistency and AccuracyCode3
Compact 3D Gaussian Splatting for Static and Dynamic Radiance FieldsCode3
MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAMCode3
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2Code3
DPLM-2: A Multimodal Diffusion Protein Language ModelCode3
Automated Formulaic Alpha Generation for Quantitative Investing using Evolutionary AlgorithmsCode3
The False Promise of Imitating Proprietary LLMsCode3
Visual Geometry Grounded Deep Structure From MotionCode3
A Foundation Model for the Earth SystemCode3
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement LearningCode3
Human-level play in the game of Diplomacy by combining language models with strategic reasoningCode3
Improving Text Embeddings with Large Language ModelsCode3
Performance Analysis of Open Source Machine Learning Frameworks for Various Parameters in Single-Threaded and Multi-Threaded ModesCode3
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation ModelsCode3
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal ControlCode3
Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action ModelsCode3
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon GenerationCode3
Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse AutoencodersCode3
DataDecide: How to Predict Best Pretraining Data with Small ExperimentsCode3
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax MimicryCode3
UCF: Uncovering Common Features for Generalizable Deepfake DetectionCode3
Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly DetectionCode3
REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion TransformersCode3
C-Adapter: Adapting Deep Classifiers for Efficient Conformal Prediction SetsCode3
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture SynthesisCode3
CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio ClassificationCode3
Modular Duality in Deep LearningCode3
Distributed Prioritized Experience ReplayCode3
PromptHMR: Promptable Human Mesh RecoveryCode3
Pushing the Limits of Large Language Model Quantization via the Linearity TheoremCode3
U-Net: Convolutional Networks for Biomedical Image SegmentationCode3
History-Guided Video DiffusionCode3
Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming ServicesCode3
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information RetrievalCode3
Probabilistic Volumetric Fusion for Dense Monocular SLAMCode3
Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence SegmentationCode3
Discovered Policy OptimisationCode3
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical ReasoningCode3
On Distillation of Guided Diffusion ModelsCode3
SWE-bench-java: A GitHub Issue Resolving Benchmark for JavaCode3
SoundStream: An End-to-End Neural Audio CodecCode3
Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization PerspectiveCode3
On the Content Bias in Fréchet Video DistanceCode3
Flow Matching for Generative ModelingCode3
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-TrainingCode3
3D Diffuser Actor: Policy Diffusion with 3D Scene RepresentationsCode3
Physics3D: Learning Physical Properties of 3D Gaussians via Video DiffusionCode3
SkyMath: Technical ReportCode3
XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions ParametersCode3
Show:102550
← PrevPage 88 of 13232Next →