SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 4301–4350 of 177,340 papers

Title | Status | Hype
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models | Code | 3
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control | Code | 3
Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models | Code | 3
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation | Code | 3
Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders | Code | 3
DataDecide: How to Predict Best Pretraining Data with Small Experiments | Code | 3
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry | Code | 3
UCF: Uncovering Common Features for Generalizable Deepfake Detection | Code | 3
Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection | Code | 3
REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers | Code | 3
C-Adapter: Adapting Deep Classifiers for Efficient Conformal Prediction Sets | Code | 3
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis | Code | 3
CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification | Code | 3
Modular Duality in Deep Learning | Code | 3
Distributed Prioritized Experience Replay | Code | 3
PromptHMR: Promptable Human Mesh Recovery | Code | 3
Pushing the Limits of Large Language Model Quantization via the Linearity Theorem | Code | 3
U-Net: Convolutional Networks for Biomedical Image Segmentation | Code | 3
History-Guided Video Diffusion | Code | 3
Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services | Code | 3
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval | Code | 3
Probabilistic Volumetric Fusion for Dense Monocular SLAM | Code | 3
Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation | Code | 3
Discovered Policy Optimisation | Code | 3
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning | Code | 3
On Distillation of Guided Diffusion Models | Code | 3
SWE-bench-java: A GitHub Issue Resolving Benchmark for Java | Code | 3
SoundStream: An End-to-End Neural Audio Codec | Code | 3
Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization Perspective | Code | 3
On the Content Bias in Fréchet Video Distance | Code | 3
Flow Matching for Generative Modeling | Code | 3
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training | Code | 3
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Code | 3
Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion | Code | 3
SkyMath: Technical Report | Code | 3
XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions Parameters | Code | 3
Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning | Code | 3
Designing and building the mlpack open-source machine learning library | Code | 3
One-step Diffusion with Distribution Matching Distillation | Code | 3
EAFormer: Scene Text Segmentation with Edge-Aware Transformers | Code | 3
Accurate clinical and biomedical Named entity recognition at scale | Code | 3
Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1 | Code | 3
EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models | Code | 3
LRM: Large Reconstruction Model for Single Image to 3D | Code | 3
GluonTS: Probabilistic Time Series Models in Python | Code | 3
Practical Deep Reinforcement Learning Approach for Stock Trading | Code | 3
CodeBLEU: a Method for Automatic Evaluation of Code Synthesis | Code | 3
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction | Code | 3
Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Code | 3
Text Embeddings Reveal (Almost) As Much As Text | Code | 3
Page 87 of 3547