SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3851–3900 of 661,570 papers

Title | Status | Hype
Distributed Prioritized Experience Replay | Code | 3
PromptHMR: Promptable Human Mesh Recovery | Code | 3
Pushing the Limits of Large Language Model Quantization via the Linearity Theorem | Code | 3
U-Net: Convolutional Networks for Biomedical Image Segmentation | Code | 3
History-Guided Video Diffusion | Code | 3
Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services | Code | 3
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval | Code | 3
Probabilistic Volumetric Fusion for Dense Monocular SLAM | Code | 3
Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation | Code | 3
Discovered Policy Optimisation | Code | 3
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning | Code | 3
On Distillation of Guided Diffusion Models | Code | 3
SWE-bench-java: A GitHub Issue Resolving Benchmark for Java | Code | 3
SoundStream: An End-to-End Neural Audio Codec | Code | 3
Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization Perspective | Code | 3
On the Content Bias in Fréchet Video Distance | Code | 3
Flow Matching for Generative Modeling | Code | 3
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training | Code | 3
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Code | 3
Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion | Code | 3
SkyMath: Technical Report | Code | 3
XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions Parameters | Code | 3
Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning | Code | 3
Designing and building the mlpack open-source machine learning library | Code | 3
One-step Diffusion with Distribution Matching Distillation | Code | 3
EAFormer: Scene Text Segmentation with Edge-Aware Transformers | Code | 3
Accurate clinical and biomedical Named entity recognition at scale | Code | 3
Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1 | Code | 3
EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models | Code | 3
LRM: Large Reconstruction Model for Single Image to 3D | Code | 3
GluonTS: Probabilistic Time Series Models in Python | Code | 3
Practical Deep Reinforcement Learning Approach for Stock Trading | Code | 3
CodeBLEU: a Method for Automatic Evaluation of Code Synthesis | Code | 3
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction | Code | 3
Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Code | 3
Text Embeddings Reveal (Almost) As Much As Text | Code | 3
dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching | Code | 3
SkillMimic: Learning Basketball Interaction Skills from Demonstrations | Code | 3
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation | Code | 3
MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding | Code | 3
MiniViT: Compressing Vision Transformers with Weight Multiplexing | Code | 3
SPMamba: State-space model is all you need in speech separation | Code | 3
Lighthouse: A User-Friendly Library for Reproducible Video Moment Retrieval and Highlight Detection | Code | 3
Vision as LoRA | Code | 3
Deep Limit Order Book Forecasting | Code | 3
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding | Code | 3
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting | Code | 3
EfficientFormer: Vision Transformers at MobileNet Speed | Code | 3
Demystify Mamba in Vision: A Linear Attention Perspective | Code | 3
Visual Large Language Models for Generalized and Specialized Applications | Code | 3