SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 8576-8600 of 474,278 papers

| Title | Status | Hype |
| --- | --- | --- |
| RL makes MLLMs see better than SFT | | 0 |
| QSVD: Efficient Low-rank Approximation for Unified Query-Key-Value Weight Compression in Low-Precision Vision-Language Models | Code | 0 |
| Who Taught the Lie? Responsibility Attribution for Poisoned Knowledge in Retrieval-Augmented Generation | Code | 0 |
| TokenAR: Multiple Subject Generation via Autoregressive Token-level enhancement | Code | 0 |
| Stroke2Sketch: Harnessing Stroke Attributes for Training-Free Sketch Generation | Code | 0 |
| SP-Rank: A Dataset for Ranked Preferences with Secondary Information | Code | 0 |
| Decoding Listeners Identity: Person Identification from EEG Signals Using a Lightweight Spiking Transformer | Code | 0 |
| CausalVerse: Benchmarking Causal Representation Learning with Configurable High-Fidelity Simulations | | 0 |
| TriAgent: Automated Biomarker Discovery with Deep Research Grounding for Triage in Acute Care by LLM-Based Multi-Agent Collaboration | Code | 0 |
| Nonlinear Dimensionality Reduction Techniques for Bayesian Optimization | Code | 0 |
| A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning | | 0 |
| PAFT: Prompt-Agnostic Fine-Tuning | | 0 |
| iLRM: An Iterative Large 3D Reconstruction Model | | 0 |
| Low-Frequency First: Eliminating Floating Artifacts in 3D Gaussian Splatting | | 0 |
| CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive Decoding | | 0 |
| Expanding the Action Space of LLMs to Reason Beyond Language | | 0 |
| FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in Finance Domain | | 0 |
| Semi-Supervised Regression with Heteroscedastic Pseudo-Labels | Code | 0 |
| CuSfM: CUDA-Accelerated Structure-from-Motion | Code | 0 |
| Proto-Former: Unified Facial Landmark Detection by Prototype Transformer | Code | 0 |
| Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation | | 0 |
| Chronos-2: From Univariate to Universal Forecasting | | 0 |
| Paper2Web: Let's Make Your Paper Alive! | | 0 |
| LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal | | 0 |
| SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions | | 0 |

Page 344 of 18,972