SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 30763100 of 661570 papers

TitleStatusHype
Diffusion Language Models Are Versatile Protein LearnersCode3
CAMixerSR: Only Details Need More "Attention"Code3
CLLMs: Consistency Large Language ModelsCode3
SynCode: LLM Generation with Grammar AugmentationCode3
Controllable Text Generation for Large Language Models: A SurveyCode3
RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly DetectionCode3
Generalizing Denoising to Non-Equilibrium Structures Improves Equivariant Force FieldsCode3
Retrieval Augmented Generation and Understanding in Vision: A Survey and New OutlookCode3
Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical ImagesCode3
AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain FrameworkCode3
Rotary Position Embedding for Vision TransformerCode3
AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and ModulationCode3
The Elements of Differentiable ProgrammingCode3
Advancing LLM Reasoning Generalists with Preference TreesCode3
Annif at SemEval-2025 Task 5: Traditional XMTC augmented by LLMsCode3
OGBench: Benchmarking Offline Goal-Conditioned RLCode3
HPNet: Dynamic Trajectory Forecasting with Historical Prediction AttentionCode3
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on GraphsCode3
NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous DrivingCode3
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning AgentCode3
VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement LearningCode3
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion ModelsCode3
ModernTCN: A Modern Pure Convolution Structure for General Time Series AnalysisCode3
Efficient Multimodal Large Language Models: A SurveyCode3
CV-VAE: A Compatible Video VAE for Latent Generative Video ModelsCode3
Show:102550
← PrevPage 124 of 26463Next →