SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 87268750 of 474278 papers

TitleStatusHype
Vikhr: Constructing a State-of-the-art Bilingual Open-Source Instruction-Following Large Language Model for RussianCode2
Model Editing as a Robust and Denoised variant of DPO: A Case Study on ToxicityCode2
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept CompositionCode2
FedCache 2.0: Federated Edge Learning with Knowledge Caching and Dataset DistillationCode2
What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence FunctionsCode2
BrainMorph: A Foundational Keypoint Model for Robust and Flexible Brain MRI RegistrationCode2
RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging RadarCode2
VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal GroundingCode2
I2I-Mamba: Multi-modal medical image synthesis via selective state space modelingCode2
LightningDrag: Lightning Fast and Accurate Drag-based Image Editing Emerging from VideosCode2
The future of cosmological likelihood-based inference: accelerated high-dimensional parameter estimation and model comparisonCode2
Mamba in Speech: Towards an Alternative to Self-AttentionCode2
KPConvX: Modernizing Kernel Point Convolution with Kernel AttentionCode2
FAdam: Adam is a natural gradient optimizer using diagonal empirical Fisher informationCode2
RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor SearchCode2
Large Language Models Meet NLP: A SurveyCode2
LLM Processes: Numerical Predictive Distributions Conditioned on Natural LanguageCode2
ProtT3: Protein-to-Text Generation for Text-based Protein UnderstandingCode2
Reducing Transformer Key-Value Cache Size with Cross-Layer AttentionCode2
SirLLM: Streaming Infinite Retentive LLMCode2
Wav-KAN: Wavelet Kolmogorov-Arnold NetworksCode2
Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in MammographyCode2
Diff-BGM: A Diffusion Model for Video Background Music GenerationCode2
MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text ExpertiseCode2
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics BenchmarkCode2
Show:102550
← PrevPage 350 of 18972Next →