SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers

Showing 7801-7825 of 474,278 papers

Title | Status | Hype
Efficient Face Super-Resolution via Wavelet-based Feature Enhancement Network | Code | 2
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process | Code | 2
Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning | Code | 2
NAVIX: Scaling MiniGrid Environments with JAX | Code | 2
Temporal Feature Matters: A Framework for Diffusion Model Quantization | Code | 2
Perm: A Parametric Representation for Multi-Style 3D Hair Modeling | Code | 2
VSSD: Vision Mamba with Non-Causal State Space Duality | Code | 2
Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network | Code | 2
Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation | Code | 2
Contrastive Learning of Asset Embeddings from Financial Time Series | Code | 2
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors | Code | 2
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning | Code | 2
FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing | Code | 2
VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset | Code | 2
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision | Code | 2
RefMask3D: Language-Guided Transformer for 3D Referring Segmentation | Code | 2
RegionDrag: Fast Region-Based Image Editing with Diffusion Models | Code | 2
LoRA-Pro: Are Low-Rank Adapters Properly Optimized? | Code | 2
Towards Localized Fine-Grained Control for Facial Expression Generation | Code | 2
The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models | Code | 2
Reshape Dimensions Network for Speaker Recognition | Code | 2
DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction | Code | 2
u-μP: The Unit-Scaled Maximal Update Parametrization | Code | 2
Raindrop Clarity: A Dual-Focused Dataset for Day and Night Raindrop Removal | Code | 2
dlordinal: a Python package for deep ordinal classification | Code | 2
Page 313 of 18,972