SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 38013825 of 661570 papers

TitleStatusHype
What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?Code3
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language ModelsCode3
MACE: Mass Concept Erasure in Diffusion ModelsCode3
RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly DetectionCode3
uniGradICON: A Foundation Model for Medical Image RegistrationCode3
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of contextCode3
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon GenerationCode3
LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image SegmentationCode3
Unbiased Estimator for Distorted Conics in Camera CalibrationCode3
Embodied Understanding of Driving ScenariosCode3
Bridging Language and Items for Retrieval and RecommendationCode3
Behavior Generation with Latent ActionsCode3
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion ModelsCode3
Learning to Use Tools via Cooperative and Interactive AgentsCode3
PromptKD: Unsupervised Prompt Distillation for Vision-Language ModelsCode3
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language ModelsCode3
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence ModelingCode3
KnowAgent: Knowledge-Augmented Planning for LLM-Based AgentsCode3
Scaling Rectified Flow Transformers for High-Resolution Image SynthesisCode3
Beyond Specialization: Assessing the Capabilities of MLLMs in Age and Gender EstimationCode3
Diffusion-TS: Interpretable Diffusion for General Time Series GenerationCode3
Vision-Language Models for Medical Report Generation and Visual Question Answering: A ReviewCode3
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-ServeCode3
NeuSpeech: Decode Neural signal as SpeechCode3
Trial and Error: Exploration-Based Trajectory Optimization for LLM AgentsCode3
Show:102550
← PrevPage 153 of 26463Next →