SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 4001-4025 of 177,340 papers

Title | Status | Hype
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding | Code | 3
Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation | Code | 3
PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers | Code | 3
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning | Code | 3
Unlimited-Size Diffusion Restoration | Code | 3
TorchSparse: Efficient Point Cloud Inference Engine | Code | 3
Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis | Code | 3
AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection | Code | 3
From Matching to Generation: A Survey on Generative Information Retrieval | Code | 3
SAM-Med2D | Code | 3
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Code | 3
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Code | 3
DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations | Code | 3
GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation | Code | 3
Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details | Code | 3
ResearchTown: Simulator of Human Research Community | Code | 3
From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision | Code | 3
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs | Code | 3
LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion | Code | 3
TorchDrug: A Powerful and Flexible Machine Learning Platform for Drug Discovery | Code | 3
MathArena: Evaluating LLMs on Uncontaminated Math Competitions | Code | 3
Frequency-aware Feature Fusion for Dense Image Prediction | Code | 3
VoiceBench: Benchmarking LLM-Based Voice Assistants | Code | 3
LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation | Code | 3
MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents | Code | 3