SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 9051-9075 of 177340 papers

Title | Status | Hype
nnSAM: Plug-and-play Segment Anything Model Improves nnUNet Performance | Code | 2
CursorCore: Assist Programming through Aligning Anything | Code | 2
Compositional Entailment Learning for Hyperbolic Vision-Language Models | Code | 2
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate | Code | 2
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models | Code | 2
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More | Code | 2
SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration | Code | 2
Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning | Code | 2
LS-EEND: Long-Form Streaming End-to-End Neural Diarization with Online Attractor Extraction | Code | 2
Progressive Autoregressive Video Diffusion Models | Code | 2
IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera | Code | 2
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions | Code | 2
An Undetectable Watermark for Generative Image Models | Code | 2
From Cognition to Precognition: A Future-Aware Framework for Social Navigation | Code | 2
VideoAgent: Self-Improving Video Generation | Code | 2
Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues | Code | 2
MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding | Code | 2
Evaluating Morphological Compositional Generalization in Large Language Models | Code | 2
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning | Code | 2
DM-Codec: Distilling Multimodal Representations for Speech Tokenization | Code | 2
GPT or BERT: why not both? | Code | 2
Model merging with SVD to tie the Knots | Code | 2
SciPIP: An LLM-based Scientific Paper Idea Proposer | Code | 2
Ada-MSHyper: Adaptive Multi-Scale Hypergraph Transformer for Time Series Forecasting | Code | 2
DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution Detection | Code | 2