SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 30763100 of 661570 papers

TitleStatusHype
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud LearningCode3
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image SynthesisCode3
Rethinking the Evaluation of Visible and Infrared Image FusionCode3
Rectified Diffusion: Straightness Is Not Your Need in Rectified FlowCode3
TopoTune : A Framework for Generalized Combinatorial Complex Neural NetworksCode3
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision MakingCode3
AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image GenerationCode3
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance DesignCode3
AgentSquare: Automatic LLM Agent Search in Modular Design SpaceCode3
Residual Kolmogorov-Arnold Network for Enhanced Deep LearningCode3
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI AgentsCode3
SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model InferenceCode3
Accelerating Diffusion Transformers with Token-wise Feature CachingCode3
Neuron-Level Sequential Editing for Large Language ModelsCode3
High-Speed Stereo Visual SLAM for Low-Powered Computing DevicesCode3
CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character controlCode3
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model TransformationCode3
MELODI: Exploring Memory Compression for Long ContextsCode3
How to Train Long-Context Language Models (Effectively)Code3
ControlAR: Controllable Image Generation with Autoregressive ModelsCode3
Diffusion Models are Evolutionary AlgorithmsCode3
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation ModelsCode3
HELMET: How to Evaluate Long-Context Language Models Effectively and ThoroughlyCode3
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based AgentsCode3
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMsCode3
Show:102550
← PrevPage 124 of 26463Next →