SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 78517900 of 661570 papers

TitleStatusHype
Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token DictionaryCode2
Meta Prompting for AI SystemsCode2
VideoShield: Regulating Diffusion-based Video Generation Models via WatermarkingCode2
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution AdaptationCode2
FreeVC: Towards High-Quality Text-Free One-Shot Voice ConversionCode2
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature MimickingCode2
MS-DETR: Efficient DETR Training with Mixed SupervisionCode2
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical ReasoningCode2
Bokehlicious: Photorealistic Bokeh Rendering with Controllable AperturesCode2
Accelerating Transformers with Spectrum-Preserving Token MergingCode2
Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to SeeCode2
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation LearningCode2
OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image GenerationCode2
ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMsCode2
CATT: Character-based Arabic Tashkeel TransformerCode2
Monocular Occupancy Prediction for Scalable Indoor ScenesCode2
Very fast Bayesian Additive Regression Trees on GPUCode2
Raindrop Clarity: A Dual-Focused Dataset for Day and Night Raindrop RemovalCode2
MARG: Multi-Agent Review Generation for Scientific PapersCode2
YOLOv8-ResCBAM: YOLOv8 Based on An Effective Attention Module for Pediatric Wrist Fracture DetectionCode2
Ignore Previous Prompt: Attack Techniques For Language ModelsCode2
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth GenerationCode2
GPT Can Solve Mathematical Problems Without a CalculatorCode2
Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and ReviewCode2
VSSD: Vision Mamba with Non-Causal State Space DualityCode2
SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any VideoCode2
Dynamic Factor Allocation Leveraging Regime-Switching SignalsCode2
BoQ: A Place is Worth a Bag of Learnable QueriesCode2
Alpha-CLIP: A CLIP Model Focusing on Wherever You WantCode2
Dual Vision TransformerCode2
DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image InpaintingCode2
Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects SuppressionCode2
Learning to Decode Collaboratively with Multiple Language ModelsCode2
Q-DiT: Accurate Post-Training Quantization for Diffusion TransformersCode2
QAQ: Quality Adaptive Quantization for LLM KV CacheCode2
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion ModelsCode2
IsolateGPT: An Execution Isolation Architecture for LLM-Based Agentic SystemsCode2
Tracking Meets LoRA: Faster Training, Larger Model, Stronger PerformanceCode2
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion ModelsCode2
Beyond Text: Frozen Large Language Models in Visual Signal ComprehensionCode2
RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation ModelCode2
The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023Code2
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation LearningCode2
OpenGraph: Open-Vocabulary Hierarchical 3D Graph Representation in Large-Scale Outdoor EnvironmentsCode2
Generative Region-Language Pretraining for Open-Ended Object DetectionCode2
A Comprehensive Study of Multimodal Large Language Models for Image Quality AssessmentCode2
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT AdaptationCode2
Graph Neural Networks for Learning Equivariant Representations of Neural NetworksCode2
Diversified and Personalized Multi-rater Medical Image SegmentationCode2
A Multimodal Vision Foundation Model for Clinical DermatologyCode2
Show:102550
← PrevPage 158 of 13232Next →