SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 57015750 of 661570 papers

TitleStatusHype
LandMarkSystem Technical ReportCode2
Datasets for Depression Modeling in Social Media: An OverviewCode2
UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement LearningCode2
Harmonizing Visual Representations for Unified Multimodal Understanding and GenerationCode2
Mobile-VideoGPT: Fast and Accurate Video Understanding Language ModelCode2
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D DataCode2
MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree SearchCode2
Progressive Focused Transformer for Single Image Super-ResolutionCode2
Unlocking Efficient Long-to-Short LLM Reasoning with Model MergingCode2
SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation SparsityCode2
Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic SegmentationCode2
Riemannian Optimization on Relaxed Indicator Matrix ManifoldCode2
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face DetectorCode2
Unified Multimodal Discrete DiffusionCode2
Correcting Deviations from Normality: A Reformulated Diffusion Model for Multi-Class Unsupervised Anomaly DetectionCode2
Surg-3M: A Dataset and Foundation Model for Perception in Surgical SettingsCode2
Med3DVLM: An Efficient Vision-Language Model for 3D Medical Image AnalysisCode2
Scaling Down Text Encoders of Text-to-Image Diffusion ModelsCode2
SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data PretrainingCode2
RGL: A Graph-Centric, Modular Framework for Efficient Retrieval-Augmented Generation on GraphsCode2
GENIUS: A Generative Framework for Universal Multimodal SearchCode2
Unlocking the Hidden Potential of CLIP in Generalizable Deepfake DetectionCode2
UniMoMo: Unified Generative Modeling of 3D Molecules for De Novo Binder DesignCode2
HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian SplattingCode2
COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian SplittingCode2
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action PolicyCode2
Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image DehazingCode2
Cross-Tokenizer Distillation via Approximate Likelihood MatchingCode2
Change3D: Revisiting Change Detection and Captioning from A Video Modeling PerspectiveCode2
Towards Training-free Anomaly Detection with Vision and Language Foundation ModelsCode2
UniPCGC: Towards Practical Point Cloud Geometry Compression via an Efficient Unified ApproachCode2
Reasoning to Learn from Latent ThoughtsCode2
BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV CacheCode2
LLaVAction: evaluating and training multi-modal large language models for action recognitionCode2
MaSS13K: A Matting-level Semantic Segmentation BenchmarkCode2
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse AutoencodersCode2
Hardware-Rasterized Ray-Based Gaussian SplattingCode2
LinkAlign: Scalable Schema Linking for Real-World Large-Scale Multi-Database Text-to-SQLCode2
DINO in the Room: Leveraging 2D Foundation Models for 3D SegmentationCode2
MC-LLaVA: Multi-Concept Personalized Vision-Language ModelCode2
FG^2: Fine-Grained Cross-View Localization by Fine-Grained Feature MatchingCode2
PolarFree: Polarization-based Reflection-free ImagingCode2
Surrogate Learning in Meta-Black-Box Optimization: A Preliminary StudyCode2
MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object TrackingCode2
DCEvo: Discriminative Cross-Dimensional Evolutionary Learning for Infrared and Visible Image FusionCode2
LightLoc: Learning Outdoor LiDAR Localization at Light SpeedCode2
CODA: Repurposing Continuous VAEs for Discrete TokenizationCode2
RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images and A BenchmarkCode2
OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-ImprovementCode2
Modifying Large Language Model Post-Training for Diverse Creative WritingCode2
Show:102550
← PrevPage 115 of 13232Next →