SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1720117250 of 474278 papers

TitleStatusHype
EgoBlind: Towards Egocentric Visual Assistance for the BlindCode1
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability TreesCode1
SAS: Segment Any 3D Scene with Integrated 2D PriorsCode1
NullFace: Training-Free Localized Face AnonymizationCode1
MegaSR: Mining Customized Semantics and Expressive Guidance for Image Super-ResolutionCode1
Chemical reasoning in LLMs unlocks steerable synthesis planning and reaction mechanism elucidationCode1
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical ReachabilityCode1
Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning PoliciesCode1
All That Glitters Is Not Gold: Key-Secured 3D Secrets within 3D Gaussian SplattingCode1
Open-Set Gait Recognition from Sparse mmWave Radar Point CloudsCode1
REF-VLM: Triplet-Based Referring Paradigm for Unified Visual DecodingCode1
ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code GenerationCode1
SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground NetworksCode1
Performance-driven Constrained Optimal Auto-Tuner for MPCCode1
Illuminating Darkness: Enhancing Real-world Low-light Scenes with Smartphone ImagesCode1
Frequency-Aware Density Control via Reparameterization for High-Quality Rendering of 3D Gaussian SplattingCode1
Effective and Efficient Masked Image Generation ModelsCode1
Lshan-1.0 Technical ReportCode1
SimROD: A Simple Baseline for Raw Object Detection with Global and Local EnhancementsCode1
COMODO: Cross-Modal Video-to-IMU Distillation for Efficient Egocentric Human Activity RecognitionCode1
RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox TestingCode1
TokenButler: Token Importance is PredictableCode1
V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image GenerationCode1
SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language ModelsCode1
SANDRO: a Robust Solver with a Splitting Strategy for Point Cloud RegistrationCode1
Process-Supervised LLM Recommenders via Flow-guided TuningCode1
A Data-Centric Revisit of Pre-Trained Vision Models for Robot LearningCode1
Dynamic Cross-Modal Feature Interaction Network for Hyperspectral and LiDAR Data ClassificationCode1
SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion ModelsCode1
RefactorBench: Evaluating Stateful Reasoning in Language Agents Through CodeCode1
VisRL: Intention-Driven Visual Perception via Reinforced ReasoningCode1
On the Generalization of Representation Uncertainty in Earth ObservationCode1
Implicit Reasoning in Transformers is Reasoning through ShortcutsCode1
HybridReg: Robust 3D Point Cloud Registration with Hybrid MotionsCode1
GRITHopper: Decomposition-Free Multi-Hop Dense RetrievalCode1
Interactive Medical Image Analysis with Concept-based Similarity ReasoningCode1
Learning Decision Trees as Amortized Structure InferenceCode1
Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and MitigationCode1
Unleashing the Potential of Large Language Models for Text-to-Image Generation through Autoregressive Representation AlignmentCode1
TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion ModelsCode1
AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion ModelsCode1
ZeroSumEval: An Extensible Framework For Scaling LLM Evaluation with Inter-Model CompetitionCode1
Dynamics-Invariant Quadrotor Control using Scale-Aware Deep Reinforcement LearningCode1
Geometric Knowledge-Guided Localized Global Distribution Alignment for Federated LearningCode1
QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video GenerationCode1
One-Step Diffusion Model for Image Motion-DeblurringCode1
Dynamic Updates for Language Adaptation in Visual-Language TrackingCode1
TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long VideosCode1
M^3amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing ClassificationCode1
Online Dense Point Tracking with Streaming MemoryCode1
Show:102550
← PrevPage 345 of 9486Next →