SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1675116800 of 474278 papers

TitleStatusHype
EgoToM: Benchmarking Theory of Mind Reasoning from Egocentric VideosCode1
CoSIL: Software Issue Localization via LLM-Driven Code Repository Graph SearchingCode1
AdaRank: Adaptive Rank Pruning for Enhanced Model MergingCode1
Mitigating Trade-off: Stream and Query-guided Aggregation for Efficient and Effective 3D Occupancy PredictionCode1
Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound ScenesCode1
Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference OptimizationCode1
DIFFER: Disentangling Identity Features via Semantic Cues for Clothes-Changing Person Re-IDCode1
FLIP: Towards Comprehensive and Reliable Evaluation of Federated Prompt LearningCode1
Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose EstimationCode1
Fine-Grained Evaluation of Large Vision-Language Models in Autonomous DrivingCode1
BOOTPLACE: Bootstrapped Object Placement with Detection TransformersCode1
ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented GenerationCode1
FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image RetrievalCode1
InternVL-X: Advancing and Accelerating InternVL Series with Efficient Visual Token CompressionCode1
uLayout: Unified Room Layout Estimation for Perspective and Panoramic ImagesCode1
LOCORE: Image Re-ranking with Long-Context Sequence ModelingCode1
Data-Agnostic Robotic Long-Horizon Manipulation with Vision-Language-Guided Closed-Loop FeedbackCode1
Learning Class Prototypes for Unified Sparse Supervised 3D Object DetectionCode1
Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image CompressionCode1
The MVTec AD 2 Dataset: Advanced Scenarios for Unsupervised Anomaly DetectionCode1
Test-Time Visual In-Context TuningCode1
Omni-AD: Learning to Reconstruct Global and Local Features for Multi-class Anomaly DetectionCode1
BOLT: Boost Large Vision-Language Model Without Training for Long-form Video UnderstandingCode1
R-PRM: Reasoning-Driven Process Reward ModelingCode1
Reinforced Model MergingCode1
ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model MergingCode1
The Procedural Content Generation Benchmark: An Open-source Testbed for Generative Challenges in GamesCode1
Data Poisoning in Deep Learning: A SurveyCode1
Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic SegmentationCode1
A friendly introduction to triangular transportCode1
On Large Multimodal Models as Open-World Image ClassifiersCode1
FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMsCode1
OpenHuEval: Evaluating Large Language Model on Hungarian SpecificsCode1
VADMamba: Exploring State Space Models for Fast Video Anomaly DetectionCode1
UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF AugmentationCode1
Empowering Retrieval-based Conversational Recommendation with Contrasting User PreferencesCode1
DSU-Net:An Improved U-Net Model Based on DINOv2 and SAM2 with Multi-scale Cross-model Feature EnhancementCode1
ThinkEdit: Interpretable Weight Editing to Mitigate Overly Short Thinking in Reasoning ModelsCode1
A Comprehensive Benchmark for RNA 3D Structure-Function ModelingCode1
LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image EditingCode1
Comprehensive segmentation of deep grey nuclei from structural MRI dataCode1
Learning from spatially inhomogenous data: resolution-adaptive convolutions for multiple sclerosis lesion segmentationCode1
BioX-CPath: Biologically-driven Explainable Diagnostics for Multistain IHC Computational PathologyCode1
Devil is in the Uniformity: Exploring Diverse Learners within Transformer for Image RestorationCode1
Fast, Modular, and Differentiable Framework for Machine Learning-Enhanced Molecular SimulationsCode1
SChanger: Change Detection from a Semantic Change and Spatial Consistency PerspectiveCode1
3MDBench: Medical Multimodal Multi-agent Dialogue BenchmarkCode1
EGVD: Event-Guided Video Diffusion Model for Physically Realistic Large-Motion Frame InterpolationCode1
sudo rm -rf agentic_securityCode1
Siformer: Feature-isolated Transformer for Efficient Skeleton-based Sign Language RecognitionCode1
Show:102550
← PrevPage 336 of 9486Next →