SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 151200 of 474278 papers

TitleStatusHype
Multimodal Dataset Distillation via Phased Teacher Models0
FSGNet: A Frequency-Aware and Semantic Guidance Network for Infrared Small Target Detection0
RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models0
Pose-Free Omnidirectional Gaussian Splatting for 360-Degree Videos with Consistent Depth Priors0
CardioDiT: Latent Diffusion Transformers for 4D Cardiac MRI Synthesis0
SafeMath: Inference-time Safety improves Math Accuracy0
Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction0
Towards Practical Lossless Neural Compression for LiDAR Point Clouds0
Adaptive Learned Image Compression with Graph Neural Networks0
MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data0
Adaptive Chunking: Optimizing Chunking-Method Selection for RAG0
PMT: Plain Mask Transformer for Image and Video Segmentation with Frozen Vision Encoders0
From Manipulation to Mistrust: Explaining Diverse Micro-Video Misinformation for Robust Debunking in the Wild0
AdaSFormer: Adaptive Serialized Transformers for Monocular Semantic Scene Completion from Indoor Environments0
Humans vs Vision-Language Models: A Unified Measure of Narrative Coherence0
Elucidating the Design Space of Flow Matching for Cellular Microscopy0
Brain-Inspired Multimodal Spiking Neural Network for Image-Text Retrieval0
SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks0
UniICL: Systematizing Unified Multimodal In-context Learning through a Capability-Oriented Taxonomy0
LLaVA-LE: Large Language-and-Vision Assistant for Lunar Exploration0
Scalable Object Relation Encoding for Better 3D Spatial Reasoning in Large Language Models0
Is Geometry Enough? An Evaluation of Landmark-Based Gaze Estimation0
Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration0
Generative Adversarial Perturbations with Cross-paradigm Transferability on Localized Crowd Counting0
A Practical Guide Towards Interpreting Time-Series Deep Clinical Predictive Models: A Reproducibility Study0
Prune as You Generate: Online Rollout Pruning for Faster and Better RLVR0
Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models0
ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing0
Demystifying When Pruning Works via Representation Hierarchies0
ReDiPrune: Relevance-Diversity Pre-Projection Token Pruning for Efficient Multimodal LLMs0
Accurate Point Measurement in 3DGS -- A New Alternative to Traditional Stereoscopic-View Based Measurements0
Decentralized Task Scheduling in Distributed Systems: A Deep Reinforcement Learning Approach0
Light Cones For Vision: Simple Causal Priors For Visual Hierarchy0
FilterGS: Traversal-Free Parallel Filtering and Adaptive Shrinking for Large-Scale LoD 3D Gaussian Splatting0
Understanding the Challenges in Iterative Generative Optimization with LLMs0
Problems with Chinchilla Approach 2: Systematic Biases in IsoFLOP Parabola Fits0
MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding0
Grounding Arabic LLMs in the Doha Historical Dictionary: Retrieval-Augmented Understanding of Quran and Hadith0
Thinking with Tables: Enhancing Multi-Modal Tabular Understanding via Neuro-Symbolic Reasoning0
From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs0
A^3: Towards Advertising Aesthetic Assessment0
Hierarchical Spatial-Temporal Graph-Enhanced Model for Map-Matching0
LaDy: Lagrangian-Dynamic Informed Network for Skeleton-based Action Segmentation via Spatial-Temporal Modulation0
CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare0
Memory-Augmented Vision-Language Agents for Persistent and Semantically Consistent Object Captioning0
Cost-Sensitive Neighborhood Aggregation for Heterophilous Graphs: When Does Per-Edge Routing Help?0
PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks0
Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs0
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience0
CliPPER: Contextual Video-Language Pretraining on Long-form Intraoperative Surgical Procedures for Event Recognition0
Show:102550
← PrevPage 4 of 9486Next →