SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 59516000 of 661570 papers

TitleStatusHype
Universal Narrative Model: an Author-centric Storytelling Framework for Generative AICode2
MAS-GPT: Training LLMs to Build LLM-based Multi-Agent SystemsCode2
BANet: Bilateral Aggregation Network for Mobile Stereo MatchingCode2
BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop DrivingCode2
Golden Cudgel Network for Real-Time Semantic SegmentationCode2
BHViT: Binarized Hybrid Vision TransformerCode2
WMNav: Integrating Vision-Language Models into World Models for Object Goal NavigationCode2
ZAPBench: A Benchmark for Whole-Brain Activity Prediction in ZebrafishCode2
Technique Inference Engine: A Recommender Model to Support Cyber Threat HuntingCode2
MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical EnvironmentsCode2
MPO: Boosting LLM Agents with Meta Plan OptimizationCode2
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal ModelsCode2
LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent ApplicationsCode2
h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-TransformCode2
Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMsCode2
Composed Multi-modal Retrieval: A Survey of Approaches and ApplicationsCode2
AutoLUT: LUT-Based Image Super-Resolution with Automatic Sampling and Adaptive Residual LearningCode2
DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-ResolutionCode2
An Approach for Air Drawing Using Background Subtraction and Contour ExtractionCode2
Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model LearningCode2
Large-Scale Data Selection for Instruction TuningCode2
Interactive Debugging and Steering of Multi-Agent AI SystemsCode2
Retrieval-Augmented Perception: High-Resolution Image Perception Meets Visual RAGCode2
Forgetting Transformer: Softmax Attention with a Forget GateCode2
Liger: Linearizing Large Language Models to Gated Recurrent StructuresCode2
Beyond Matryoshka: Revisiting Sparse Coding for Adaptive RepresentationCode2
Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN DiscriminatorCode2
FlowDec: A flow-based full-band general audio codec with high perceptual qualityCode2
MI-DETR: An Object Detection Model with Multi-time Inquiries MechanismCode2
Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus AreasCode2
OptMetaOpenFOAM: Large Language Model Driven Chain of Thought for Sensitivity Analysis and Parameter Optimization based on CFDCode2
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-CheckingCode2
Patch-wise Structural Loss for Time Series ForecastingCode2
From Poses to Identity: Training-Free Person Re-Identification via Feature CentralizationCode2
Predictive Data Selection: The Data That Predicts Is the Data That TeachesCode2
Geodesic Diffusion Models for Medical Image-to-Image GenerationCode2
Streaming Video Question-Answering with In-context Video KV-Cache RetrievalCode2
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech EnhancementCode2
Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction TuningCode2
Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and QualityCode2
UL-UNAS: Ultra-Lightweight U-Nets for Real-Time Speech Enhancement via Network Architecture SearchCode2
Qilin: A Multimodal Information Retrieval Dataset with APP-level User SessionsCode2
PodAgent: A Comprehensive Framework for Podcast GenerationCode2
Adaptive Rectangular Convolution for Remote Sensing PansharpeningCode2
What Makes a Good Diffusion Planner for Decision Making?Code2
Remasking Discrete Diffusion Models with Inference-Time ScalingCode2
BodyGen: Advancing Towards Efficient Embodiment Co-DesignCode2
UniNet: A Contrastive Learning-guided Unified Framework with Feature Selection for Anomaly DetectionCode2
SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation ModelsCode2
Neural Posterior Estimation for Cataloging Astronomical Images with Spatially Varying Backgrounds and Point Spread FunctionsCode2
Show:102550
← PrevPage 120 of 13232Next →