SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 8151–8175 of 474278 papers

| Title | Status | Hype |
| --- | --- | --- |
| Sequential Multi-Agent Dynamic Algorithm Configuration | Code | 0 |
| JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence | Code | 0 |
| EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT | Code | 0 |
| More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models | Code | 0 |
| PrivacyGuard: A Modular Framework for Privacy Auditing in Machine Learning | Code | 0 |
| Fast-MIA: Efficient and Scalable Membership Inference for LLMs | Code | 0 |
| 3D-RAD: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks | Code | 0 |
| DeepOmni: Towards Seamless and Smart Speech Interaction with Adaptive Modality-Specific MoE | Code | 0 |
| MPX: Mixed Precision Training for JAX | Code | 0 |
| Dynamic Retriever for In-Context Knowledge Editing via Policy Optimization | Code | 0 |
| Language Server CLI Empowers Language Agents with Process Rewards | Code | 0 |
| LoMix: Learnable Weighted Multi-Scale Logits Mixing for Medical Image Segmentation | Code | 0 |
| Beyond Higher Rank: Token-wise Input-Output Projections for Efficient Low-Rank Adaptation | Code | 0 |
| A Video Is Not Worth a Thousand Words | Code | 0 |
| PlanarTrack: A high-quality and challenging benchmark for large-scale planar object tracking | Code | 0 |
| VideoTG-R1: Boosting Video Temporal Grounding via Curriculum Reinforcement Learning on Reflected Boundary Annotations | Code | 0 |
| Multi-Task Surrogate-Assisted Search with Bayesian Competitive Knowledge Transfer for Expensive Optimization | Code | 0 |
| Distilled Protein Backbone Generation | Code | 0 |
| UniMedVL: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis | Code | 0 |
| A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications | Code | 0 |
| Improving the Straight-Through Estimator with Zeroth-Order Information | Code | 0 |
| The ISLab Solution to the Algonauts Challenge 2025: A Multimodal Deep Learning Approach to Brain Response Prediction | Code | 0 |
| On the Faithfulness of Visual Thinking: Measurement and Enhancement | Code | 0 |
| Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation |  | 0 |
| Deflanderization for Game Dialogue: Balancing Character Authenticity with Task Execution in LLM-based NPCs |  | 0 |
Page 327 of 18972