SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 8301-8350 of 661,570 papers

Title | Status | Hype
Enhancing Debunking Effectiveness through LLM-based Personality Adaptation |  | 0
Efficiently Aligning Draft Models via Parameter- and Data-Efficient Adaptation | Code | 0
RESBev: Making BEV Perception More Robust |  | 0
Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization |  | 0
Memory-Guided View Refinement for Dynamic Human-in-the-loop EQA |  | 0
GeoSolver: Scaling Test-Time Reasoning in Remote Sensing with Fine-Grained Process Supervision |  | 0
GeoAlignCLIP: Enhancing Fine-Grained Vision-Language Alignment in Remote Sensing via Multi-Granular Consistency Learning |  | 0
Routing without Forgetting |  | 0
Build, Borrow, or Just Fine-Tune? A Political Scientist's Guide to Choosing NLP Models |  | 0
Symbolic Discovery of Stochastic Differential Equations with Genetic Programming |  | 0
Learning the Hierarchical Organization in Brain Network for Brain Disorder Diagnosis |  | 0
Surgical Repair of Collapsed Attention Heads in ALiBi Transformers |  | 0
Decoder-Free Distillation for Quantized Image Restoration |  | 0
Grounding Synthetic Data Generation With Vision and Language Models | Code | 0
PRECEPT: Planning Resilience via Experience, Context Engineering & Probing Trajectories A Unified Framework for Test-Time Adaptation with Compositional Rule Learning and Pareto-Guided Prompt Evolution |  | 0
Multi-DNN Inference of Sparse Models on Edge SoCs |  | 0
Evolution of Photonic Quantum Machine Learning under Noise |  | 0
Well Log-Guided Synthesis of Subsurface Images from Sparse Petrography Data Using cGANs |  | 0
OTPL-VIO: Robust Visual-Inertial Odometry with Optimal Transport Line Association and Adaptive Uncertainty |  | 0
Understanding the Interplay between LLMs' Utilisation of Parametric and Contextual Knowledge: A keynote at ECIR 2025 |  | 0
EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages |  | 0
When to Lock Attention: Training-Free KV Control in Video Diffusion |  | 0
On Catastrophic Forgetting in Low-Rank Decomposition-Based Parameter-Efficient Fine-Tuning |  | 0
DiffWind: Physics-Informed Differentiable Modeling of Wind-Driven Object Dynamics |  | 0
Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning |  | 0
Automatic Cardiac Risk Management Classification using large-context Electronic Patients Health Records |  | 0
ESAinsTOD: A Unified End-to-End Schema-Aware Instruction-Tuning Framework for Task-Oriented Dialog Modeling |  | 0
Physics-informed neural operator for predictive parametric phase-field modelling |  | 0
Mousse: Rectifying the Geometry of Muon with Curvature-Aware Preconditioning |  | 0
TriFusion-SR: Joint Tri-Modal Medical Image Fusion and SR |  | 0
ProGS: Towards Progressive Coding for 3D Gaussian Splatting |  | 0
OOD-MMSafe: Advancing MLLM Safety from Harmful Intent to Hidden Consequences |  | 0
Does the Question Really Matter? Training-Free Data Selection for Vision-Language SFT |  | 0
AutoAgent: Evolving Cognition and Elastic Memory Orchestration for Adaptive Agents |  | 0
RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation |  | 0
Information Theoretic Bayesian Optimization over the Probability Simplex |  | 0
A Multi-Prototype-Guided Federated Knowledge Distillation Approach in AI-RAN Enabled Multi-Access Edge Computing System |  | 0
Let's Reward Step-by-Step: Step-Aware Contrastive Alignment for Vision-Language Navigation in Continuous Environments |  | 0
Upper Generalization Bounds for Neural Oscillators |  | 0
LAP: A Language-Aware Planning Model For Procedure Planning In Instructional Videos |  | 0
Beyond Fine-Tuning: Robust Food Entity Linking under Ontology Drift with FoodOntoRAG |  | 0
LogoDiffuser: Training-Free Multilingual Logo Generation and Stylization via Letter-Aware Attention Control |  | 0
World2Mind: Cognition Toolkit for Allocentric Spatial Reasoning in Foundation Models |  | 0
First Estimation of Model Parameters for Neutrino-Induced Nucleon Knockout Using Simulation-Based Inference |  | 0
What is Missing? Explaining Neurons Activated by Absent Concepts |  | 0
A Hybrid Quantum-Classical Framework for Financial Volatility Forecasting Based on Quantum Circuit Born Machines |  | 0
RA-SSU: Towards Fine-Grained Audio-Visual Learning with Region-Aware Sound Source Understanding |  | 0
Correction of Transformer-Based Models with Smoothing Pseudo-Projector |  | 0
ConfCtrl: Enabling Precise Camera Control in Video Diffusion via Confidence-Aware Interpolation |  | 0
One-Eval: An Agentic System for Automated and Traceable LLM Evaluation | Code | 0
Page 167 of 13,232