SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 98019825 of 474278 papers

TitleStatusHype
Integrating Object Interaction Self-Attention and GAN-Based Debiasing for Visual Question AnsweringCode0
LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical EncodingCode0
Understanding-in-Generation: Reinforcing Generative Capability of Unified Model via Infusing Understanding into GenerationCode0
SIM-CoT: Supervised Implicit Chain-of-ThoughtCode0
Mammo-CLIP Dissect: A Framework for Analysing Mammography Concepts in Vision-Language ModelsCode0
Mixture of Thoughts: Learning to Aggregate What Experts Think, Not Just What They SayCode0
AutoOEP -- A Multi-modal Framework for Online Exam ProctoringCode0
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning0
Play by the Type Rules: Inferring Constraints for LLM Functions in Declarative ProgramsCode0
Intervening in Black Box: Concept Bottleneck Model for Enhancing Human Neural Network Mutual UnderstandingCode0
Measuring Harmfulness of Computer-Using Agents0
MAPEX: A Multi-Agent Pipeline for Keyphrase ExtractionCode0
Lavida-O: Elastic Large Masked Diffusion Models for Unified Multimodal Understanding and Generation0
From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute TransitionCode0
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning0
CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition0
FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models0
SynchroRaMa : Lip-Synchronized and Emotion-Aware Talking Face Generation via Multi-Modal Emotion Embedding0
Every Character Counts: From Vulnerability to Defense in Phishing DetectionCode0
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling0
PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model ReasoningCode0
Improving Monte Carlo Tree Search for Symbolic RegressionCode0
Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving0
Charting a Decade of Computational Linguistics in Italy: The CLiC-it Corpus0
PGCLODA: Prompt-Guided Graph Contrastive Learning for Oligopeptide-Infectious Disease Association PredictionCode0
Show:102550
← PrevPage 393 of 18972Next →