SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 75267550 of 474278 papers

TitleStatusHype
AlphaResearch: Accelerating New Algorithm Discovery with Language Models0
Can LLM-Generated Textual Explanations Enhance Model Classification Performance? An Empirical Study0
DynaAct: Large Language Model Reasoning with Dynamic Action SpacesCode0
STaR-Bets: Sequential Target-Recalculating Bets for Tighter Confidence IntervalsCode0
Multi-modal Deepfake Detection and Localization with FPN-TransformerCode0
Re^2MaP: Macro Placement by Recursively Prototyping and Packing Tree-based RelocatingCode0
Radar-APLANC: Unsupervised Radar-based Heartbeat Sensing via Augmented Pseudo-Label and Noise ContrastCode0
Text-based Aerial-Ground Person RetrievalCode0
HipKittens: Fast and Furious AMD KernelsCode0
Multi-Granularity Mutual Refinement Network for Zero-Shot LearningCode0
SynthTools: A Framework for Scaling Synthetic Tools for Agent DevelopmentCode0
From Confusion to Clarity: ProtoScore -- A Framework for Evaluating Prototype-Based XAICode0
Benevolent Dictators? On LLM Agent Behavior in Dictator GamesCode0
Vector Symbolic Algebras for the Abstraction and Reasoning CorpusCode0
TIGER-MARL: Enhancing Multi-Agent Reinforcement Learning with Temporal Information through Graph-based Embeddings and RepresentationsCode0
The Impact of Longitudinal Mammogram Alignment on Breast Cancer Risk AssessmentCode0
SeFA-Policy: Fast and Accurate Visuomotor Policy Learning with Selective Flow AlignmentCode0
LayerEdit: Disentangled Multi-Object Editing via Conflict-Aware Multi-Layer LearningCode0
Sampling 3D Molecular Conformers with Diffusion TransformersCode0
Evolutionary Profiles for Protein Fitness PredictionCode0
Some theoretical improvements on the tightness of PAC-Bayes risk certificates for neural networksCode0
Perceptual Quality Assessment of 3D Gaussian Splatting: A Subjective Dataset and Prediction MetricCode0
Boomda: Balanced Multi-objective Optimization for Multimodal Domain AdaptationCode0
VLMDiff: Leveraging Vision-Language Models for Multi-Class Anomaly Detection with DiffusionCode0
Hierarchical Direction Perception via Atomic Dot-Product Operators for Rotation-Invariant Point Clouds LearningCode0
Show:102550
← PrevPage 302 of 18972Next →