SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 96519675 of 474278 papers

TitleStatusHype
Calibration Meets Reality: Making Machine Learning Predictions TrustworthyCode0
Timber: Training-free Instruct Model Refining with Base via Effective Rank0
Synthetic-to-Real Camouflaged Object DetectionCode0
Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented GenerationCode0
Towards Efficient CoT Distillation: Self-Guided Rationale Selector for Better Performance with Fewer RationalesCode0
RobuQ: Pushing DiTs to W1.58A2 via Robust Activation QuantizationCode0
VAMamba: An Efficient Visual Adaptive Mamba for Image RestorationCode0
BioArtlas: Computational Clustering of Multi-Dimensional Complexity in BioartCode0
CoDA: Coding LM via Diffusion Adaptation0
Learning without Global Backpropagation via Synergistic Information DistillationCode0
An Investigation into the Performance of Non-Contrastive Self-Supervised Learning Methods for Network Intrusion DetectionCode0
Quant-dLLM: Post-Training Extreme Low-Bit Quantization for Diffusion Large Language ModelsCode0
SDQ-LLM: Sigma-Delta Quantization for 1-bit LLMs of any sizeCode0
Perceptual Influence: Improving the Perceptual Loss Design for Low-Dose CT EnhancementCode0
Retrieval-Constrained Decoding Reveals Underestimated Parametric Knowledge in Language ModelsCode0
FMC-DETR: Frequency-Decoupled Multi-Domain Coordination for Aerial-View Object DetectionCode0
Angles Don't Lie: Unlocking Training-Efficient RL Through the Model's Own SignalsCode0
If We May De-Presuppose: Robustly Verifying Claims through Presupposition-Free Question Decomposition0
BenchRL-QAS: Benchmarking reinforcement learning algorithms for quantum architecture search0
Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning0
WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance0
Desensitizing for Improving Corruption Robustness in Point Cloud Classification through Adversarial TrainingCode0
Follow-Your-Preference: Towards Preference-Aligned Image InpaintingCode0
Detecting Corpus-Level Knowledge Inconsistencies in Wikipedia with Large Language Models0
Online Dynamic Goal Recognition in Gym EnvironmentsCode0
Show:102550
← PrevPage 387 of 18972Next →