SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 9001-9025 of 177340 papers

Title | Status | Hype
BearLLM: A Prior Knowledge-Enhanced Bearing Health Management Framework with Unified Vibration Signal Representation | Code | 2
Scalable Autoregressive Image Generation with Mamba | Code | 2
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning | Code | 2
MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents | Code | 2
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings | Code | 2
Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation | Code | 2
Stochastic Parameter Decomposition | Code | 2
Enhancing Privacy in Federated Learning: Secure Aggregation for Real-World Healthcare Applications | Code | 2
Boosting Vision-Language Models for Histopathology Classification: Predict all at once | Code | 2
FunctionChat-Bench: Comprehensive Evaluation of Language Models' Generative Capabilities in Korean Tool-use Dialogs | Code | 2
Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression | Code | 2
Towards a Unified View of Preference Learning for Large Language Models: A Survey | Code | 2
UniDet3D: Multi-dataset Indoor 3D Object Detection | Code | 2
A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement | Code | 2
Assessing SPARQL capabilities of Large Language Models | Code | 2
DiffusionPen: Towards Controlling the Style of Handwritten Text Generation | Code | 2
ThermalGaussian: Thermal 3D Gaussian Splatting | Code | 2
What is the Relationship between Tensor Factorizations and Circuits (and How Can We Exploit it)? | Code | 2
Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective | Code | 2
EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance | Code | 2
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis | Code | 2
Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models | Code | 2
Large Language Models are Strong Audio-Visual Speech Recognition Learners | Code | 2
HSIGene: A Foundation Model For Hyperspectral Image Generation | Code | 2
Small Language Models: Survey, Measurements, and Insights | Code | 2
Page 361 of 7094