SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 84518475 of 474278 papers

TitleStatusHype
Alibaba International E-commerce Product Search Competition DILAB Team Technical ReportCode0
NeuroAda: Activating Each Neuron's Potential for Parameter-Efficient Fine-TuningCode0
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and JudgeCode0
Tree of Agents: Improving Long-Context Capabilities of Large Language Models through Multi-Perspective ReasoningCode0
LAMP-PRo: Label-aware Attention for Multi-label Prediction of DNA- and RNA-binding Proteins using Protein Language ModelsCode0
WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-ReflectionCode0
MEET-Sepsis: Multi-Endogenous-View Enhanced Time-Series Representation Learning for Early Sepsis PredictionCode0
Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object DetectionCode0
BlendCLIP: Bridging Synthetic and Real Domains for Zero-Shot 3D Object Classification with Multimodal PretrainingCode0
A^2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning0
Learning to Interpret Weight Differences in Language Models0
ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning0
Cross-Modal Scene Semantic Alignment for Image Complexity AssessmentCode0
Adamas: Hadamard Sparse Attention for Efficient Long-Context Inference0
Program Synthesis via Test-Time TransductionCode0
Distilling LLM Prior to Flow Model for Generalizable Agent's Imagination in Object Goal NavigationCode0
IASC: Interactive Agentic System for ConLangsCode0
Can Large Language Models Master Complex Card Games?Code0
Towards Agentic Self-Learning LLMs in Search EnvironmentCode0
DeepSeek-OCR: Contexts Optical CompressionCode0
Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language ModelsCode0
Ranking-based Preference Optimization for Diffusion Models from Implicit User FeedbackCode0
Towards Unsupervised Open-Set Graph Domain Adaptation via Dual ReprogrammingCode0
SpecExit: Accelerating Large Reasoning Model via Speculative ExitCode0
FlexQuant: A Flexible and Efficient Dynamic Precision Switching Framework for LLM QuantizationCode0
Show:102550
← PrevPage 339 of 18972Next →