SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 2751-2775 of 177340 papers

Title | Status | Hype
Evaluating Large Language Models Trained on Code | Code | 3
Learning Inclusion Matching for Animation Paint Bucket Colorization | Code | 3
Learning to Use Tools via Cooperative and Interactive Agents | Code | 3
WhisperNER: Unified Open Named Entity and Speech Recognition | Code | 3
Theory, Analysis, and Best Practices for Sigmoid Self-Attention | Code | 3
Fairness in Serving Large Language Models | Code | 3
Degradation-Aware Residual-Conditioned Optimal Transport for Unified Image Restoration | Code | 3
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images | Code | 3
Language Models are Few-Shot Learners | Code | 3
Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models | Code | 3
A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning | Code | 3
TCFormer: Visual Recognition via Token Clustering Transformer | Code | 3
TSI-Bench: Benchmarking Time Series Imputation | Code | 3
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint | Code | 3
A Unified Anomaly Synthesis Strategy with Gradient Ascent for Industrial Anomaly Detection and Localization | Code | 3
Seamless Human Motion Composition with Blended Positional Encodings | Code | 3
AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image Deblurring | Code | 3
LocalMamba: Visual State Space Model with Windowed Selective Scan | Code | 3
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models | Code | 3
Will LLMs be Professional at Fund Investment? DeepFund: A Live Arena Perspective | Code | 3
Event-Enhanced Blurry Video Super-Resolution | Code | 3
Generative Data Augmentation using LLMs improves Distributional Robustness in Question Answering | Code | 3
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning | Code | 3
A Survey of Large Language Models in Finance (FinLLMs) | Code | 3
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning | Code | 3
Page 111 of 7094