SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3051–3075 of 661,570 papers

Title | Status | Hype
An Evolved Universal Transformer Memory | Code | 3
FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model | Code | 3
Automatically Interpreting Millions of Features in Large Language Models | Code | 3
3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation | Code | 3
Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical Perception | Code | 3
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio | Code | 3
PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking | Code | 3
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models | Code | 3
Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models | Code | 3
Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies | Code | 3
Latent Action Pretraining from Videos | Code | 3
UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation | Code | 3
LoLCATs: On Low-Rank Linearizing of Large Language Models | Code | 3
Predicting from Strings: Language Model Embeddings for Bayesian Optimization | Code | 3
GIFT-Eval: A Benchmark For General Time Series Forecasting Model Evaluation | Code | 3
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Code | 3
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors | Code | 3
CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation | Code | 3
MMAD: The First-Ever Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection | Code | 3
C-Adapter: Adapting Deep Classifiers for Efficient Conformal Prediction Sets | Code | 3
FlatQuant: Flatness Matters for LLM Quantization | Code | 3
SceneCraft: Layout-Guided 3D Scene Generation | Code | 3
Baichuan-Omni Technical Report | Code | 3
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis | Code | 3
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning | Code | 3
Page 123 of 26,463