SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 18511875 of 661570 papers

TitleStatusHype
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision ModelsCode4
LLM Inference Unveiled: Survey and Roofline Model InsightsCode4
RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation GenerationCode4
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPTCode4
Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question AnsweringCode4
Neural Operators with Localized Integral and Differential KernelsCode4
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-stepCode4
Knowledge Fusion of Chat LLMs: A Preliminary Technical ReportCode4
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent LearningCode4
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent SystemCode4
Self-Supervised Pre-Training for Table Structure Recognition TransformerCode4
Cameras as Rays: Pose Estimation via Ray DiffusionCode4
2D Matryoshka Sentence EmbeddingsCode4
TinyLLaVA: A Framework of Small-scale Large Multimodal ModelsCode4
Large Language Models for Data Annotation and Synthesis: A SurveyCode4
Benchmarking Retrieval-Augmented Generation for MedicineCode4
Neural Network DiffusionCode4
FinBen: A Holistic Financial Benchmark for Large Language ModelsCode4
Aria Everyday Activities DatasetCode4
AnyGPT: Unified Multimodal LLM with Discrete Sequence ModelingCode4
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMsCode4
GIM: Learning Generalizable Image Matcher From Internet VideosCode4
In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs MissCode4
Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image SegmentationCode4
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-DistillationCode4
Show:102550
← PrevPage 75 of 26463Next →