SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers247,172 code links4,818 tasks

Papers

Showing 601650 of 658356 papers

TitleStatusHype
Dynamic Datasets and Market Environments for Financial Reinforcement LearningCode6
Efficient and Effective Text Encoding for Chinese LLaMA and AlpacaCode6
Visual Instruction TuningCode6
DINOv2: Learning Robust Visual Features without SupervisionCode6
Generative Agents: Interactive Simulacra of Human BehaviorCode6
Pythia: A Suite for Analyzing Large Language Models Across Training and ScalingCode6
A Survey of Large Language ModelsCode6
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model SocietyCode6
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging FaceCode6
Sparks of Artificial General Intelligence: Early experiments with GPT-4Code6
ART: Automatic multi-step reasoning and tool-use for large language modelsCode6
GPT-4 Technical ReportCode6
A Method for Animating Children's Drawings of the Human FigureCode6
The Dormant Neuron Phenomenon in Deep Reinforcement LearningCode6
Nerfstudio: A Modular Framework for Neural Radiance Field DevelopmentCode6
MusicLM: Generating Music From TextCode6
A Watermark for Large Language ModelsCode6
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming LanguagesCode6
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face AnimationCode6
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language ModelsCode6
Versatile Diffusion: Text, Images and Variations All in One Diffusion ModelCode6
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-SpeechCode6
FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement LearningCode6
Automatic Chain of Thought Prompting in Large Language ModelsCode6
GLM-130B: An Open Bilingual Pre-trained ModelCode6
TimesNet: Temporal 2D-Variation Modeling for General Time Series AnalysisCode6
AudioGen: Textually Guided Audio GenerationCode6
Petals: Collaborative Inference and Fine-tuning of Large ModelsCode6
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion ModelsCode6
Synthetic Dataset Generation for Adversarial Machine Learning ResearchCode6
Quantized Training of Gradient Boosting Decision TreesCode6
TensorIR: An Abstraction for Automatic Tensorized Program OptimizationCode6
Towards Robust Blind Face Restoration with Codebook Lookup TransformerCode6
CVNets: High Performance Library for Computer VisionCode6
CogVideo: Large-scale Pretraining for Text-to-Video Generation via TransformersCode6
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-AwarenessCode6
What's Behind the Mask: Understanding Masked Graph Modeling for Graph AutoencodersCode6
PaddleSpeech: An Easy-to-Use All-in-One Speech ToolkitCode6
Semi-Parametric Neural Image SynthesisCode6
Training Compute-Optimal Large Language ModelsCode6
CodeGen: An Open Large Language Model for Code with Multi-Turn Program SynthesisCode6
Long Document Summarization with Top-down and Bottom-up InferenceCode6
Training language models to follow instructions with human feedbackCode6
Pseudo Numerical Methods for Diffusion Models on ManifoldsCode6
Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsCode6
Instant Neural Graphics Primitives with a Multiresolution Hash EncodingCode6
LucidFlux: Caption-Free Photo-Realistic Image Restoration via a Large-Scale Diffusion Transformer5
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length5
DeepEyesV2: Toward Agentic Multimodal Model5
EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery5
Show:102550
← PrevPage 13 of 13168Next →