SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 2001-2025 of 661,570 papers

Title | Status | Hype
Large Models for Time Series and Spatio-Temporal Data: A Survey and Outlook | Code | 4
4D Gaussian Splatting for Real-Time Dynamic Scene Rendering | Code | 4
An Empirical Study of Instruction-tuning Large Language Models in Chinese | Code | 4
3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers | Code | 4
SWE-bench: Can Language Models Resolve Real-World GitHub Issues? | Code | 4
OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text | Code | 4
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation | Code | 4
Retrieval-Generation Synergy Augmented Large Language Models | Code | 4
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference | Code | 4
TimeGPT-1 | Code | 4
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion | Code | 4
Time-LLM: Time Series Forecasting by Reprogramming Large Language Models | Code | 4
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment | Code | 4
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis | Code | 4
Guiding Instruction-based Image Editing via Multimodal Large Language Models | Code | 4
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation | Code | 4
Efficient Post-training Quantization with FP8 Formats | Code | 4
TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning | Code | 4
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models | Code | 4
Safurai 001: New Qualitative Approach for Code LLM Evaluation | Code | 4
Baichuan 2: Open Large-scale Language Models | Code | 4
Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees | Code | 4
ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing | Code | 4
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs | Code | 4
Advancing Parsimonious Deep Learning Weather Prediction using the HEALPix Mesh | Code | 4