SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers

Showing 1076-1100 of 659,983 papers

Title | Status | Hype
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models | Code | 5
Point Transformer V3: Simpler Faster Stronger | Code | 5
GenCast: Diffusion-based ensemble forecasting for medium-range weather | Code | 5
DUSt3R: Geometric 3D Vision Made Easy | Code | 5
AppAgent: Multimodal Agents as Smartphone Users | Code | 5
StarVector: Generating Scalable Vector Graphics Code from Images and Text | Code | 5
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU | Code | 5
MobileSAMv2: Faster Segment Anything to Everything | Code | 5
CogAgent: A Visual Language Model for GUI Agents | Code | 5
Weakly Supervised Detection of Hallucinations in LLM Activations | Code | 5
TaskWeaver: A Code-First Agent Framework | Code | 5
Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine | Code | 5
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following | Code | 5
Human Gaussian Splatting: Real-time Rendering of Animatable Avatars | Code | 5
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI | Code | 5
Structure-Aware Sparse-View X-ray 3D Reconstruction | Code | 5
Instruction-Following Evaluation for Large Language Models | Code | 5
LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language Models | Code | 5
CogVLM: Visual Expert for Pretrained Language Models | Code | 5
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation | Code | 5
Zephyr: Direct Distillation of LM Alignment | Code | 5
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning | Code | 5
Wonder3D: Single Image to 3D using Cross-Domain Diffusion | Code | 5
NeMo Guardrails: A Toolkit for Controllable and Safe LLM Applications with Programmable Rails | Code | 5
Ferret: Refer and Ground Anything Anywhere at Any Granularity | Code | 5
Page 44 of 26400