SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 30013025 of 661570 papers

TitleStatusHype
Relation DETR: Exploring Explicit Position Relation Prior for Object DetectionCode3
Benchmarking Multimodal AutoML for Tabular Data with Text FieldsCode3
NeROIC: Neural Rendering of Objects from Online Image CollectionsCode3
Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons LearnedCode3
Scientific Machine Learning through Physics-Informed Neural Networks: Where we are and What's nextCode3
Point-NeRF: Point-based Neural Radiance FieldsCode3
SymForce: Symbolic Computation and Code Generation for RoboticsCode3
AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D AvatarsCode3
OmniThink: Expanding Knowledge Boundaries in Machine Writing through ThinkingCode3
Robust deep learning based protein sequence design using ProteinMPNNCode3
ViperGPT: Visual Inference via Python Execution for ReasoningCode3
Cautious Optimizers: Improving Training with One Line of CodeCode3
Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge NetworksCode3
Greykite: Deploying Flexible Forecasting at Scale at LinkedInCode3
PSALM: Pixelwise SegmentAtion with Large Multi-Modal ModelCode3
PyTorch Image Quality: Metrics for Image Quality AssessmentCode3
DPA-1: Pretraining of Attention-based Deep Potential Model for Molecular SimulationCode3
Towards Accurate Reconstruction of 3D Scene Shape from A Single Monocular ImageCode3
MapTR: Structured Modeling and Learning for Online Vectorized HD Map ConstructionCode3
MetaDE: Evolving Differential Evolution by Differential EvolutionCode3
DiffDock: Diffusion Steps, Twists, and Turns for Molecular DockingCode3
AutoAct: Automatic Agent Learning from Scratch for QA via Self-PlanningCode3
Probing the 3D Awareness of Visual Foundation ModelsCode3
Cramming: Training a Language Model on a Single GPU in One DayCode3
SongEval: A Benchmark Dataset for Song Aesthetics EvaluationCode3
Show:102550
← PrevPage 121 of 26463Next →