SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 4576-4600 of 177340 papers

Title | Status | Hype
Dataset Distillation with Neural Characteristic Function: A Minmax Perspective | Code | 3
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation | Code | 3
MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining | Code | 3
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning | Code | 3
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods | Code | 3
AGD: an Auto-switchable Optimizer using Stepwise Gradient Difference for Preconditioning Matrix | Code | 3
Language-based Audio Moment Retrieval | Code | 3
Unified Data Management and Comprehensive Performance Evaluation for Urban Spatial-Temporal Prediction [Experiment, Analysis & Benchmark] | Code | 3
A Chinese Dataset for Evaluating the Safeguards in Large Language Models | Code | 3
Multi-Modality Representation Learning for Antibody-Antigen Interactions Prediction | Code | 3
Improving Alignment and Robustness with Circuit Breakers | Code | 3
The OpenLAM Challenges | Code | 3
TRACE: 5D Temporal Regression of Avatars with Dynamic Cameras in 3D Environments | Code | 3
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and Planning | Code | 3
InstructIE: A Bilingual Instruction-based Information Extraction Dataset | Code | 3
Reservoir History Matching of the Norne field with generative exotic priors and a coupled Mixture of Experts -- Physics Informed Neural Operator Forward Model | Code | 3
FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally | Code | 3
Vision-based 3D occupancy prediction in autonomous driving: a review and outlook | Code | 3
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models | Code | 3
PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360° | Code | 3
Drone Data Analytics for Measuring Traffic Metrics at Intersections in High-Density Areas | Code | 3
A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications | Code | 3
UrbanGPT: Spatio-Temporal Large Language Models | Code | 3
Direct Retrieval-augmented Optimization: Synergizing Knowledge Selection and Language Models | Code | 3
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation | Code | 3