SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 9651-9700 of 661,570 papers

Title | Status | Hype
RM-R1: Reward Modeling as Reasoning | Code | 2
OBELiX: A Curated Dataset of Crystal Structures and Experimentally Measured Ionic Conductivities for Lithium Solid-State Electrolytes | Code | 2
pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models | Code | 2
Lemur: Harmonizing Natural Language and Code for Language Agents | Code | 2
FewJoint: A Few-shot Learning Benchmark for Joint Language Understanding | Code | 2
ForesightNav: Learning Scene Imagination for Efficient Exploration | Code | 2
MARFT: Multi-Agent Reinforcement Fine-Tuning | Code | 2
DiSA: Diffusion Step Annealing in Autoregressive Image Generation | Code | 2
Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models | Code | 2
TaskCraft: Automated Generation of Agentic Tasks | Code | 2
Audio synthesizer inversion in symmetric parameter spaces with approximately equivariant flow matching | Code | 2
LeanExplore: A search engine for Lean 4 declarations | Code | 2
Improving spliced alignment by modeling splice sites with deep learning | Code | 2
any4: Learned 4-bit Numeric Representation for LLMs | Code | 2
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task | Code | 2
Session-based Social Recommendation via Dynamic Graph Attention Networks | Code | 2
Bag of Tricks and A Strong Baseline for Deep Person Re-identification | Code | 2
Measuring Coding Challenge Competence With APPS | Code | 2
Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling | Code | 2
Learning To Describe Player Form in The MLB | Code | 2
Learning Efficient Online 3D Bin Packing on Packing Configuration Trees | Code | 2
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing | Code | 2
NeRF in the Dark: High Dynamic Range View Synthesis from Noisy Raw Images | Code | 2
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models | Code | 2
ERS: a novel comprehensive endoscopy image dataset for machine learning, compliant with the MST 3.0 specification | Code | 2
Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology | Code | 2
Cedille: A large autoregressive French language model | Code | 2
Iterative Corresponding Geometry: Fusing Region and Depth for Highly Efficient 3D Tracking of Textureless Objects | Code | 2
ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning | Code | 2
scikit-fda: A Python Package for Functional Data Analysis | Code | 2
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation | Code | 2
Perturbation Augmentation for Fairer NLP | Code | 2
HaGRID - HAnd Gesture Recognition Image Dataset | Code | 2
Unsupervised High-Resolution Portrait Gaze Correction and Animation | Code | 2
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning | Code | 2
Visual Prompting via Image Inpainting | Code | 2
Scalable SoftGroup for 3D Instance Segmentation on Point Clouds | Code | 2
SinDiffusion: Learning a Diffusion Model from a Single Natural Image | Code | 2
Diffusion Probabilistic Models beat GANs on Medical Images | Code | 2
Physics-Informed Neural Networks for Prognostics and Health Management of Lithium-Ion Batteries | Code | 2
Human-in-the-loop Embodied Intelligence with Interactive Simulation Environment for Surgical Robot Learning | Code | 2
Robust Dynamic Radiance Fields | Code | 2
ClimaX: A foundation model for weather and climate | Code | 2
SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections | Code | 2
EdgeYOLO: An Edge-Real-Time Object Detector | Code | 2
DIRE for Diffusion-Generated Image Detection | Code | 2
A Dynamic Multi-Scale Voxel Flow Network for Video Prediction | Code | 2
Leapfrog Diffusion Model for Stochastic Trajectory Prediction | Code | 2
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection | Code | 2
DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment | Code | 2