SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 7426-7450 of 177,340 papers

Title | Status | Hype
Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding | Code | 2
Integrating Reinforcement Learning with Foundation Models for Autonomous Robotics: Methods and Perspectives | Code | 2
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models | Code | 2
Infinite Recommendation Networks: A Data-Centric Approach | Code | 2
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback | Code | 2
Efficient LLM Inference on CPUs | Code | 2
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations | Code | 2
VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning | Code | 2
Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision | Code | 2
Deep Learning Methods for Partial Differential Equations and Related Parameter Identification Problems | Code | 2
Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model | Code | 2
Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning | Code | 2
Critique-out-Loud Reward Models | Code | 2
Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion | Code | 2
LLM-PBE: Assessing Data Privacy in Large Language Models | Code | 2
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them | Code | 2
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain | Code | 2
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis | Code | 2
MAT: Mask-Aware Transformer for Large Hole Image Inpainting | Code | 2
MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment | Code | 2
A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases | Code | 2
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning | Code | 2
MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning | Code | 2
RecDiff: Diffusion Model for Social Recommendation | Code | 2
BEVHeight: A Robust Framework for Vision-based Roadside 3D Object Detection | Code | 2