SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 95769600 of 177340 papers

TitleStatusHype
Jailbreak Vision Language Models via Bi-Modal Adversarial PromptCode2
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance AssessorCode2
Exploring Orthogonality in Open World Object DetectionCode2
You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary ProjectsCode2
Equinox: neural networks in JAX via callable PyTrees and filtered transformationsCode2
Deep Architectures for Content Moderation and Movie Content RatingCode2
Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal ModelsCode2
Denoising as Adaptation: Noise-Space Domain Adaptation for Image RestorationCode2
Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields ReconstructionCode2
Investigating Tradeoffs in Real-World Video Super-ResolutionCode2
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood SearchCode2
Deep Learning Interviews: Hundreds of fully solved job interview questions from a wide range of key topics in AICode2
POCO: Point Convolution for Surface ReconstructionCode2
MoCapAct: A Multi-Task Dataset for Simulated Humanoid ControlCode2
Speech Denoising in the Waveform Domain with Self-AttentionCode2
Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP FrameworkCode2
Differentiable and Learnable Robot ModelsCode2
OpenDR: An Open Toolkit for Enabling High Performance, Low Footprint Deep Learning for RoboticsCode2
Recovering 3D Human Mesh from Monocular Images: A SurveyCode2
SoftGroup for 3D Instance Segmentation on Point CloudsCode2
Freeform Body Motion Generation from SpeechCode2
Class-incremental Learning for Time Series: Benchmark and EvaluationCode2
MotionCLIP: Exposing Human Motion Generation to CLIP SpaceCode2
Real-time Object Detection for Streaming PerceptionCode2
Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?Code2
Show:102550
← PrevPage 384 of 7094Next →