SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 82768300 of 474278 papers

TitleStatusHype
MUVR: A Multi-Modal Untrimmed Video Retrieval Benchmark with Multi-Level Visual CorrespondenceCode0
MoniTor: Exploiting Large Language Models with Instruction for Online Video Anomaly DetectionCode0
Parameter-Free Hypergraph Neural Network for Few-Shot Node ClassificationCode0
Brain-tuning Improves Generalizability and Efficiency of Brain Alignment in Speech ModelsCode0
FrameShield: Adversarially Robust Video Anomaly DetectionCode0
Group Inertial Poser: Multi-Person Pose and Global Translation from Sparse Inertial Sensors and Ultra-Wideband RangingCode0
Online Optimization for Offline Safe Reinforcement LearningCode0
Deep Literature Survey Automation with an Iterative WorkflowCode0
Foundation of Intelligence: Review of Math Word Problems from Human Cognition PerspectiveCode0
A Benchmark for Open-Domain Numerical Fact-Checking Enhanced by Claim DecompositionCode0
Automatic Assessment of Students' Classroom Engagement with Bias Mitigated Multi-task ModelCode0
VL-SAE: Interpreting and Enhancing Vision-Language Alignment with a Unified Concept SetCode0
REx86: A Local Large Language Model for Assisting in x86 Assembly Reverse EngineeringCode0
Radar-Camera Fused Multi-Object Tracking: Online Calibration and Common FeatureCode0
DiNo and RanBu: Lightweight Predictions from Shallow Random ForestsCode0
TAMI: Taming Heterogeneity in Temporal Interactions for Temporal Graph Link PredictionCode0
Frame In-N-Out: Unbounded Controllable Image-to-Video Generation0
From Masks to Worlds: A Hitchhiker's Guide to World Models0
Amplifying Prominent Representations in Multimodal Learning via Variational Dirichlet ProcessCode0
RECALL: REpresentation-aligned Catastrophic-forgetting ALLeviation via Hierarchical Model Merging0
ARC-Encoder: learning compressed text representations for large language modelsCode0
Finding the Sweet Spot: Trading Quality, Cost, and Speed During Inference-Time LLM ReflectionCode0
ALICE-LRI: A General Method for Lossless Range Image Generation for Spinning LiDAR Sensors without Calibration Metadata0
AlphaFlow: Understanding and Improving MeanFlow Models0
Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost0
Show:102550
← PrevPage 332 of 18972Next →