SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1451–1500 of 659,983 papers

Title | Status | Hype
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning | Code | 4
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling | Code | 4
Training Software Engineering Agents and Verifiers with SWE-Gym | Code | 4
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization | Code | 4
MINIMA: Modality Invariant Image Matching | Code | 4
The Thousand Brains Project: A New Paradigm for Sensorimotor Intelligence | Code | 4
Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders | Code | 4
LLM4AD: A Platform for Algorithm Design with Large Language Model | Code | 4
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving | Code | 4
Dimension Reduction with Locally Adjusted Graphs | Code | 4
Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration | Code | 4
Autoregressive Video Generation without Vector Quantization | Code | 4
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces | Code | 4
SocialED: A Python Library for Social Event Detection | Code | 4
Neural general circulation models optimized to predict satellite-based precipitation observations | Code | 4
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator | Code | 4
DisCo-DSO: Coupling Discrete and Continuous Optimization for Efficient Generative Design in Hybrid Spaces | Code | 4
Towards Effective, Efficient and Unsupervised Social Event Detection in the Hyperbolic Space | Code | 4
Hidden Biases of End-to-End Driving Datasets | Code | 4
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders | Code | 4
MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning | Code | 4
Video Seal: Open and Efficient Video Watermarking | Code | 4
FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models | Code | 4
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints | Code | 4
SAT: Dynamic Spatial Aptitude Training for Multimodal Language Models | Code | 4
MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds | Code | 4
Gated Delta Networks: Improving Mamba2 with Delta Rule | Code | 4
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale | Code | 4
Fully Open Source Moxin-7B Technical Report | Code | 4
LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods | Code | 4
UniScene: Unified Occupancy-centric Driving Scene Generation | Code | 4
Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering | Code | 4
Liquid: Language Models are Scalable Multi-modal Generators | Code | 4
GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction | Code | 4
Generalized Recorrupted-to-Recorrupted: Self-Supervised Learning Beyond Gaussian Noise | Code | 4
Best-of-N Jailbreaking | Code | 4
Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach | Code | 4
Weighted-Reward Preference Optimization for Implicit Model Fusion | Code | 4
Navigation World Models | Code | 4
Taming Scalable Visual Tokenizer for Autoregressive Image Generation | Code | 4
HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture Recognition | Code | 4
FullStack Bench: Evaluating LLMs as Full Stack Coders | Code | 4
FLARE: Toward Universal Dataset Purification against Backdoor Attacks | Code | 4
Multimodal Whole Slide Foundation Model for Pathology | Code | 4
AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones | Code | 4
sbi reloaded: a toolkit for simulation-based inference workflows | Code | 4
Identity-Preserving Text-to-Video Generation by Frequency Decomposition | Code | 4
One Diffusion to Generate Them All | Code | 4
Parameter Efficient Instruction Tuning: An Empirical Study | Code | 4
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge | Code | 4