SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 15011550 of 659983 papers

TitleStatusHype
FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient DescentCode4
Panoptic-FlashOcc: An Efficient Baseline to Marry Semantic Occupancy with Panoptic via Instance CenterCode4
Liquid: Language Models are Scalable Multi-modal GeneratorsCode4
Dimension Reduction with Locally Adjusted GraphsCode4
Mastering Diverse Domains through World ModelsCode4
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation LearnersCode4
IBD-PSC: Input-level Backdoor Detection via Parameter-oriented Scaling ConsistencyCode4
Thin-Plate Spline Motion Model for Image AnimationCode4
DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR ImagesCode4
Guaranteed Approximation Bounds for Mixed-Precision Neural OperatorsCode4
DN-DETR: Accelerate DETR Training by Introducing Query DeNoisingCode4
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language ModelsCode4
Graph of Thoughts: Solving Elaborate Problems with Large Language ModelsCode4
Neural Operators with Localized Integral and Differential KernelsCode4
Self-Supervised Pre-Training for Table Structure Recognition TransformerCode4
Dora: Sampling and Benchmarking for 3D Shape Variational Auto-EncodersCode4
Autonomous LLM-driven research from data to human-verifiable research papersCode4
Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by StepCode4
One-Shot Diffusion Mimicker for Handwritten Text GenerationCode4
SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion PlanningCode4
Mean Flows for One-step Generative ModelingCode4
Tag2Text: Guiding Vision-Language Model via Image TaggingCode4
In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs MissCode4
ImgEdit: A Unified Image Editing Dataset and BenchmarkCode4
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo LabellingCode4
Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language ModelsCode4
Image Fusion via Vision-Language ModelCode4
Looking Backward: Streaming Video-to-Video Translation with Feature BanksCode4
Restructuring Vector Quantization with the Rotation TrickCode4
ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning AgentsCode4
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory SynthesisCode4
JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion GenerationCode4
TrueTeacher: Learning Factual Consistency Evaluation with Large Language ModelsCode4
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A SurveyCode4
OpenAgents: An Open Platform for Language Agents in the WildCode4
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesisCode4
A Survey on Diffusion Models for Time Series and Spatio-Temporal DataCode4
OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLMCode4
Factorio Learning EnvironmentCode4
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data GenerationCode4
SimPO: Simple Preference Optimization with a Reference-Free RewardCode4
FedML Parrot: A Scalable Federated Learning System via Heterogeneity-aware Scheduling on Sequential and Hierarchical TrainingCode4
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video GenerationCode4
ParkingE2E: Camera-based End-to-end Parking Network, from Images to PlanningCode4
A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and ChallengesCode4
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different ModalitiesCode4
LESS: Selecting Influential Data for Targeted Instruction TuningCode4
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree SearchCode4
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene SegmentationCode4
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent BehaviorsCode4
Show:102550
← PrevPage 31 of 13200Next →