SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 28512900 of 659983 papers

TitleStatusHype
Faithful Logical Reasoning via Symbolic Chain-of-ThoughtCode3
Multimodal Table UnderstandingCode3
KV-Edit: Training-Free Image Editing for Precise Background PreservationCode3
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous DrivingCode3
VideoGen-Eval: Agent-based System for Video Generation EvaluationCode3
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech SynthesisCode3
JAFAR: Jack up Any Feature at Any ResolutionCode3
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video GenerationCode3
GENERator: A Long-Context Generative Genomic Foundation ModelCode3
EVEv2: Improved Baselines for Encoder-Free Vision-Language ModelsCode3
SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language ModelCode3
Half-Inverse Gradients for Physical Deep LearningCode3
pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D ReconstructionCode3
OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language ModelsCode3
DisCo: Disentangled Control for Realistic Human Dance GenerationCode3
^2DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network PotentialsCode3
DARWIN 1.5: Large Language Models as Materials Science Adapted LearnersCode3
NeuralOM: Neural Ocean Model for Subseasonal-to-Seasonal SimulationCode3
A Comprehensive Survey on Segment Anything Model for Vision and BeyondCode3
HLOB -- Information Persistence and Structure in Limit Order BooksCode3
Panacea+: Panoramic and Controllable Video Generation for Autonomous DrivingCode3
Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and RoadmapCode3
Mini-Splatting: Representing Scenes with a Constrained Number of GaussiansCode3
Rectified Diffusion: Straightness Is Not Your Need in Rectified FlowCode3
Flash-VStream: Memory-Based Real-Time Understanding for Long Video StreamsCode3
Opportunities and Risks of LLMs for Scalable Deliberation with PolisCode3
RePlay: a Recommendation Framework for Experimentation and Production UseCode3
Deep Reinforcement LearningCode3
SAM3D: Segment Anything in 3D ScenesCode3
MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse ViewsCode3
Generalizing Motion Planners with Mixture of Experts for Autonomous DrivingCode3
Learning to Reason without External RewardsCode3
XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAMCode3
UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion SegmentationCode3
Rank-R1: Enhancing Reasoning in LLM-based Document Rerankers via Reinforcement LearningCode3
Pipeline Gradient-based Model Training on Analog In-memory AcceleratorsCode3
General Geospatial Inference with a Population Dynamics Foundation ModelCode3
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth FusionCode3
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video UnderstandingCode3
PuzzleAvatar: Assembling 3D Avatars from Personal AlbumsCode3
GES: Generalized Exponential Splatting for Efficient Radiance Field RenderingCode3
Stronger Fewer & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic SegmentationCode3
Learning Heterogeneous Mixture of Scene Experts for Large-scale Neural Radiance FieldsCode3
Self-Refine: Iterative Refinement with Self-FeedbackCode3
Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation ModelCode3
MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent SystemsCode3
LEAP-VO: Long-term Effective Any Point Tracking for Visual OdometryCode3
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive SurveyCode3
A Short Review and Evaluation of SAM2's Performance in 3D CT Image SegmentationCode3
Score-Guided Diffusion for 3D Human RecoveryCode3
Show:102550
← PrevPage 58 of 13200Next →