SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 36013650 of 177340 papers

TitleStatusHype
Olympus: A Universal Task Router for Computer Vision TasksCode3
A guide to convolution arithmetic for deep learningCode3
ARC Prize 2024: Technical ReportCode3
Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity RepresentationCode3
LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR SynthesisCode3
Defeating Prompt Injections by DesignCode3
SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masksCode3
MeshXL: Neural Coordinate Field for Generative 3D Foundation ModelsCode3
Faithful Logical Reasoning via Symbolic Chain-of-ThoughtCode3
Multimodal Table UnderstandingCode3
KV-Edit: Training-Free Image Editing for Precise Background PreservationCode3
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous DrivingCode3
VideoGen-Eval: Agent-based System for Video Generation EvaluationCode3
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech SynthesisCode3
JAFAR: Jack up Any Feature at Any ResolutionCode3
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video GenerationCode3
GENERator: A Long-Context Generative Genomic Foundation ModelCode3
EVEv2: Improved Baselines for Encoder-Free Vision-Language ModelsCode3
SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language ModelCode3
Half-Inverse Gradients for Physical Deep LearningCode3
pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D ReconstructionCode3
OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language ModelsCode3
DisCo: Disentangled Control for Realistic Human Dance GenerationCode3
^2DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network PotentialsCode3
DARWIN 1.5: Large Language Models as Materials Science Adapted LearnersCode3
NeuralOM: Neural Ocean Model for Subseasonal-to-Seasonal SimulationCode3
A Comprehensive Survey on Segment Anything Model for Vision and BeyondCode3
HLOB -- Information Persistence and Structure in Limit Order BooksCode3
Panacea+: Panoramic and Controllable Video Generation for Autonomous DrivingCode3
Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and RoadmapCode3
Mini-Splatting: Representing Scenes with a Constrained Number of GaussiansCode3
Rectified Diffusion: Straightness Is Not Your Need in Rectified FlowCode3
Flash-VStream: Memory-Based Real-Time Understanding for Long Video StreamsCode3
Opportunities and Risks of LLMs for Scalable Deliberation with PolisCode3
RePlay: a Recommendation Framework for Experimentation and Production UseCode3
Deep Reinforcement LearningCode3
SAM3D: Segment Anything in 3D ScenesCode3
MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse ViewsCode3
Generalizing Motion Planners with Mixture of Experts for Autonomous DrivingCode3
Learning to Reason without External RewardsCode3
XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAMCode3
UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion SegmentationCode3
Rank-R1: Enhancing Reasoning in LLM-based Document Rerankers via Reinforcement LearningCode3
Pipeline Gradient-based Model Training on Analog In-memory AcceleratorsCode3
General Geospatial Inference with a Population Dynamics Foundation ModelCode3
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth FusionCode3
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video UnderstandingCode3
PuzzleAvatar: Assembling 3D Avatars from Personal AlbumsCode3
GES: Generalized Exponential Splatting for Efficient Radiance Field RenderingCode3
Stronger Fewer & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic SegmentationCode3
Show:102550
← PrevPage 73 of 3547Next →