SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 14151–14200 of 474278 papers

Title | Status | Hype
DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory | Code | 2
MOMENT: A Family of Open Time-series Foundation Models | Code | 2
OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding | Code | 2
ConFIG: Towards Conflict-free Training of Physics Informed Neural Networks | Code | 2
Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection | Code | 2
CODA: Repurposing Continuous VAEs for Discrete Tokenization | Code | 2
SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity | Code | 2
Optimisation & Generalisation in Networks of Neurons | Code | 2
Leveraging medical Twitter to build a visual–language foundation model for pathology AI | Code | 2
VideoAgent: Long-form Video Understanding with Large Language Model as Agent | Code | 2
Instant Volumetric Head Avatars | Code | 2
Efficient and Effective SPARQL Autocompletion on Very Large Knowledge Graphs | Code | 2
Proximal Policy Optimization Algorithms | Code | 2
CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field | Code | 2
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy | Code | 2
Source-Filter-Based Generative Adversarial Neural Vocoder for High Fidelity Speech Synthesis | Code | 2
Directly Fine-Tuning Diffusion Models on Differentiable Rewards | Code | 2
A Dataset and Explorer for 3D Signed Distance Functions | Code | 2
U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation | Code | 2
Semantic Photo Manipulation with a Generative Image Prior | Code | 2
VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition | Code | 2
Part-aware Shape Generation with Latent 3D Diffusion of Neural Voxel Fields | Code | 2
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence | Code | 2
The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks | Code | 2
Freeing Hybrid Distributed AI Training Configuration | Code | 2
Omnipose: a high-precision, morphology-independent solution for bacterial cell segmentation | Code | 2
Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation | Code | 2
CyberMetric: A Benchmark Dataset based on Retrieval-Augmented Generation for Evaluating LLMs in Cybersecurity Knowledge | Code | 2
Enhancing Video Super-Resolution via Implicit Resampling-based Alignment | Code | 2
Algorithm Evolution Using Large Language Model | Code | 2
Efficient Neural Network Analysis with Sum-of-Infeasibilities | Code | 2
SyncTweedies: A General Generative Framework Based on Synchronized Diffusions | Code | 2
Visual Programming: Compositional visual reasoning without training | Code | 2
CC-3DT: Panoramic 3D Object Tracking via Cross-Camera Fusion | Code | 2
Interactive Differentiable Simulation | Code | 2
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour | Code | 2
Single-View View Synthesis in the Wild with Learned Adaptive Multiplane Images | Code | 2
Delivering Document Conversion as a Cloud Service with High Throughput and Responsiveness | Code | 2
SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts Commentaries | Code | 2
T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation | Code | 2
Controlling Text-to-Image Diffusion by Orthogonal Finetuning | Code | 2
DreamColour: Controllable Video Colour Editing without Training | Code | 2
PowerSimulationsDynamics.jl -- An Open Source Modeling Package for Modern Power Systems with Inverter-Based Resources | Code | 2
LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet | Code | 2
DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model | Code | 2
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs | Code | 2
Distillation Enhanced Generative Retrieval | Code | 2
Any-point Trajectory Modeling for Policy Learning | Code | 2
GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer | Code | 2
Teeth3DS+: An Extended Benchmark for Intraoral 3D Scans Analysis | Code | 2
Page 284 of 9486