SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 63016350 of 661570 papers

TitleStatusHype
GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian SplattingCode2
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive DiffusionCode2
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language ModelsCode2
Video Polyp Segmentation: A Deep Learning PerspectiveCode2
P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point CloudsCode2
PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language ModelsCode2
mdCATH: A Large-Scale MD Dataset for Data-Driven Computational BiophysicsCode2
BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task TuningCode2
Melting Pot 2.0Code2
A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online AdaptationCode2
SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image InterpretationCode2
SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short DramaCode2
GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual LocalizationCode2
TraDiffusion: Trajectory-Based Training-Free Image GenerationCode2
Mephisto: A Framework for Portable, Reproducible, and Iterative CrowdsourcingCode2
Attention-based Deep Multiple Instance LearningCode2
Interacting Attention Graph for Single Image Two-Hand ReconstructionCode2
Frequency-domain MLPs are More Effective Learners in Time Series ForecastingCode2
REALY: Rethinking the Evaluation of 3D Face ReconstructionCode2
SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-TrainingCode2
Does Image Anonymization Impact Computer Vision Training?Code2
NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving ScenarioCode2
In-Context Imitation Learning via Next-Token PredictionCode2
A Hybrid Transformer-Mamba Network for Single Image DerainingCode2
Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning IncentivizationCode2
LViT: Language meets Vision Transformer in Medical Image SegmentationCode2
gRNAde: Geometric Deep Learning for 3D RNA inverse designCode2
Uni3D: Exploring Unified 3D Representation at ScaleCode2
OstQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution FittingCode2
VCP-CLIP: A visual context prompting model for zero-shot anomaly segmentationCode2
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for ReasoningCode2
Cross-Prediction-Powered InferenceCode2
LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task AutomationCode2
FedCLIP: Fast Generalization and Personalization for CLIP in Federated LearningCode2
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head SynthesisCode2
VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned PriorsCode2
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language ModelsCode2
Exploring CLIP for Assessing the Look and Feel of ImagesCode2
Visual Perception by Large Language Model's WeightsCode2
MCP-Solver: Integrating Language Models with Constraint Programming SystemsCode2
SegNet4D: Efficient Instance-Aware 4D Semantic Segmentation for LiDAR Point CloudCode2
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose EstimationCode2
Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual EditingCode2
Sheared LLaMA: Accelerating Language Model Pre-training via Structured PruningCode2
CMB: A Comprehensive Medical Benchmark in ChineseCode2
Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D PolicyCode2
StructChart: On the Schema, Metric, and Augmentation for Visual Chart UnderstandingCode2
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision MakingCode2
The P^3 dataset: Pixels, Points and Polygons for Multimodal Building VectorizationCode2
Protein Representation Learning by Geometric Structure PretrainingCode2
Show:102550
← PrevPage 127 of 13232Next →