SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 6001-6025 of 177,340 papers

Title | Status | Hype
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis | Code | 2
VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors | Code | 2
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models | Code | 2
Exploring CLIP for Assessing the Look and Feel of Images | Code | 2
Visual Perception by Large Language Model's Weights | Code | 2
MCP-Solver: Integrating Language Models with Constraint Programming Systems | Code | 2
SegNet4D: Efficient Instance-Aware 4D Semantic Segmentation for LiDAR Point Cloud | Code | 2
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation | Code | 2
Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing | Code | 2
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning | Code | 2
CMB: A Comprehensive Medical Benchmark in Chinese | Code | 2
Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy | Code | 2
StructChart: On the Schema, Metric, and Augmentation for Visual Chart Understanding | Code | 2
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making | Code | 2
The P^3 dataset: Pixels, Points and Polygons for Multimodal Building Vectorization | Code | 2
Protein Representation Learning by Geometric Structure Pretraining | Code | 2
SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation | Code | 2
JudgeLM: Fine-tuned Large Language Models are Scalable Judges | Code | 2
DeepInteraction: 3D Object Detection via Modality Interaction | Code | 2
Internal Consistency and Self-Feedback in Large Language Models: A Survey | Code | 2
Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking | Code | 2
PartIR: Composing SPMD Partitioning Strategies for Machine Learning | Code | 2
SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created Through Human-Machine Collaboration | Code | 2
FastVID: Dynamic Density Pruning for Fast Video Large Language Models | Code | 2
Embedding Earth: Self-supervised contrastive pre-training for dense land cover classification | Code | 2
Page 241 of 7094