SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3751–3800 of 177,340 papers

Title | Status | Hype
Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2 | Code | 3
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors | Code | 3
ERNIE 2.0: A Continual Pre-training Framework for Language Understanding | Code | 3
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection | Code | 3
SINERGYM -- A virtual testbed for building energy optimization with Reinforcement Learning | Code | 3
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities | Code | 3
Video ReCap: Recursive Captioning of Hour-Long Videos | Code | 3
Magnitude-aware Probabilistic Speaker Embeddings | Code | 3
RobustSAM: Segment Anything Robustly on Degraded Images | Code | 3
Centaur: a foundation model of human cognition | Code | 3
ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics | Code | 3
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation | Code | 3
DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors | Code | 3
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks | Code | 3
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation | Code | 3
Model-based Asynchronous Hyperparameter and Neural Architecture Search | Code | 3
ContextCite: Attributing Model Generation to Context | Code | 3
Evaluation of the MACE Force Field Architecture: from Medicinal Chemistry to Materials Science | Code | 3
Language Model Inversion | Code | 3
Evalverse: Unified and Accessible Library for Large Language Model Evaluation | Code | 3
DBA-Fusion: Tightly Integrating Deep Dense Visual Bundle Adjustment with Multiple Sensors for Large-Scale Localization and Mapping | Code | 3
GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion | Code | 3
Improved motif-scaffolding with SE(3) flow matching | Code | 3
GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA | Code | 3
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression | Code | 3
OmniPred: Language Models as Universal Regressors | Code | 3
Deep OC-SORT: Multi-Pedestrian Tracking by Adaptive Re-Identification | Code | 3
ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution | Code | 3
ADBench: Anomaly Detection Benchmark | Code | 3
94% on CIFAR-10 in 3.29 Seconds on a Single GPU | Code | 3
MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction | Code | 3
On the Trajectory Regularity of ODE-based Diffusion Sampling | Code | 3
Amplifier: Bringing Attention to Neglected Low-Energy Components in Time Series Forecasting | Code | 3
IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models | Code | 3
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Code | 3
Single-Image Shadow Removal Using Deep Learning: A Comprehensive Survey | Code | 3
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model | Code | 3
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning | Code | 3
GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting | Code | 3
GraphStorm: all-in-one graph machine learning framework for industry applications | Code | 3
TokenPacker: Efficient Visual Projector for Multimodal LLM | Code | 3
WeatherMesh-3: Fast and accurate operational global weather forecasting | Code | 3
NdLinear Is All You Need for Representation Learning | Code | 3
Bake off redux: a review and experimental evaluation of recent time series classification algorithms | Code | 3
TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic Representation | Code | 3
CameraHMR: Aligning People with Perspective | Code | 3
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge | Code | 3
DEFOM-Stereo: Depth Foundation Model Based Stereo Matching | Code | 3
Rainbow: Combining Improvements in Deep Reinforcement Learning | Code | 3
Mambular: A Sequential Model for Tabular Deep Learning | Code | 3