SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 94269450 of 474278 papers

TitleStatusHype
Model Uncertainty in Evolutionary Optimization and Bayesian Optimization: A Comparative AnalysisCode2
SoftPatch: Unsupervised Anomaly Detection with Noisy DataCode2
MULDE: Multiscale Log-Density Estimation via Denoising Score Matching for Video Anomaly DetectionCode2
AutoRE: Document-Level Relation Extraction with Large Language ModelsCode2
SyncTweedies: A General Generative Framework Based on Synchronized DiffusionsCode2
Volumetric Environment Representation for Vision-Language NavigationCode2
View-decoupled Transformer for Person Re-identification under Aerial-ground Camera NetworkCode2
Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-LocalizationCode2
SpikingResformer: Bridging ResNet and Vision Transformer in Spiking Neural NetworksCode2
Protein Conformation Generation via Force-Guided SE(3) Diffusion ModelsCode2
Hierarchical NeuroSymbolic Approach for Comprehensive and Explainable Action Quality AssessmentCode2
AgentGroupChat: An Interactive Group Chat Simulacra For Better Eliciting Emergent BehaviorCode2
Evaluating Frontier Models for Dangerous CapabilitiesCode2
RAR: Retrieving And Ranking Augmented MLLMs for Visual RecognitionCode2
Diversified and Personalized Multi-rater Medical Image SegmentationCode2
Certified Human Trajectory PredictionCode2
Nellie: Automated organelle segmentation, tracking, and hierarchical feature extraction in 2D/3D live-cell microscopyCode2
eRST: A Signaled Graph Theory of Discourse Relations and OrganizationCode2
vid-TLDR: Training Free Token merging for Light-weight Video TransformerCode2
Modeling the Label Distributions for Weakly-Supervised Semantic SegmentationCode2
TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration TransducerCode2
Fast-Poly: A Fast Polyhedral Framework For 3D Multi-Object TrackingCode2
DanceCamera3D: 3D Camera Movement Synthesis with Music and DanceCode2
Scale Decoupled DistillationCode2
SocialBench: Sociality Evaluation of Role-Playing Conversational AgentsCode2
Show:102550
← PrevPage 378 of 18972Next →