SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 83018325 of 474278 papers

TitleStatusHype
PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian SplattingCode2
SuperSVG: Superpixel-based Scalable Vector Graphics SynthesisCode2
ControlVAR: Exploring Controllable Visual Autoregressive ModelingCode2
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMsCode2
DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving ApplicationsCode2
Consistency-diversity-realism Pareto fronts of conditional image generative modelsCode2
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code GenerationCode2
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian LanguagesCode2
Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language NavigationCode2
QQQ: Quality Quattuor-Bit Quantization for Large Language ModelsCode2
RaNeuS: Ray-adaptive Neural Surface ReconstructionCode2
BEACON: Benchmark for Comprehensive RNA Tasks and Language ModelsCode2
PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano PerformanceCode2
JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language ModelsCode2
Are We There Yet? A Brief Survey of Music Emotion Prediction Datasets, Models and Outstanding ChallengesCode2
Yo'LLaVA: Your Personalized Language and Vision AssistantCode2
An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare RecordsCode2
Fredformer: Frequency Debiased Transformer for Time Series ForecastingCode2
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object DetectionCode2
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided TransformerCode2
Understanding Hallucinations in Diffusion Models through Mode InterpolationCode2
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource ScenariosCode2
Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language ModelsCode2
Classic GNNs are Strong Baselines: Reassessing GNNs for Node ClassificationCode2
Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content DetectorsCode2
Show:102550
← PrevPage 333 of 18972Next →