SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1010110125 of 177340 papers

TitleStatusHype
Active Learning with Fully Bayesian Neural Networks for Discontinuous and Nonstationary DataCode2
DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM JailbreakersCode2
Numerical Association Rule Mining: A Systematic Literature ReviewCode2
SciAssess: Benchmarking LLM Proficiency in Scientific Literature AnalysisCode2
Deep Constrained Least Squares for Blind Image Super-ResolutionCode2
Beyond Accuracy: Behavioral Testing of NLP models with CheckListCode2
Unified Contrastive Learning in Image-Text-Label SpaceCode2
Monitoring and explainability of models in productionCode2
Dragonfly: Multi-Resolution Zoom-In Encoding Enhances Vision-Language ModelsCode2
Shift-ConvNets: Small Convolutional Kernel with Large Kernel EffectsCode2
ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language ModelsCode2
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation ModelCode2
M^3-20M: A Large-Scale Multi-Modal Molecule Dataset for AI-driven Drug Design and DiscoveryCode2
JoJoGAN: One Shot Face StylizationCode2
MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object DetectionCode2
Reinforcing General Reasoning without VerifiersCode2
pyRDF2Vec: A Python Implementation and Extension of RDF2VecCode2
Search Arena: Analyzing Search-Augmented LLMsCode2
R3M: A Universal Visual Representation for Robot ManipulationCode2
RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured EnvironmentsCode2
UV-free Texture Generation with Denoising and Geodesic Heat DiffusionsCode2
From Tiny Machine Learning to Tiny Deep Learning: A SurveyCode2
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation GenerationCode2
Unishox: A hybrid encoder for Short Unicode StringsCode2
Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion UsersCode2
Show:102550
← PrevPage 405 of 7094Next →