SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 6,976-7,000 of 474,278 papers

Title | Status | Hype
InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models | Code | 2
Controlling Language and Diffusion Models by Transporting Activations | Code | 2
A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Anomaly Detection | Code | 2
ET-Flow: Equivariant Flow-Matching for Molecular Conformer Generation | Code | 2
CHORDONOMICON: A Dataset of 666,000 Songs and their Chord Progressions | Code | 2
PC-Gym: Benchmark Environments For Process Control Problems | Code | 2
Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications | Code | 2
Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench | Code | 2
Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation | Code | 2
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance | Code | 2
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting | Code | 2
AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts | Code | 2
Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval | Code | 2
LongReward: Improving Long-context Large Language Models with AI Feedback | Code | 2
Hacking Back the AI-Hacker: Prompt Injection as a Defense Against LLM-driven Cyberattacks | Code | 2
ODRL: A Benchmark for Off-Dynamics Reinforcement Learning | Code | 2
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning | Code | 2
BSD: a Bayesian framework for parametric models of neural spectra | Code | 2
Fast Calibrated Explanations: Efficient and Uncertainty-Aware Explanations for Machine Learning Models | Code | 2
RecFlow: An Industrial Full Flow Recommendation Dataset | Code | 2
Skinned Motion Retargeting with Dense Geometric Interaction Perception | Code | 2
NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks | Code | 2
Flaming-hot Initiation with Regular Execution Sampling for Large Language Models | Code | 2
Domain Adaptation with a Single Vision-Language Embedding | Code | 2
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders | Code | 2