SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 15751–15800 of 474278 papers

Title | Status | Hype
crossMoDA Challenge: Evolution of Cross-Modality Domain Adaptation Techniques for Vestibular Schwannoma and Cochlea Segmentation from 2021 to 2023 |  | 0
Structural Similarity-Inspired Unfolding for Lightweight Image Super-Resolution | Code | 1
Automated Treatment Planning for Interstitial HDR Brachytherapy for Locally Advanced Cervical Cancer using Deep Reinforcement Learning |  | 0
SIMSHIFT: A Benchmark for Adapting Neural Surrogates to Distribution Shifts | Code | 1
A Gamified Evaluation and Recruitment Platform for Low Resource Language Machine Translation Systems |  | 0
EconGym: A Scalable AI Testbed with Diverse Economic Tasks |  | 0
Exploring the Effectiveness of Deep Features from Domain-Specific Foundation Models in Retinal Image Synthesis |  | 0
DiffFuSR: Super-Resolution of all Sentinel-2 Multispectral Bands using Diffusion Models | Code | 1
EyeSim-VQA: A Free-Energy-Guided Eye Simulation Framework for Video Quality Assessment |  | 0
AgriPotential: A Novel Multi-Spectral and Multi-Temporal Remote Sensing Dataset for Agricultural Potentials |  | 0
Statistical Machine Learning for Astronomy -- A Textbook | Code | 2
FCA2: Frame Compression-Aware Autoencoder for Modular and Fast Compressed Video Super-Resolution | Code | 0
Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure |  | 0
VEIGAR: View-consistent Explicit Inpainting and Geometry Alignment for 3D object Removal |  | 0
code_transformed: The Influence of Large Language Models on Code |  | 0
Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards |  | 0
VGR: Visual Grounded Reasoning |  | 0
LoRA Users Beware: A Few Spurious Tokens Can Manipulate Your Finetuned Model | Code | 0
Dual-View Disentangled Multi-Intent Learning for Enhanced Collaborative Filtering | Code | 0
LearnAlign: Reasoning Data Selection for Reinforcement Learning in Large Language Models Based on Improved Gradient Alignment |  | 0
ReVeal: Self-Evolving Code Agents via Iterative Generation-Verification |  | 0
Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards |  | 0
TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models |  | 0
Learning Causality for Modern Machine Learning |  | 0
Interpretable representation learning of quantum data enabled by probabilistic variational autoencoders |  | 0
Let the Tree Decide: FABART A Non-Parametric Factor Model |  | 0
Camera-based method for the detection of lifted truck axles using convolutional neural networks |  | 0
Visual Pre-Training on Unlabeled Images using Reinforcement Learning | Code | 1
Mind the XAI Gap: A Human-Centered LLM Framework for Democratizing Explainable AI | Code | 0
Vision-based Lifting of 2D Object Detections for Automated Driving |  | 0
LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming? |  | 0
Fast Bayesian Optimization of Function Networks with Partial Evaluations | Code | 0
Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling | Code | 0
Robust Molecular Property Prediction via Densifying Scarce Labeled Data | Code | 0
Learn to Preserve Personality: Federated Foundation Models in Recommendations |  | 0
A Watermark for Auto-Regressive Image Generation Models |  | 0
Improving Large Language Model Safety with Contrastive Representation Learning | Code | 0
On the Natural Robustness of Vision-Language Models Against Visual Perception Attacks in Autonomous Driving |  | 0
TreeRL: LLM Reinforcement Learning with On-Policy Tree Search | Code | 2
TrustGLM: Evaluating the Robustness of GraphLLMs Against Prompt, Text, and Structure Attacks | Code | 0
Long-Short Alignment for Effective Long-Context Modeling in LLMs | Code | 0
Optimization of bi-directional gated loop cell based on multi-head attention mechanism for SSD health state classification model |  | 0
A Hybrid Multi-Agent Prompting Approach for Simplifying Complex Sentences |  | 0
Feedforward Ordering in Neural Connectomes via Feedback Arc Minimization |  | 0
BraTS orchestrator : Democratizing and Disseminating state-of-the-art brain tumor image analysis | Code | 2
Semantic Preprocessing for LLM-based Malware Analysis |  | 0
Abstract Sound Fusion with Unconditioned Inversion Model |  | 0
FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation | Code | 1
SecONNds: Secure Outsourced Neural Network Inference on ImageNet | Code | 0
Real-World Deployment of a Lane Change Prediction Architecture Based on Knowledge Graph Embeddings and Bayesian Inference |  | 0
Page 316 of 9486