SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 81518200 of 661570 papers

TitleStatusHype
Federated Active Learning Under Extreme Non-IID and Global Class ImbalanceCode0
On the Learning Dynamics of Two-layer Linear Networks with Label Noise SGDCode0
Sparse Task Vector Mixup with Hypernetworks for Efficient Knowledge Transfer in Whole-Slide Image PrognosisCode0
Reinforcement Learning with Conditional Expectation RewardCode0
CodePercept: Code-Grounded Visual STEM Perception for MLLMsCode0
Guiding Diffusion Models with Semantically Degraded ConditionsCode0
Ranking Reasoning LLMs under Test-Time ScalingCode0
DNS-GT: A Graph-based Transformer Approach to Learn Embeddings of Domain Names from DNS QueriesCode0
Benchmarking Graph Neural Networks in Solving Hard Constraint Satisfaction ProblemsCode0
Bilevel Layer-Positioning LoRA for Real Image DehazingCode0
LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without GenerationCode0
CUPID: A Plug-in Framework for Joint Aleatoric and Epistemic Uncertainty Estimation with a Single ModelCode0
Protein Counterfactuals via Diffusion-Guided Latent OptimizationCode0
ZACH-ViT: Regime-Dependent Inductive Bias in Compact Vision Transformers for Medical ImagingCode0
Shadow in the Cache: Unveiling and Mitigating Privacy Risks of KV-cache in LLM Inference1
CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance1
LaTeXTrans: Structured LaTeX Translation with Multi-Agent Coordination3
Efficient Audio-Visual Speech Separation with Discrete Lip Semantics and Multi-Scale Global-Local Attention2
CostNav: A Navigation Benchmark for Real-World Economic-Cost Evaluation of Physical AI AgentsCode0
Sample-and-Search: An Effective Algorithm for Learning-Augmented k-Median Clustering in High dimensions0
Beyond the Illusion of Consensus: From Surface Heuristics to Knowledge-Grounded Evaluation in LLM-as-a-Judge0
FRIEND: Federated Learning for Joint Optimization of multi-RIS Configuration and Eavesdropper Intelligent Detection in B5G Networks0
Operationalizing Perceptions of Agent Gender: Foundations and Guidelines0
LITTA: Late-Interaction and Test-Time Alignment for Visually-Grounded Multimodal Retrieval0
Decoding the decoder: Contextual sequence-to-sequence modeling for intracortical speech decoding0
Stability of AI Governance Systems: A Coupled Dynamics Model of Public Trust and Social Disruptions0
Developing Machine Learning-Based Watch-to-Warning Severe Weather Guidance from the Warn-on-Forecast System0
A Visualization for Comparative Analysis of Regression Models0
Automatic Analysis of Collaboration Through Human Conversational Data Resources: A Review0
Maximizing mutual information between user-contexts and responses improve LLM personalization with no additional data0
LLM-MRD: LLM-Guided Multi-View Reasoning Distillation for Fake News DetectionCode0
Semantic Chameleon: Corpus-Dependent Poisoning Attacks and Defenses in RAG Systems0
Quantizer-Aware Hierarchical Neural Codec Modeling for Speech Deepfake Detection0
Privacy and Safety Experiences and Concerns of U.S. Women Using Generative AI for Seeking Sexual and Reproductive Health Information0
HoloByte: Continuous Hyperspherical Distillation for Tokenizer-Free ModelingCode0
OrthoAI v2: From Single-Agent Segmentation to Dual-Agent Treatment Planning for Clear Aligners0
Quantum Amplitude Estimation for Catastrophe Insurance Tail-Risk Pricing: Empirical Convergence and NISQ Noise Analysis0
OpenClaw-RL: Train Any Agent Simply by TalkingCode0
Enhancing Reconstruction Capability of Wavelet Transform Amorphous Radial Distribution Function via Machine Learning Assisted Parameter Tuning0
Geometry-Aware Semantic Reasoning for Training Free Video Anomaly Detection0
InfiniteDance: Scalable 3D Dance Generation Towards in-the-wild Generalization0
A Computer-aided Framework for Detecting Osteosarcoma in Computed Tomography Scans0
Deep Learning for BioImaging: What Are We Learning?0
Do Large Language Models Get Caught in Hofstadter-Mobius Loops?0
A Hierarchical End-of-Turn Model with Primary Speaker Segmentation for Real-Time Conversational AI0
FusionNet: a frame interpolation network for 4D heart modelsCode0
Detecting Miscitation on the Scholarly Web through LLM-Augmented Text-Rich Graph Learning0
GPU-Accelerated Genetic Programming for Symbolic Regression with Beagle Framework0
A Causal Graph Approach to Oppositional Narrative Analysis0
Learning Bayesian and Markov Networks with an Unreliable Oracle0
Show:102550
← PrevPage 164 of 13232Next →