SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 15101-15150 of 474,278 papers

Title | Status | Hype
NeuralPDR: Neural Differential Equations as surrogate models for Photodissociation Regions | Code | 0
Complete Characterization for Adjustment in Summary Causal Graphs of Time Series | - | 0
Synthetic Data Augmentation for Table Detection: Re-evaluating TableNet's Performance with Automatically Generated Document Images | - | 0
On the Hardness of Bandit Learning | - | 0
Structured and Informed Probabilistic Modeling with the Thermodynamic Kolmogorov-Arnold Model | Code | 0
Single-Example Learning in a Mixture of GPDMs with Latent Geometries | - | 0
RL-Obfuscation: Can Language Models Learn to Evade Latent-Space Monitors? | Code | 0
Integrating Radiomics with Deep Learning Enhances Multiple Sclerosis Lesion Delineation | - | 0
Knowledge Adaptation as Posterior Correction | - | 0
Foundation Model Insights and a Multi-Model Approach for Superior Fine-Grained One-shot Subset Selection | Code | 0
GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with GuidedSelection Vectors | Code | 0
Adaptive Data Augmentation for Thompson Sampling | - | 0
Don't throw the baby out with the bathwater: How and why deep learning for ARC | - | 0
SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks | - | 0
Risk Estimation of Knee Osteoarthritis Progression via Predictive Multi-task Modelling from Efficient Diffusion Model using X-ray Images | - | 0
AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents | Code | 1
Towards Desiderata-Driven Design of Visual Counterfactual Explainers | - | 0
Collaborative Editable Model | - | 0
Latent Anomaly Detection: Masked VQ-GAN for Unsupervised Segmentation in Medical CBCT | - | 0
Dense360: Dense Understanding from Omnidirectional Panoramas | - | 0
FocalClick-XL: Towards Unified and High-quality Interactive Segmentation | - | 0
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM | - | 0
S^4C: Speculative Sampling with Syntactic and Semantic Coherence for Efficient Inference of Large Language Models | - | 0
Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot | - | 0
Fragile Preferences: A Deep Dive Into Order Effects in Large Language Models | - | 0
Situational-Constrained Sequential Resources Allocation via Reinforcement Learning | - | 0
Bayesian Hybrid Machine Learning of Gallstone Risk | - | 0
DCRM: A Heuristic to Measure Response Pair Quality in Preference Optimization | Code | 0
Adjustment for Confounding using Pre-Trained Representations | Code | 0
Leveraging Predictive Equivalence in Decision Trees | Code | 0
sHGCN: Simplified hyperbolic graph convolutional neural networks | Code | 0
VisText-Mosquito: A Multimodal Dataset and Benchmark for AI-Based Mosquito Breeding Site Detection and Reasoning | Code | 0
Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset | Code | 0
From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents | Code | 0
MoORE: SVD-based Model MoE-ization for Conflict- and Oblivion-Resistant Multi-Task Adaptation | Code | 0
Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team | Code | 1
When Does Meaning Backfire? Investigating the Role of AMRs in NLI | - | 0
Mxplainer: Explain and Learn Insights by Imitating Mahjong Agents | Code | 0
Essential-Web v1.0: 24T tokens of organized web data | Code | 2
YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework | Code | 4
A Model-Mediated Stacked Ensemble Approach for Depression Prediction Among Professionals | - | 0
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization | Code | 1
CausalDiffTab: Mixed-Type Causal-Aware Diffusion for Tabular Data Generation | Code | 0
ASAP-FE: Energy-Efficient Feature Extraction Enabling Multi-Channel Keyword Spotting on Edge Processors | - | 0
Refining music sample identification with a self-supervised graph neural network | Code | 1
Egocentric Human-Object Interaction Detection: A New Benchmark and Method | - | 0
Cross-Modal Geometric Hierarchy Fusion: An Implicit-Submap Driven Framework for Resilient 3D Place Recognition | - | 0
KDMOS:Knowledge Distillation for Motion Segmentation | Code | 0
Causally Steered Diffusion for Automated Video Counterfactual Generation | Code | 0
Decoupled Classifier-Free Guidance for Counterfactual Diffusion Models | - | 0
Page 303 of 9486