SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 96019650 of 661570 papers

TitleStatusHype
Distributionally Robust Self Paced Curriculum Reinforcement Learning0
Beyond Data Splitting: Full-Data Conformal Prediction by Differential Privacy0
Brain-WM: Brain Glioblastoma World ModelCode0
Accelerating Diffusion Models for Generative AI Applications with Silicon Photonics0
Probabilistic Inference and Learning with Stein's Method0
PointSlice: Accurate and Efficient Slice-Based Representation for 3D Object Detection from Point CloudsCode0
Beyond Endpoints: Path-Centric Reasoning for Vectorized Off-Road Network ExtractionCode0
Extracting Recurring Vulnerabilities from Black-Box LLM-Generated SoftwareCode0
Dial: A Knowledge-Grounded Dialect-Specific NL2SQL SystemCode0
FedEU: Evidential Uncertainty-Driven Federated Fine-Tuning of Vision Foundation Models for Remote Sensing Image SegmentationCode0
EVLF: Early Vision-Language Fusion for Generative Dataset DistillationCode0
PureCC: Pure Learning for Text-to-Image Concept CustomizationCode0
Revisiting the LiRA Membership Inference Attack Under Realistic AssumptionsCode0
KCoEvo: A Knowledge Graph Augmented Framework for Evolutionary Code GenerationCode0
Duala: Dual-Level Alignment of Subjects and Stimuli for Cross-Subject fMRI DecodingCode0
3DMedAgent: Unified Perception-to-Understanding for 3D Medical AnalysisCode0
PonderLM-2: Pretraining LLM with Latent Thoughts in Continuous SpaceCode0
Your Agent May Misevolve: Emergent Risks in Self-evolving LLM AgentsCode0
MUGSQA: Novel Multi-Uncertainty-Based Gaussian Splatting Quality Assessment Method, Dataset, and BenchmarksCode0
KVSlimmer: Theoretical Insights and Practical Optimizations for Asymmetric KV MergingCode0
Backdoor4Good: Benchmarking Beneficial Uses of Backdoors in LLMsCode0
SiamGM: Siamese Geometry-Aware and Motion-Guided Network for Real-Time Satellite Video Object TrackingCode0
KohakuRAG: A simple RAG framework with hierarchical document indexingCode0
NAAMSE: Framework for Evolutionary Security Evaluation of AgentsCode0
Route, Retrieve, Reflect, Repair: Self-Improving Agentic Framework for Visual Detection and Linguistic Reasoning in Medical ImagingCode0
TAPFormer: Robust Arbitrary Point Tracking via Transient Asynchronous Fusion of Frames and Events1
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence3
HiconAgent: History Context-aware Policy Optimization for GUI Agents1
GLASS: Graph and Vision-Language Assisted Semantic Shape Correspondence0
Generalization in Online Reinforcement Learning for Mobile AgentsCode0
Konkani LLM: Multi-Script Instruction Tuning and Evaluation for a Low-Resource Indian Language0
Did You Forget What I Asked? Prospective Memory Failures in Large Language Models0
Large Language Models Unpack Complex Political Opinions through Target-Stance Extraction0
Fusing Driver Perceived and Physical Risk for Safety Critical Scenario Screening in Autonomous Driving0
Discovering the Hidden Role of Gini Index In Prompt-based Classification0
Beyond Reward Suppression: Reshaping Steganographic Communication Protocols in MARL via Dynamic Representational Circuit Breaking0
Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context0
Complementarity-Supervised Spectral-Band Routing for Multimodal Emotion Recognition0
MS2MetGAN: Latent-space adversarial training for metabolite-spectrum matching in MS/MS database search0
Post Training Quantization for Efficient Dataset Condensation0
AI-Driven Predictive Maintenance with Real-Time Contextual Data Fusion for Connected Vehicles: A Multi-Dataset Evaluation0
DDS-UDA: Dual-Domain Synergy for Unsupervised Domain Adaptation in Joint Segmentation of Optic Disc and Optic Cup0
DyACE: Dynamic Algorithm Co-evolution for Online Automated Heuristic Design with Large Language Model0
PolyGLU: State-Conditional Activation Routing in Transformer Feed-Forward Networks0
AutoTool: Automatic Scaling of Tool-Use Capabilities in RL via Decoupled Entropy Constraints0
MURE: Hierarchical Multi-Resolution Encoding via Vision-Language Models for Visual Document Retrieval0
Thermal Robustness of Retrieval in Dense Associative Memories: LSE vs LSR Kernels0
Prompt Complexity Dilutes Structured Reasoning: A Follow-Up Study on the Car Wash Problem0
Mind the Discriminability Trap in Source-Free Cross-domain Few-shot LearningCode0
ConfHit: Conformal Generative Design with Oracle Free Guarantees0
Show:102550
← PrevPage 193 of 13232Next →