SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 37763800 of 661570 papers

TitleStatusHype
Few-shot Acoustic Synthesis with Multimodal Flow Matching0
Improving RCT-Based Treatment Effect Estimation Under Covariate Mismatch via Calibrated Alignment0
Tinted Frames: Question Framing Blinds Vision-Language Models0
FinTradeBench: A Financial Reasoning Benchmark for LLMs0
Under One Sun: Multi-Object Generative Perception of Materials and Illumination0
Learning-to-Defer with Expert-Conditioned Advice0
iSatCR: Graph-Empowered Joint Onboard Computing and Routing for LEO Data Delivery0
3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model1
Attack by Unlearning: Unlearning-Induced Adversarial Attacks on Graph Neural Networks0
Inst4DGS: Instance-Decomposed 4D Gaussian Splatting with Multi-Video Label Permutation Learning0
TopoChunker: Topology-Aware Agentic Document Chunking Framework0
Nonparametric Variational Differential Privacy via Embedding Parameter Clipping0
AdaSwitch: Balancing Exploration and Guidance in Knowledge Distillation via Adaptive Switching0
Affect Decoding in Phonated and Silent Speech Production from Surface EMG0
Social Simulacra in the Wild: AI Agent Communities on Moltbook0
SAVeS: Steering Safety Judgments in Vision-Language Models via Semantic Cues0
Hardness of High-Dimensional Linear Classification0
Act While Thinking: Accelerating LLM Agents via Pattern-Aware Speculative Tool Execution0
From Accuracy to Readiness: Metrics and Benchmarks for Human-AI Decision-Making0
AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data Science0
Spectrally-Guided Diffusion Noise Schedules0
F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World0
Mixture of Style Experts for Diverse Image Stylization1
When Differential Privacy Meets Wireless Federated Learning: An Improved Analysis for Privacy and Convergence0
Enhancing Multi-Corpus Training in SSL-Based Anti-Spoofing Models: Domain-Invariant Feature Extraction0
Show:102550
← PrevPage 152 of 26463Next →