SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 1676-1700 of 659983 papers

Title | Status | Hype
Var-JEPA: A Variational Formulation of the Joint-Embedding Predictive Architecture -- Bridging Predictive and Generative Self-Supervised Learning | | 0
Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech | | 0
Current LLMs still cannot 'talk much' about grammar modules: Evidence from syntax | | 0
Conditioning Protein Generation via Hopfield Pattern Multiplicity | | 0
Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning | | 0
Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models | | 0
Generalizable NGP-SR: Generalizable Neural Radiance Fields Super-Resolution via Neural Graph Primitives | | 0
An Agentic Multi-Agent Architecture for Cybersecurity Risk Management | | 0
Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents | | 0
Reasoning Gets Harder for LLMs Inside A Dialogue | | 0
Can Large Multimodal Models Inspect Buildings? A Hierarchical Benchmark for Structural Pathology Reasoning | | 0
Improving Generalization on Cybersecurity Tasks with Multi-Modal Contrastive Learning | | 0
Enhancing Hyperspace Analogue to Language (HAL) Representations via Attention-Based Pooling for Text Classification | | 0
Design-OS: A Specification-Driven Framework for Engineering System Design with a Control-Systems Design Case | | 0
Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD | | 0
Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models | | 0
Evaluating Evidence Grounding Under User Pressure in Instruction-Tuned Language Models | | 0
The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning | | 0
EgoForge: Goal-Directed Egocentric World Simulator | | 0
Learning Dynamic Belief Graphs for Theory-of-mind Reasoning | | 0
TinyML Enhances CubeSat Mission Capabilities | | 0
LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis | | 0
AI Agents Can Already Autonomously Perform Experimental High Energy Physics | | 0
Adaptive Greedy Frame Selection for Long Video Understanding | | 0
VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking | | 0
Page 68 of 26400