SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 57015725 of 661570 papers

TitleStatusHype
Practicing with Language Models Cultivates Human Empathic Communication0
Directional Embedding Smoothing for Robust Vision Language Models0
A Closer Look into LLMs for Table Understanding0
Scalable Simulation-Based Model Inference with Test-Time Complexity Control0
Enhancing classification accuracy through chaos0
Tagarela - A Portuguese speech dataset from podcasts0
MeMix: Writing Less, Remembering More for Streaming 3D Reconstruction0
Persistence Spheres: a Bi-continuous Linear Representation of Measures for Partial Optimal Transport0
RieMind: Geometry-Grounded Spatial Agent for Scene Understanding0
Fusian: Multi-LoRA Fusion for Fine-Grained Continuous MBTI Personality Control in Large Language Models0
Evasive Intelligence: Lessons from Malware Analysis for Evaluating AI Agents0
Agent Lifecycle Toolkit (ALTK): Reusable Middleware Components for Robust AI Agents0
Estimating Staged Event Tree Models via Hierarchical Clustering on the Simplex0
ViX-Ray: A Vietnamese Chest X-Ray Dataset for Vision-Language Models0
Clinically Aware Synthetic Image Generation for Concept Coverage in Chest X-ray Models0
Can LLMs Model Incorrect Student Reasoning? A Case Study on Distractor Generation0
Self-Distillation of Hidden Layers for Self-Supervised Representation Learning0
Mamba-3: Improved Sequence Modeling using State Space Principles0
Do Metrics for Counterfactual Explanations Align with User Perception?0
Towards Generalizable Robotic Manipulation in Dynamic EnvironmentsCode0
GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering2
Diverse AI Personas Can Mitigate the Homogenization Effect in Human-AI Collaborative Ideation0
IMAIA: Interactive Maps AI Assistant for Travel Planning and Geo-Spatial Intelligence0
CountLoop: Training-Free High-Instance Image Generation via Iterative Agent Guidance0
From Image Generation to Infrastructure Design: a Multi-agent Pipeline for Street Design Generation0
Show:102550
← PrevPage 229 of 26463Next →