SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 56515675 of 661570 papers

TitleStatusHype
Visual Set Program Synthesizer0
TrajMamba: An Ego-Motion-Guided Mamba Model for Pedestrian Trajectory Prediction from an Egocentric Perspective0
Can LLMs Simulate Personas with Reversed Performance? A Systematic Investigation for Counterfactual Instruction Following in Math Reasoning Context0
FAIRGAME: a Framework for AI Agents Bias Recognition using Game Theory0
SloPal: A 60-Million-Word Slovak Parliamentary Corpus with Aligned Speech and Fine-Tuned ASR Models0
Illustrator's Depth: Monocular Layer Index Prediction for Image Decomposition0
Too Open for Opinion? Embracing Open-Endedness in Large Language Models for Social Simulation0
An Implemention of Two-Phase Image Segmentation using the Split Bregman Method0
Variational Low-Rank Adaptation for Personalized Impaired Speech Recognition0
When Scores Learn Geometry: Rate Separations under the Manifold Hypothesis0
The silence of the weights: a structural pruning strategy for attention-based audio signal architectures with second order metrics0
Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception0
AWARE: Audio Watermarking with Adversarial Resistance to Edits0
Automatically Benchmarking LLM Code Agents through Agent-Driven Annotation and Evaluation0
Off the Planckian Locus: Using 2D Chromaticity to Improve In-Camera Color0
From Particles to Fields: Reframing Photon Mapping with Continuous Gaussian Photon Fields0
Reason2Decide: Rationale-Driven Multi-Task Learning0
A Language-Agnostic Hierarchical LoRA-MoE Architecture for CTC-based Multilingual ASR0
AGE-Net: Spectral--Spatial Fusion and Anatomical Graph Reasoning with Evidential Ordinal Regression for Knee Osteoarthritis Grading0
Automating Computational Reproducibility in Social Science: Comparing Prompt-Based and Agent-Based Approaches0
Covo-Audio Technical Report0
EMPA: Evaluating Persona-Aligned Empathy as a Process0
Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing0
QD-PCQA: Quality-Aware Domain Adaptation for Point Cloud Quality Assessment0
Grounding Machine Creativity in Game Design Knowledge Representations: Empirical Probing of LLM-Based Executable Synthesis of Goal Playable Patterns under Structural Constraints0
Show:102550
← PrevPage 227 of 26463Next →