SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 56515700 of 661570 papers

TitleStatusHype
Visual Set Program Synthesizer0
TrajMamba: An Ego-Motion-Guided Mamba Model for Pedestrian Trajectory Prediction from an Egocentric Perspective0
Can LLMs Simulate Personas with Reversed Performance? A Systematic Investigation for Counterfactual Instruction Following in Math Reasoning Context0
FAIRGAME: a Framework for AI Agents Bias Recognition using Game Theory0
SloPal: A 60-Million-Word Slovak Parliamentary Corpus with Aligned Speech and Fine-Tuned ASR Models0
Illustrator's Depth: Monocular Layer Index Prediction for Image Decomposition0
Too Open for Opinion? Embracing Open-Endedness in Large Language Models for Social Simulation0
An Implemention of Two-Phase Image Segmentation using the Split Bregman Method0
Variational Low-Rank Adaptation for Personalized Impaired Speech Recognition0
When Scores Learn Geometry: Rate Separations under the Manifold Hypothesis0
The silence of the weights: a structural pruning strategy for attention-based audio signal architectures with second order metrics0
Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception0
AWARE: Audio Watermarking with Adversarial Resistance to Edits0
Automatically Benchmarking LLM Code Agents through Agent-Driven Annotation and Evaluation0
Off the Planckian Locus: Using 2D Chromaticity to Improve In-Camera Color0
From Particles to Fields: Reframing Photon Mapping with Continuous Gaussian Photon Fields0
Reason2Decide: Rationale-Driven Multi-Task Learning0
A Language-Agnostic Hierarchical LoRA-MoE Architecture for CTC-based Multilingual ASR0
AGE-Net: Spectral--Spatial Fusion and Anatomical Graph Reasoning with Evidential Ordinal Regression for Knee Osteoarthritis Grading0
Automating Computational Reproducibility in Social Science: Comparing Prompt-Based and Agent-Based Approaches0
Covo-Audio Technical Report0
EMPA: Evaluating Persona-Aligned Empathy as a Process0
Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing0
QD-PCQA: Quality-Aware Domain Adaptation for Point Cloud Quality Assessment0
Grounding Machine Creativity in Game Design Knowledge Representations: Empirical Probing of LLM-Based Executable Synthesis of Goal Playable Patterns under Structural Constraints0
Differentiable Thermodynamic Phase-Equilibria for Machine Learning0
Slack More, Predict Better: Proximal Relaxation for Probabilistic Latent Variable Model-based Soft Sensors0
PhysMoDPO: Physically-Plausible Humanoid Motion with Preference Optimization0
Applications of Intuitionistic Temporal Logic to Temporal Answer Set Programming0
Design Space of Self--Consistent Electrostatic Machine Learning Interatomic Potentials0
Beyond Creed: A Non-Identity Safety Condition A Strong Empirical Alternative to Identity Framing in Low-Data LoRA Fine-Tuning0
CAMD: Coverage-Aware Multimodal Decoding for Efficient Reasoning of Multimodal Large Language Models0
AnyPhoto: Multi-Person Identity Preserving Image Generation with ID Adaptive Modulation on Location Canvas0
High-Fidelity 3D Facial Avatar Synthesis with Controllable Fine-Grained Expressions0
Mind-of-Director: Multi-modal Agent-Driven Film Previsualization via Collaborative Decision-Making0
Preconditioned One-Step Generative Modeling for Bayesian Inverse Problems in Function Spaces0
Universe Routing: Why Self-Evolving Agents Need Epistemic Control0
Real-Time Driver Safety Scoring Through Inverse Crash Probability Modeling0
From Horizontal to Rotated: Cross-View Object Geo-Localization with Orientation Awareness0
LLMind: Bio-inspired Training-free Adaptive Visual Representations for Vision-Language Models0
PASTE: Physics-Aware Scattering Topology Embedding Framework for SAR Object Detection0
Photonic Quantum-Enhanced Knowledge Distillation0
Algorithms for Deciding the Safety of States in Fully Observable Non-deterministic Problems: Technical Report0
ILV: Iterative Latent Volumes for Fast and Accurate Sparse-View CT Reconstruction0
Pansharpening for Thin-Cloud Contaminated Remote Sensing Images: A Unified Framework and Benchmark Dataset0
A convolutional autoencoder and neural ODE framework for surrogate modeling of transient counterflow flames0
GUI-CEval: A Hierarchical and Comprehensive Chinese Benchmark for Mobile GUI Agents0
PrototypeNAS: Rapid Design of Deep Neural Networks for Microcontroller Units0
Modeling Matches as Language: A Generative Transformer Approach for Counterfactual Player Valuation in Football0
SCAN: Sparse Circuit Anchor Interpretable Neuron for Lifelong Knowledge Editing0
Show:102550
← PrevPage 114 of 13232Next →