SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers258,216 code links4,818 tasks

Papers

Showing 151200 of 658356 papers

TitleStatusHype
Timestep-Aware Block Masking for Efficient Diffusion Model Inference0
Hybrid topic modelling for computational close reading: Mapping narrative themes in Pushkin's Evgenij Onegin0
TAPAS: Efficient Two-Server Asymmetric Private Aggregation Beyond Prio(+)0
Structural Controllability of Large-Scale Hypergraphs0
Cov2Pose: Leveraging Spatial Covariance for Direct Manifold-aware 6-DoF Object Pose Estimation0
Channel Prediction-Based Physical Layer Authentication under Consecutive Spoofing Attacks0
2K Retrofit: Entropy-Guided Efficient Sparse Refinement for High-Resolution 3D Geometry Prediction0
Diffusion-Based Makeup Transfer with Facial Region-Aware Makeup Features0
Graph2TS: Structure-Controlled Time Series Generation via Quantile-Graph VAEs0
Model-Driven Learning-Based Physical Layer Authentication for Mobile Wi-Fi Devices0
Promoting Critical Thinking With Domain-Specific Generative AI Provocations0
X-World: Controllable Ego-Centric Multi-Camera World Models for Scalable End-to-End Driving0
Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States0
Evaluating Test-Time Adaptation For Facial Expression Recognition Under Natural Cross-Dataset Distribution Shifts0
When Contextual Inference Fails: Cancelability in Interactive Instruction Following0
An Agentic Approach to Generating XAI-Narratives0
ReViSQL: Achieving Human-Level Text-to-SQL0
Physics-Informed Long-Range Coulomb Correction for Machine-learning Hamiltonians0
AgenticRS-EnsNAS: Ensemble-Decoupled Self-Evolving Architecture Search0
Detached Skip-Links and R-Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR0
Orchestrating Human-AI Software Delivery: A Retrospective Longitudinal Field Study of Three Software Modernization Programs0
CoverageBench: Evaluating Information Coverage across Tasks and Domains0
Continual Learning as Shared-Manifold Continuation Under Compatible Shift0
Federated Hyperdimensional Computing for Resource-Constrained Industrial IoT0
LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families0
Investigating a Policy-Based Formulation for Endoscopic Camera Pose Recovery0
Structured Latent Dynamics in Wireless CSI via Homomorphic World Models0
DIAL-KG: Schema-Free Incremental Knowledge Graph Construction via Dynamic Schema Induction and Evolution-Intent Assessment0
The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries0
The monotonicity of the Franz-Parisi potential is equivalent with Low-degree MMSE lower bounds0
Antenna Array Beamforming Based on a Hybrid Quantum Optimization Framework0
A Unified Platform and Quality Assurance Framework for 3D Ultrasound Reconstruction with Robotic, Optical, and Electromagnetic Tracking0
Predicting States of Understanding in Explanatory Interactions Using Cognitive Load-Related Linguistic Cues0
Preference-Guided Debiasing for No-Reference Enhancement Image Quality Assessment0
How Out-of-Equilibrium Phase Transitions can Seed Pattern Formation in Trained Diffusion Models0
LLM-Enhanced Semantic Data Integration of Electronic Component Qualifications in the Aerospace Domain0
Pitfalls in Evaluating Interpretability Agents0
Spectral Alignment in Forward-Backward Representations via Temporal Abstraction0
Trojan horse hunt in deep forecasting models: Insights from the European Space Agency competition0
GO-GenZip: Goal-Oriented Generative Sampling and Hybrid Compression0
Var-JEPA: A Variational Formulation of the Joint-Embedding Predictive Architecture -- Bridging Predictive and Generative Self-Supervised Learning0
Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech0
Current LLMs still cannot 'talk much' about grammar modules: Evidence from syntax0
Conditioning Protein Generation via Hopfield Pattern Multiplicity0
Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning0
Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models0
Generalizable NGP-SR: Generalizable Neural Radiance Fields Super-Resolution via Neural Graph Primitives0
An Agentic Multi-Agent Architecture for Cybersecurity Risk Management0
Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents0
Reasoning Gets Harder for LLMs Inside A Dialogue0
Show:102550
← PrevPage 4 of 13168Next →