SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 12011250 of 659983 papers

TitleStatusHype
QMoP: Query Guided Mixture-of-Projector for Efficient Visual Token Compression0
DepthTCM: High Efficient Depth Compression via Physics-aware Transformer-CNN Mixed Architecture0
Enhancing Brain Tumor Classification Using Vision Transformers with Colormap-Based Feature Representation on BRISC2025 Dataset0
Domain Elastic Transform: Bayesian Function Registration for High-Dimensional Scientific Data0
Does Mechanistic Interpretability Transfer Across Data Modalities? A Cross-Domain Causal Circuit Analysis of Variational Autoencoders0
WARBENCH: A Comprehensive Benchmark for Evaluating LLMs in Military Decision-Making0
Fusing Memory and Attention: A study on LSTM, Transformer and Hybrid Architectures for Symbolic Music Generation0
Sonny: Breaking the Compute Wall in Medium-Range Weather Forecasting0
Focus on Background: Exploring SAM's Potential in Few-shot Medical Image Segmentation with Background-centric Prompting0
More Than Sum of Its Parts: Deciphering Intent Shifts in Multimodal Hate Speech Detection0
Identity-Consistent Video Generation under Large Facial-Angle Variations0
The Average Relative Entropy and Transpilation Depth determines the noise robustness in Variational Quantum Classifiers0
Privacy-Preserving Federated Action Recognition via Differentially Private Selective Tuning and Efficient Communication0
Active Inference Agency Formalization, Metrics, and Convergence Assessments0
Improving Coherence and Persistence in Agentic AI for System Optimization0
Which Alert Removals are Beneficial?0
B-jet Tagging Using a Hybrid Edge Convolution and Transformer Architecture0
PAS3R: Pose-Adaptive Streaming 3D Reconstruction for Long Video Sequences0
Semantic Shift: the Fundamental Challenge in Text Embedding and Retrieval0
PROMPT2BOX: Uncovering Entailment Structure among LLM Prompts0
KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning0
Errors in AI-Assisted Retrieval of Medical Literature: A Comparative Study0
T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search0
Neutrino Oscillation Parameter Estimation Using Structured Hierarchical Transformers0
Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation0
Graphs RAG at Scale: Beyond Retrieval-Augmented Generation With Labeled Property Graphs and Resource Description Framework for Complex and Unknown Search Spaces0
Implicit Turn-Wise Policy Optimization for Proactive User-LLM Interaction0
Subject Information Extraction for Novelty Detection with Domain Shifts0
LJ-Bench: Ontology-Based Benchmark for U.S. Crime0
Context Cartography: Toward Structured Governance of Contextual Space in Large Language Model Systems0
Position: Multi-Agent Algorithmic Care Systems Demand Contestability for Trustworthy AI0
Graph-based data-driven discovery of interpretable laws governing corona-induced noise and radio interference for high-voltage transmission lines0
Interpretable Operator Learning for Inverse Problems via Adaptive Spectral Filtering: Convergence and Discretization Invariance0
Bayesian Learning in Episodic Zero-Sum Games0
Towards Practical World Model-based Reinforcement Learning for Vision-Language-Action Models0
GaussianPile: A Unified Sparse Gaussian Splatting Framework for Slice-based Volumetric Reconstruction0
Beyond Token Eviction: Mixed-Dimension Budget Allocation for Efficient KV Cache Compression0
Where can AI be used? Insights from a deep ontology of work activities0
Reasoning Traces Shape Outputs but Models Won't Say So0
LassoFlexNet: Flexible Neural Architecture for Tabular Data0
Optimal low-rank stochastic gradient estimation for LLM training0
Seed1.8 Model Card: Towards Generalized Real-World Agency0
CFNN: Continued Fraction Neural Network0
A Modular LLM Framework for Explainable Price Outlier Detection0
AEGIS: From Clues to Verdicts -- Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing0
Agentic AI and the next intelligence explosion0
Hear Both Sides: Efficient Multi-Agent Debate via Diversity-Aware Message Retention0
Weber's Law in Transformer Magnitude Representations: Efficient Coding, Representational Geometry, and Psychophysical Laws in Language Models0
Diffusion Model for Manifold Data: Score Decomposition, Curvature, and Statistical Complexity0
A Multihead Continual Learning Framework for Fine-Grained Fashion Image Retrieval with Contrastive Learning and Exponential Moving Average Distillation0
Show:102550
← PrevPage 25 of 13200Next →