SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 17511800 of 659983 papers

TitleStatusHype
Transferable Multi-Bit Watermarking Across Frozen Diffusion Models via Latent Consistency Bridges0
kRAIG: A Natural Language-Driven Agent for Automated DataOps Pipeline Generation0
Semantic Tool Discovery for Large Language Models: A Vector-Based Approach to MCP Tool Selection0
VGS-Decoding: Visual Grounding Score Guided Decoding for Hallucination Mitigation in Medical VLMs0
Rolling-Origin Validation Reverses Model Rankings in Multi-Step PM10 Forecasting: XGBoost, SARIMA, and Persistence0
Bypassing Document Ingestion: An MCP Approach to Financial Q&A0
Which Workloads Belong in Orbit? A Workload-First Framework for Orbital Data Centers Using Semantic Abstraction0
The Causal Impact of Tool Affordance on Safety Alignment in LLM Agents0
GIP-RAG: An Evidence-Grounded Retrieval-Augmented Framework for Interpretable Gene Interaction and Pathway Impact Analysis0
HSI Image Enhancement Classification Based on Knowledge Distillation: A Study on Forgetting0
Collaborative Adaptive Curriculum for Progressive Knowledge Distillation0
Transformer-Based Predictive Maintenance for Risk-Aware Instrument Calibration0
HCAG: Hierarchical Abstraction and Retrieval-Augmented Generation on Theoretical Repositories with LLMs0
From Human Interfaces to Agent Interfaces: Rethinking Software Design in the Age of AI-Native Systems0
The Global-Local loop: what is missing in bridging the gap between geospatial data from numerous communities?0
EARTalking: End-to-end GPT-style Autoregressive Talking Head Synthesis with Frame-wise Control0
Reason-to-Transmit: Deliberative Adaptive Communication for Cooperative Perception0
GraphiContact: Pose-aware Human-Scene Robust Contact Perception for Interactive Systems0
DAPA: Distribution Aware Piecewise Activation Functions for On-Device Transformer Inference and Training0
TuLaBM: Tumor-Biased Latent Bridge Matching for Contrast-Enhanced MRI Synthesis0
Pseudo-Labeling for Unsupervised Domain Adaptation with Kernel GLMs0
VeloxNet: Efficient Spatial Gating for Lightweight Embedded Image Classification0
Depictions of Depression in Generative AI Video Models: A Preliminary Study of OpenAI's Sora 20
dinov3.seg: Open-Vocabulary Semantic Segmentation with DINOv30
Predicting Hidden Links and Missing Nodes in Scale-Free Networks with Artificial Neural Networks0
POET: Power-Oriented Evolutionary Tuning for LLM-Based RTL PPA Optimization0
Do Post-Training Algorithms Actually Differ? A Controlled Study Across Model Scales Uncovers Scale-Dependent Ranking Inversions0
Diffusion-Guided Semantic Consistency for Multimodal Heterogeneity0
Spectral Tempering for Embedding Compression in Dense Passage Retrieval0
Beyond Weighted Summation: Learnable Nonlinear Aggregation Functions for Robust Artificial Neurons0
Exploring the Agentic Frontier of Verilog Code Generation0
Anatomical Heterogeneity in Transformer Language Models0
A Mathematical Theory of Understanding0
A Novel Solution for Zero-Day Attack Detection in IDS using Self-Attention and Jensen-Shannon Divergence in WGAN-GP0
Warm-Start Flow Matching for Guaranteed Fast Text/Image Generation0
Factored Levenberg-Marquardt for Diffeomorphic Image Registration: An efficient optimizer for FireANTs0
Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents0
Bridging Conformal Prediction and Scenario Optimization: Discarded Constraints and Modular Risk Allocation0
Optimizing Resource-Constrained Non-Pharmaceutical Interventions for Multi-Cluster Outbreak Control Using Hierarchical Reinforcement Learning0
Scalable Prompt Routing via Fine-Grained Latent Task Discovery0
Investigating In-Context Privacy Learning by Integrating User-Facing Privacy Tools into Conversational Agents0
The Autonomy Tax: Defense Training Breaks LLM Agents0
Is Evaluation Awareness Just Format Sensitivity? Limitations of Probe-Based Evidence under Controlled Prompt Structure0
Vocabulary shapes cross-lingual variation of word-order learnability in language models0
When both Grounding and not Grounding are Bad -- A Partially Grounded Encoding of Planning into SAT (Extended Version)0
Subspace Projection Methods for Fast Spectral Embeddings of Evolving Graphs0
Near-Equivalent Q-learning Policies for Dynamic Treatment Regimes0
LoFi: Location-Aware Fine-Grained Representation Learning for Chest X-ray0
TrustFlow: Topic-Aware Vector Reputation Propagation for Multi-Agent Ecosystems0
Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL0
Show:102550
← PrevPage 36 of 13200Next →