SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 13011350 of 659983 papers

TitleStatusHype
Characterizing the onset and offset of motor imagery during passive arm movements induced by an upper-body exoskeleton0
Scene Graph-guided SegCaptioning Transformer with Fine-grained Alignment for Controllable Video Segmentation and Captioning0
Auto-differentiable data assimilation: Co-learning of states, dynamics, and filtering algorithms0
LLM Router: Prefill is All You Need0
Beyond the Birkhoff Polytope: Spectral-Sphere-Constrained Hyper-Connections0
The data heat island effect: quantifying the impact of AI data centers in a warming world0
Natural Gradient Descent for Online Continual Learning0
Mitigating Shortcut Reasoning in Language Models: A Gradient-Aware Training Approach0
The Hidden Puppet Master: A Theoretical and Real-World Account of Emotional Manipulation in LLMs0
Bayesian Scattering: A Principled Baseline for Uncertainty on Image Data0
LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models0
Do LLM-Driven Agents Exhibit Engagement Mechanisms? Controlled Tests of Information Load, Descriptive Norms, and Popularity Cues0
Enhancing LIME using Neural Decision Trees0
Democratizing AI: A Comparative Study in Deep Learning Efficiency and Future Trends in Computational Processing0
Discriminative Representation Learning for Clinical Prediction0
Profit is the Red Team: Stress-Testing Agents in Strategic Economic Interactions0
MOELIGA: a multi-objective evolutionary approach for feature selection with local improvement0
User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction0
gUFO: A Gentle Foundational Ontology for Semantic Web Knowledge Graphs0
Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge0
GraPHFormer: A Multimodal Graph Persistent Homology Transformer for the Analysis of Neuroscience Morphologies0
DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles0
Detection of adversarial intent in Human-AI teams using LLMs0
MERIT: Multi-domain Efficient RAW Image Translation0
Dodgersort: Uncertainty-Aware VLM-Guided Human-in-the-Loop Pairwise Ranking0
A Knowledge-Informed Pretrained Model for Causal Discovery0
HiCI: Hierarchical Construction-Integration for Long-Context Attention0
GOLDMARK: Governed Outcome-Linked Diagnostic Model Assessment Reference Kit0
Glove2Hand: Synthesizing Natural Hand-Object Interaction from Multi-Modal Sensing Gloves0
Can ChatGPT Really Understand Modern Chinese Poetry?0
SozKZ: Training Efficient Small Language Models for Kazakh from Scratch0
Ensemble of Small Classifiers For Imbalanced White Blood Cell Classification0
RoboECC: Multi-Factor-Aware Edge-Cloud Collaborative Deployment for VLA Models0
PlanaReLoc: Camera Relocalization in 3D Planar Primitives via Region-Based Structure Matching0
Achieving O(1/ε) Sample Complexity for Bilinear Systems Identification under Bounded Noises0
Improving Diffusion Generalization with Weak-to-Strong Segmented Guidance0
RECLAIM: Cyclic Causal Discovery Amid Measurement Noise0
Neural collapse in the orthoplex regime0
RayMap3R: Inference-Time RayMap for Dynamic 3D Reconstruction0
Generating from Discrete Distributions Using Diffusions: Insights from Random Constraint Satisfaction Problems0
Satellite-to-Street: Synthesizing Post-Disaster Views from Satellite Imagery via Generative Vision Models0
Clinical Cognition Alignment for Gastrointestinal Diagnosis with Multimodal LLMs0
mmWave-Diffusion:A Novel Framework for Respiration Sensing Using Observation-Anchored Conditional Diffusion Model0
Decoupling Numerical and Structural Parameters: An Empirical Study on Adaptive Genetic Algorithms via Deep Reinforcement Learning for the Large-Scale TSP0
NDT: Non-Differential Transformer and Its Application to Sentiment Analysis0
High-Quality and Efficient Turbulence Mitigation with Events0
Reasoning Topology Matters: Network-of-Thought for Complex Reasoning Tasks0
VSD-MOT: End-to-End Multi-Object Tracking in Low-Quality Video Scenes Guided by Visual Semantic Distillation0
MzansiText and MzansiLM: An Open Corpus and Decoder-Only Language Model for South African Languages0
SATTC: Structure-Aware Label-Free Test-Time Calibration for Cross-Subject EEG-to-Image Retrieval0
Show:102550
← PrevPage 27 of 13200Next →