SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 73017350 of 661570 papers

TitleStatusHype
VisDoT : Enhancing Visual Reasoning through Human-Like Interpretation Grounding and Decomposition of Thought0
Understanding Wikidata Qualifiers: An Analysis and Taxonomy0
IDRL: An Individual-Aware Multimodal Depression-Related Representation Learning Framework for Depression Diagnosis0
QChunker: Learning Question-Aware Text Chunking for Domain RAG via Multi-Agent Debate0
PROMO: Promptable Outfitting for Efficient High-Fidelity Virtual Try-On0
Stable Spike: Dual Consistency Optimization via Bitwise AND Operations for Spiking Neural Networks0
From Control to Foresight: Simulation as a New Paradigm for Human-Agent Collaboration0
Human Knowledge Integrated Multi-modal Learning for Single Source Domain Generalization0
LLMs can construct powerful representations and streamline sample-efficient supervised learning0
Entropy-Preserving Reinforcement Learning0
Causal Prosody Mediation for Text-to-Speech:Counterfactual Training of Duration, Pitch, and Energy in FastSpeech20
In the LLM era, Word Sense Induction remains unsolved0
SemBench: A Universal Semantic Framework for LLM Evaluation0
Just Use XML: Revisiting Joint Translation and Label Projection0
Explicit Logic Channel for Validation and Enhancement of MLLMs on Zero-Shot Tasks0
STAIRS-Former: Spatio-Temporal Attention with Interleaved Recursive Structure Transformer for Offline Multi-task Multi-agent Reinforcement Learning0
PolyCrysDiff: Controllable Generation of Three-Dimensional Computable Polycrystalline Material Structures0
OSCBench: Benchmarking Object State Change in Text-to-Video Generation0
Decomposing Observational Multiplicity in Decision Trees: Leaf and Structural Regret0
Scaling Laws for Educational AI Agents0
PicoSAM3: Real-Time In-Sensor Region-of-Interest Segmentation0
When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows0
Cross-Resolution Attention Network for High-Resolution PM2.5 Prediction0
Adapting Dijkstra for Buffers and Unlimited Transfers0
Modeling Trial-and-Error Navigation With a Sequential Decision Model of Information Scent0
VTEdit-Bench: A Comprehensive Benchmark for Multi-Reference Image Editing Models in Virtual Try-On0
CINDI: Conditional Imputation and Noisy Data Integrity with Flows in Power Grid Data0
Controllable Egocentric Video Generation via Occlusion-Aware Sparse 3D Hand Joints0
Anomaly detection in time-series via inductive biases in the latent space of conditional normalizing flows0
A Further Efficient Algorithm with Best-of-Both-Worlds Guarantees for m-Set Semi-Bandit Problem0
Governing Evolving Memory in LLM Agents: Risks, Mechanisms, and the Stability and Safety Governed Memory (SSGM) Framework0
Large Language Models for Biomedical Article Classification0
From Debate to Deliberation: Structured Collective Reasoning with Typed Epistemic Acts0
Language Generation with Replay: A Learning-Theoretic View of Model Collapse0
Locating Demographic Bias at the Attention-Head Level in CLIP's Vision Encoder0
Intrinsic Concept Extraction Based on Compositional Interpretability0
DocSage: An Information Structuring Agent for Multi-Doc Multi-Entity Question Answering0
Exponential-Family Membership Inference: From LiRA and RMIA to BaVarIA0
Automated Detection of Malignant Lesions in the Ovary Using Deep Learning Models and XAI0
OSM-based Domain Adaptation for Remote Sensing VLMs0
A Diffeomorphism Groupoid and Algebroid Framework for Discontinuous Image Registration0
PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents0
Inverse Neural Operator for ODE Parameter Optimization0
Multimodal classification of Radiation-Induced Contrast Enhancements and tumor recurrence using deep learning0
Towards High-Fidelity CAD Generation via LLM-Driven Program Generation and Text-Based B-Rep Primitive Grounding0
Hybrid Human-Agent Social Dilemmas in Energy Markets0
A Decade of Generative Adversarial Networks for Porous Material Reconstruction0
DatedGPT: Preventing Lookahead Bias in Large Language Models with Time-Aware Pretraining0
Bielik-Minitron-7B: Compressing Large Language Models via Structured Pruning and Knowledge Distillation for the Polish Language0
The Landscape of Generative AI in Information Systems: A Synthesis of Secondary Reviews and Research Agendas0
Show:102550
← PrevPage 147 of 13232Next →