SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 15151-15200 of 474278 papers

Title | Status | Hype
Excessive Reasoning Attack on Reasoning LLMs | - | 0
LLM-Powered Intent-Based Categorization of Phishing Emails | - | 0
AIn't Nothing But a Survey? Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation | - | 0
One-Shot Neural Architecture Search with Network Similarity Directed Initialization for Pathological Image Classification | - | 0
Computational Studies in Influencer Marketing: A Systematic Literature Review | - | 0
One Size Fits None: Rethinking Fairness in Medical AI | - | 0
A multi-stage augmented multimodal interaction network for fish feeding intensity quantification | - | 0
ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies | - | 0
ELLIS Alicante at CQs-Gen 2025: Winning the critical thinking questions shared task: LLM-based question generation and selection | - | 0
ELI-Why: Evaluating the Pedagogical Utility of Language Model Explanations | - | 0
StorySage: Conversational Autobiography Writing Powered by a Multi-Agent Framework | - | 0
Fretting-Transformer: Encoder-Decoder Model for MIDI to Tablature Transcription | - | 0
CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion | - | 0
Markov Regime-Switching Intelligent Driver Model for Interpretable Car-Following Behavior | - | 0
DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning | - | 0
SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning | - | 0
VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy | - | 0
Adaptive Reinforcement Learning for Unobservable Random Delays | - | 0
AMPLIFY: Actionless Motion Priors for Robot Learning from Videos | - | 0
Hard Contacts with Soft Gradients: Refining Differentiable Simulators for Learning and Control | - | 0
GAF: Gaussian Action Field as a Dynamic World Model for Robotic Manipulation | - | 0
Unifying Streaming and Non-streaming Zipformer-based ASR | - | 0
InsertRank: LLMs can reason over BM25 scores to Improve Listwise Reranking | - | 0
RAGtifier: Evaluating RAG Generation Approaches of State-of-the-Art RAG Systems for the SIGIR LiveRAG Competition | - | 0
Similarity = Value? Consultation Value Assessment and Alignment for Personalized Search | - | 0
ImpReSS: Implicit Recommender System for Support Conversations | - | 0
A Vision for Geo-Temporal Deep Research Systems: Towards Comprehensive, Transparent, and Reproducible Geo-Temporal Information Synthesis | - | 0
FADPNet: Frequency-Aware Dual-Path Network for Face Super-Resolution | - | 0
Meta-SurDiff: Classification Diffusion Model Optimized by Meta Learning is Reliable for Online Surgical Phase Recognition | - | 0
HRGS: Hierarchical Gaussian Splatting for Memory-Efficient High-Resolution 3D Reconstruction | - | 0
Unified Representation Space for 3D Visual Grounding | - | 0
Exploring Non-contrastive Self-supervised Representation Learning for Image-based Profiling | - | 0
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment | - | 0
Discrete JEPA: Learning Discrete Token Representations without Reconstruction | - | 0
DepthSeg: Depth prompting in remote sensing semantic segmentation | - | 0
GrFormer: A Novel Transformer on Grassmann Manifold for Infrared and Visible Image Fusion | - | 0
Compositional Attribute Imbalance in Vision Datasets | - | 0
MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models | - | 0
AsyncSwitch: Asynchronous Text-Speech Adaptation for Code-Switched ASR | - | 0
Enhancing Symbolic Machine Learning by Subsymbolic Representations | - | 0
SLEEPING-DISCO 9M: A large-scale pre-training dataset for generative music modeling | - | 0
Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios | - | 0
Comparison of Two Methods for Stationary Incident Detection Based on Background Image | - | 0
Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language Models | Code | 0
Busting the Paper Ballot: Voting Meets Adversarial Machine Learning | Code | 0
hyperFA*IR: A hypergeometric approach to fair rankings with finite candidate pool | Code | 0
Déjà Vu: Efficient Video-Language Query Engine with Learning-based Inter-Frame Computation Reuse | Code | 1
EVA02-AT: Egocentric Video-Language Understanding with Spatial-Temporal Rotary Positional Embeddings and Symmetric Optimization | Code | 0
HydroChronos: Forecasting Decades of Surface Water Change | Code | 0
23 Ways to Contact How Do I Talk to Someone at Expedia®: A Step-by-Step Guide | - | 0