SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 69517000 of 661570 papers

TitleStatusHype
Deferred is Better: A Framework for Multi-Granularity Deferred Interaction of Heterogeneous Features0
Self-Supervised Speech Models Encode Phonetic Context via Position-dependent Orthogonal Subspaces0
SteerRM: Debiasing Reward Models via Sparse Autoencoders0
Residual SODAP: Residual Self-Organizing Domain-Adaptive Prompting with Structural Knowledge Preservation for Continual Learning0
Spectral-Geometric Neural Fields for Pose-Free LiDAR View Synthesis0
MoKus: Leveraging Cross-Modal Knowledge Transfer for Knowledge-Aware Concept Customization0
MalURLBench: A Benchmark Evaluating Agents' Vulnerabilities When Processing Web URLsCode0
Deep Distance Measurement Method for Unsupervised Multivariate Time Series Similarity Retrieval0
AutoClimDS: Climate Data Science Agentic AI -- A Knowledge Graph is All You Need0
Visual Alignment of Medical Vision-Language Models for Grounded Radiology Report Generation0
From Formal Language Theory to Statistical Learning: Finite Observability of Subregular LanguagesCode0
UniPrompt-CL: Sustainable Continual Learning in Medical AI with Unified Prompt Pools0
FSDAM: Few-Shot Driving Attention Modeling via Vision-Language Coupling0
LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning1
DynVLA: Learning World Dynamics for Action Reasoning in Autonomous Driving0
SDF-Net: Structure-Aware Disentangled Feature Learning for Opticall-SAR Ship Re-identificationCode0
Literary Narrative as Moral Probe : A Cross-System Framework for Evaluating AI Ethical Reasoning and Refusal Behavior0
When Drafts Evolve: Speculative Decoding Meets Online Learning0
Stake the Points: Structure-Faithful Instance Unlearning0
Purify Once, Edit Freely: Breaking Image Protections under Model Mismatch0
A Method for Learning Large-Scale Computational Construction Grammars from Semantically Annotated Corpora0
PISE: Physics-Anchored Semantically-Enhanced Deep Computational Ghost Imaging for Robust Low-Bandwidth Machine Perception0
MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction0
Spectral Defense Against Resource-Targeting Attack in 3D Gaussian Splatting0
Long-form RewardBench: Evaluating Reward Models for Long-form Generation0
EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning0
How to Build a Quantum Supercomputer: Scaling from Hundreds to Millions of Qubits0
Multimodal Continual Learning with MLLMs from Multi-scenario Perspectives0
Do You See What I Am Pointing At? Gesture-Based Egocentric Video Question Answering0
Towards unified brain-to-text decoding across speech production and perception0
Seeing Eye to Eye: Enabling Cognitive Alignment Through Shared First-Person Perspective in Human-AI Collaboration0
VCBench: A Streaming Counting Benchmark for Spatial-Temporal State Maintenance in Long Videos0
Design-Specification Tiling for ICL-based CAD Code Generation0
SciDesignBench: Benchmarking and Improving Language Models for Scientific Inverse Design0
Graph In-Context Operator Networks for Generalizable Spatiotemporal Prediction0
On Using Machine Learning to Early Detect Catastrophic Failures in Marine Diesel Engines0
SLICE: Semantic Latent Injection via Compartmentalized Embedding for Image Watermarking0
SAP: Segment Any 4K Panorama0
PVI: Plug-in Visual Injection for Vision-Language-Action Models0
A New Kernel Regularity Condition for Distributed Mirror Descent: Broader Coverage and Simpler Analysis0
SAVA-X: Ego-to-Exo Imitation Error Detection via Scene-Adaptive View Alignment and Bidirectional Cross View FusionCode0
Upper Bounds for Local Learning Coefficients of Three-Layer Neural Networks0
NanoVDR: Distilling a 2B Vision-Language Retriever into a 70M Text-Only Encoder for Visual Document Retrieval0
coDrawAgents: A Multi-Agent Dialogue Framework for Compositional Image Generation0
CLARIN-PT-LDB: An Open LLM Leaderboard for Portuguese to assess Language, Culture and Civility0
HMS-BERT: Hybrid Multi-Task Self-Training for Multilingual and Multi-Label Cyberbullying Detection0
Efficient and Interpretable Multi-Agent LLM Routing via Ant Colony Optimization0
MotionAnymesh: Physics-Grounded Articulation for Simulation-Ready Digital Twins0
SGMatch: Semantic-Guided Non-Rigid Shape Matching with Flow Regularization0
ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning0
Show:102550
← PrevPage 140 of 13232Next →