SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 60016050 of 661570 papers

TitleStatusHype
Selective Fine-Tuning of GPT Architectures for Parameter-Efficient Clinical Text Classification0
Deeper Thought, Weaker Aim: Understanding and Mitigating Perceptual Impairment during Reasoning in Multimodal Large Language Models0
Fair Benchmarking of Emerging One-Step Generative Models Against Multistep Diffusion and Flow Models0
Deep Learning From Routine Histology Improves Risk Stratification for Biochemical Recurrence in Prostate Cancer0
DualTSR: Unified Dual-Diffusion Transformer for Scene Text Image Super-Resolution0
ChArtist: Generating Pictorial Charts with Unified Spatial and Subject Control0
Memory as Asset: From Agent-centric to Human-centric Memory Management0
Interleaved Resampling and Refitting: Data and Compute-Efficient Evaluation of Black-Box Predictors0
FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection0
A Real-Time Neuro-Symbolic Ethical Governor for Safe Decision Control in Autonomous Robotic Manipulation0
BIT: Matching-based Bi-directional Interaction Transformation Network for Visible-Infrared Person Re-Identification0
Membership Inference for Contrastive Pre-training Models with Text-only PII Queries0
Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys0
Not All Directions Matter: Toward Structured and Task-Aware Low-Rank Adaptation0
QiMeng-CodeV-SVA: Training Specialized LLMs for Hardware Assertion Generation via RTL-Grounded Bidirectional Data Synthesis0
FOCUS: Bridging Fine-Grained Recognition and Open-World Discovery across Domains0
CamLit: Unified Video Diffusion with Explicit Camera and Lighting Control0
OAHuman: Occlusion-Aware 3D Human Reconstruction from Monocular Images0
Sampling Boltzmann distributions via normalizing flow approximation of transport maps0
Mitigating Overthinking in Large Reasoning Language Models via Reasoning Path Deviation Monitoring0
MedPriv-Bench: Benchmarking the Privacy-Utility Trade-off of Large Language Models in Medical Open-End Question Answering0
Learning in Function Spaces: An Unified Functional Analytic View of Supervised and Unsupervised Learning0
Controllable Accent Normalization via Discrete Diffusion0
All-day Multi-scenes Lifelong Vision-and-Language Navigation with Tucker Adaptation0
DC-ViT: Modulating Spatial and Channel Interactions for Multi-Channel Images0
Show Me When and Where: Towards Referring Video Object Segmentation in the Wild0
4D Synchronized Fields: Motion-Language Gaussian Splatting for Temporal Scene Understanding0
SemantiCache: Efficient KV Cache Compression via Semantic Chunking and Clustered Merging0
In-Field 3D Wheat Head Instance Segmentation From TLS Point Clouds Using Deep Learning Without Manual Labels0
Mind the Shift: Decoding Monetary Policy Stance from FOMC Statements with Large Language Models0
Enhancing LLM Training via Spectral Clipping0
Histo-MExNet: A Unified Framework for Real-World, Cross-Magnification, and Trustworthy Breast Cancer Histopathology0
Direct Object-Level Reconstruction via Probabilistic Gaussian Splatting0
Structure-Dependent Regret and Constraint Violation Bounds for Online Convex Optimization with Time-Varying Constraints0
How Do Medical MLLMs Fail? A Study on Visual Grounding in Medical Images0
Fundamental Limits of CSI Compression in FDD Massive MIMO0
UAVBench and UAVIT-1M: Benchmarking and Enhancing MLLMs for Low-Altitude UAV Vision-Language Understanding0
On the Nature of Attention Sink that Shapes Decoding Strategy in MLLMs0
AgroNVILA: Perception-Reasoning Decoupling for Multi-view Agricultural Multimodal Large Language Models0
Refold: Refining Protein Inverse Folding with Efficient Structural Matching and Fusion0
Deconfounded Lifelong Learning for Autonomous Driving via Dynamic Knowledge Spaces0
Exposing Long-Tail Safety Failures in Large Language Models through Efficient Diverse Response Sampling0
M^2RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling0
AerialVLA: A Vision-Language-Action Model for UAV Navigation via Minimalist End-to-End Control0
From Specification to Architecture: A Theory Compiler for Knowledge-Guided Machine Learning0
Contests with Spillovers: Incentivizing Content Creation with GenAI0
The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics0
StAR: Segment Anything Reasoner0
WestWorld: A Knowledge-Encoded Scalable Trajectory World Model for Diverse Robotic Systems0
PGcGAN: Pathological Gait-Conditioned GAN for Human Gait Synthesis0
Show:102550
← PrevPage 121 of 13232Next →