SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1170111750 of 661570 papers

TitleStatusHype
Maximizing Generalization: The Effect of Different Augmentation Techniques on Lightweight Vision Transformer for Bengali Character Classification0
Low-Degree Method Fails to Predict Robust Subspace Recovery0
Lightweight Transformer for EEG Classification via Balanced Signed Graph Algorithm Unrolling0
Characterizing the Multiclass Learnability of Forgiving 0-1 Loss Functions0
Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model0
Robust Weight Imprinting: Insights from Neural Collapse and Proxy-Based AggregationCode0
Optimizing Data Augmentation through Bayesian Model Selection0
CLEAR: Calibrated Learning for Epistemic and Aleatoric Risk0
Interaction Field Matching: Overcoming Limitations of Electrostatic ModelsCode0
You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Models0
SceneStreamer: Continuous Scenario Generation as Next Token Group Prediction0
Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators0
CoBELa: Steering Transparent Generation via Concept Bottlenecks on Energy Landscapes0
InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation2
Not All Errors Are Created Equal: ASCoT Addresses Late-Stage Fragility in Efficient LLM Reasoning0
Nonparametric Reaction Coordinate Optimization with Histories: A Framework for Rare Event Dynamics0
Link Prediction for Event Logs in the Process Industry0
SiNGER: A Clearer Voice Distills Vision Transformers Further0
Zero-shot CT Super-Resolution using Diffusion-based 2D Projection Priors and Signed 3D Gaussians0
ConEQsA: Concurrent and Asynchronous Embodied Questions Scheduling and Answering0
Learning Acrobatic Flight from Preferences0
No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes0
ScaleDoc: Scaling LLM-based Predicates over Large Document Collections0
Enhancing Generative Auto-bidding with Offline Reward Evaluation and Policy Search0
Are VLMs Ready for Lane Topology Awareness in Autonomous Driving?0
Fast Estimation of Wasserstein Distances via Regression on Sliced Wasserstein Distances0
Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective0
Death of the Novel(ty): Beyond n-Gram Novelty as a Metric for Textual Creativity0
Are Language Models Borrowing-Blind? A Multilingual Evaluation of Loanword Identification across 10 Languages0
MedLA: A Logic-Driven Multi-Agent Framework for Complex Medical Reasoning with Large Language Models0
Proxy-GS: Unified Occlusion Priors for Training and Inference in Structured 3D Gaussian Splatting0
BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration0
Arbitrary Generative Video Interpolation0
ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs0
Audio-sync Video Instance Editing with Granularity-Aware Mask Refiner0
Fine-Tuning Diffusion Models via Intermediate Distribution Shaping0
LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning0
Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability0
Mitigating Over-Refusal in Aligned Large Language Models via Inference-Time Activation Energy0
MIRAGE: Runtime Scheduling for Multi-Vector Image Retrieval with Hierarchical Decomposition0
Reasoning as Representation: Rethinking Visual Reinforcement Learning in Image Quality Assessment0
Reducing Belief Deviation in Reinforcement Learning for Active Reasoning0
Are We Asking the Right Questions? On Ambiguity in Natural Language Queries for Tabular Data Analysis0
The Implicit Bias of Adam and Muon on Smooth Homogeneous Neural Networks0
Secure Sparse Matrix Multiplications and their Applications to Privacy-Preserving Machine Learning0
Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation Approach0
Online Data Curation for Object Detection via Marginal Contributions to Dataset-level Average Precision0
Kinematify: Open-Vocabulary Synthesis of High-DoF Articulated Objects0
Echoing: Identity Failures when LLM Agents Talk to Each Other0
Markovian Scale Prediction: A New Era of Visual Autoregressive Generation0
Show:102550
← PrevPage 235 of 13232Next →