SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 17151–17200 of 474278 papers

Title | Status | Hype
GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors | | 0
LEANN: A Low-Storage Vector Index | | 0
Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence | | 0
Reinforcement Learning from Human Feedback with High-Confidence Safety Constraints | Code | 0
Hierarchical Lexical Graph for Enhanced Multi-Hop Retrieval | Code | 3
Silencing Empowerment, Allowing Bigotry: Auditing the Moderation of Hate Speech on Twitch | Code | 0
Lite-RVFL: A Lightweight Random Vector Functional-Link Neural Network for Learning Under Concept Drift | Code | 0
BLUR: A Bi-Level Optimization Approach for LLM Unlearning | Code | 0
WWAggr: A Window Wasserstein-based Aggregation for Ensemble Change Point Detection | | 0
Federated Learning on Stochastic Neural Networks | | 0
GIQ: Benchmarking 3D Geometric Reasoning of Vision Foundation Models with Simulated and Real Polyhedra | | 0
Snap-and-tune: combining deep learning and test-time optimization for high-fidelity cardiovascular volumetric meshing | Code | 2
A Good CREPE needs more than just Sugar: Investigating Biases in Compositional Vision-Language Benchmarks | | 0
Temporalizing Confidence: Evaluation of Chain-of-Thought Reasoning with Signal Temporal Logic | | 0
FreeGave: 3D Physics Learning from Dynamic Videos by Gaussian Velocity | Code | 1
SWAT-NN: Simultaneous Weights and Architecture Training for Neural Networks in a Latent Space | Code | 0
UniVarFL: Uniformity and Variance Regularized Federated Learning for Heterogeneous Data | Code | 0
Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques | | 0
DLNet: Direction-Aware Feature Integration for Robust Lane Detection in Complex Environments | Code | 0
Discrete and Continuous Difference of Submodular Minimization | Code | 0
Evidential Spectrum-Aware Contrastive Learning for OOD Detection in Dynamic Graphs | Code | 0
Fractional-order Jacobian Matrix Differentiation and Its Application in Artificial Neural Networks | | 0
Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation | | 0
Decoding Saccadic Eye Movements from Brain Signals Using an Endovascular Neural Interface | | 0
SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems | | 0
Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding | Code | 1
Flowing Datasets with Wasserstein over Wasserstein Gradient Flows | Code | 1
Circumventing Backdoor Space via Weight Symmetry | Code | 0
ProtocolLLM: RTL Benchmark for SystemVerilog Generation of Communication Protocols | | 0
Multiple Object Stitching for Unsupervised Representation Learning | Code | 1
CausalPFN: Amortized Causal Effect Estimation via In-Context Learning | Code | 2
Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning | | 0
Theorem-of-Thought: A Multi-Agent Framework for Abductive, Deductive, and Inductive Reasoning in Language Models | Code | 0
Next-Generation Conflict Forecasting: Unleashing Predictive Patterns through Spatiotemporal Learning | | 0
Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification | Code | 1
Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning | | 0
Filling the Missings: Spatiotemporal Data Imputation by Conditional Diffusion | Code | 0
Paged Attention Meets FlexAttention: Unlocking Long-Context Efficiency in Deployed Inference | | 0
Overclocking LLM Reasoning: Monitoring and Controlling Thinking Path Lengths in LLMs | Code | 2
Representing Time-Continuous Behavior of Cyber-Physical Systems in Knowledge Graphs | | 0
MS-TVNet:A Long-Term Time Series Prediction Method Based on Multi-Scale Dynamic Convolution | Code | 0
Latency Optimization for Wireless Federated Learning in Multihop Networks | Code | 0
FLAIR-HUB: Large-scale Multimodal Dataset for Land Cover and Crop Mapping | | 0
Manifesto from Dagstuhl Perspectives Workshop 24352 -- Conversational Agents: A Framework for Evaluation (CAFE) | | 0
SDE-SQL: Enhancing Text-to-SQL Generation in Large Language Models via Self-Driven Exploration with SQL Probes | | 0
Reliable Critics: Monotonic Improvement and Convergence Guarantees for Reinforcement Learning | | 0
AnnoDPO: Protein Functional Annotation Learning with Direct Preference Optimization | Code | 0
Joint Channel and Symbol Estimation for Communication Systems with Movable Antennas | | 0
Transfer Learning and Explainable AI for Brain Tumor Classification: A Study Using MRI Data from Bangladesh | | 0
Simultaneous Segmentation of Ventricles and Normal/Abnormal White Matter Hyperintensities in Clinical MRI using Deep Learning | | 0
Page 344 of 9486