SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 19451-19500 of 474278 papers

Title | Status | Hype
Acoustic Classification of Maritime Vessels using Learnable Filterbanks | Code | 0
Can Emotion Fool Anti-spoofing? | | 0
A New Deep-learning-Based Approach For mRNA Optimization: High Fidelity, Computation Efficiency, and Multiple Optimization Factors | Code | 0
DSR-Bench: Evaluating the Structural Reasoning Abilities of LLMs via Data Structures | Code | 0
Hidden Persuasion: Detecting Manipulative Narratives on Social Media During the 2022 Russian Invasion of Ukraine | | 0
Diversity of Transformer Layers: One Aspect of Parameter Scaling Laws | | 0
Using Reasoning Models to Generate Search Heuristics that Solve Open Instances of Combinatorial Design Problems | Code | 0
Mamba Integrated with Physics Principles Masters Long-term Chaotic System Forecasting | Code | 0
OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation | Code | 2
Grounded Reinforcement Learning for Visual Reasoning | | 0
Table-R1: Inference-Time Scaling for Table Reasoning | Code | 1
DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration | Code | 1
Jigsaw-R1: A Study of Rule-based Visual Reinforcement Learning with Jigsaw Puzzles | Code | 1
ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind | Code | 1
Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition | Code | 1
CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents | Code | 0
TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context Learning | Code | 3
K^2VAE: A Koopman-Kalman Enhanced Variational AutoEncoder for Probabilistic Time Series Forecasting | Code | 1
ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering | Code | 2
QLIP: A Dynamic Quadtree Vision Prior Enhances MLLM Performance Without Retraining | Code | 0
DA-VPT: Semantic-Guided Visual Prompt Tuning for Vision Transformers | Code | 1
Bayesian Optimization from Human Feedback: Near-Optimal Regret Bounds | | 0
Sentinel: Attention Probing of Proxy Models for LLM Context Compression with an Understanding Perspective | Code | 1
Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation | Code | 0
Measuring Participant Contributions in Decentralized Federated Learning | | 0
AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views | | 0
Query Routing for Retrieval-Augmented Language Models | | 0
Matryoshka Model Learning for Improved Elastic Student Models | | 0
Graph Positional Autoencoders as Self-supervised Learners | | 0
Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's Math Capability | | 0
VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration | Code | 0
DiCoFlex: Model-agnostic diverse counterfactuals with flexible control | | 0
LoLA: Low-Rank Linear Attention With Sparse Caching | | 0
Qwen Look Again: Guiding Vision-Language Reasoning Models to Re-attention Visual Information | Code | 0
Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models | Code | 1
Meta-Learning Approaches for Speaker-Dependent Voice Fatigue Models | | 0
X2Graph for Cancer Subtyping Prediction on Biological Tabular Data | | 0
Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action Models | Code | 3
A Practical Approach for Building Production-Grade Conversational Agents with Workflow Graphs | | 0
Conceptual Framework Toward Embodied Collective Adaptive Intelligence | | 0
EmoBench-UA: A Benchmark Dataset for Emotion Detection in Ukrainian | | 0
Dynamic Spectral Backpropagation for Efficient Neural Network Training | | 0
Radiant Triangle Soup with Soft Connectivity Forces for 3D Reconstruction and Novel View Synthesis | | 0
Gradient Boosting Decision Tree with LSTM for Investment Prediction | | 0
Adaptive Spatial Augmentation for Semi-supervised Semantic Segmentation | | 0
CryoCCD: Conditional Cycle-consistent Diffusion with Biophysical Modeling for Cryo-EM Synthesis | | 0
LAFR: Efficient Diffusion-based Blind Face Restoration via Latent Codebook Alignment Adapter | | 0
R2I-Bench: Benchmarking Reasoning-Driven Text-to-Image Generation | | 0
OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data | | 0
CLIP-AE: CLIP-assisted Cross-view Audio-Visual Enhancement for Unsupervised Temporal Action Localization | | 0