SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 57515800 of 661570 papers

TitleStatusHype
Two Birds, One Projection: Harmonizing Safety and Utility in LVLMs via Inference-time Feature Projection0
Spectrogram features for audio and speech analysis0
Bayesian Inference for Missing Physics0
A Score Filter Enhanced Data Assimilation Framework for Data-Driven Dynamical Systems0
MMSpec: Benchmarking Speculative Decoding for Vision-Language Models0
SmartSearch: How Ranking Beats Structure for Conversational Memory Retrieval0
Directional Routing in Transformers0
A PPO-Based Bitrate Allocation Conditional Diffusion Model for Remote Sensing Image Compression0
A Hybrid Modeling Framework for Crop Prediction Tasks via Dynamic Parameter Calibration and Multi-Task Learning0
Federated Learning of Binary Neural Networks: Enabling Low-Cost Inference0
Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech0
Information Asymmetry across Language Varieties: A Case Study on Cantonese-Mandarin and Bavarian-German QA0
Generating solution paths of Markovian stochastic differential equations using diffusion models0
Frame Sampling Strategies Matter: A Benchmark for small vision language models0
Bayesian Optimization with Gaussian Processes to Accelerate Stationary Point Searches0
Ego to World: Collaborative Spatial Reasoning in Embodied Systems via Reinforcement Learning0
Face-to-Face: A Video Dataset for Multi-Person Interaction Modeling0
Personalized Federated Learning with Residual Fisher Information for Medical Image Segmentation0
Spatio-temporal probabilistic forecast using MMAF-guided learning0
Morphemes Without Borders: Evaluating Root-Pattern Morphology in Arabic Tokenizers and LLMs0
Low-light Image Enhancement with Retinex Decomposition in Latent Space0
Safe Flow Q-Learning: Offline Safe Reinforcement Learning with Reachability-Based Flow Policies0
SSR: A Training-Free Approach for Streaming 3D Reconstruction0
Voronoi-based Second-order Descriptor with Whitened Metric in LiDAR Place Recognition0
Why Agents Compromise Safety Under Pressure0
PAKAN: Pixel Adaptive Kolmogorov-Arnold Network Modules for Pansharpening0
Effective Distillation to Hybrid xLSTM Architectures0
A proof-of-concept for automated AI-driven stellarator coil optimization with in-the-loop finite-element calculations0
PYTHEN: A Flexible Framework for Legal Reasoning in Python0
sim2art: Accurate Articulated Object Modeling from a Single Video using Synthetic Training Data Only0
Ultra-Early Prediction of Tipping Points: Integrating Dynamical Measures with Reservoir Computing0
Edit2Interp: Adapting Image Foundation Models from Spatial Editing to Video Frame Interpolation with Few-Shot Learning0
Evaluating Black-Box Vulnerabilities with Wasserstein-Constrained Data Perturbations0
RadAnnotate: Large Language Models for Efficient and Reliable Radiology Report Annotation0
Fine-tuning RoBERTa for CVE-to-CWE Classification: A 125M Parameter Model Competitive with LLMs0
Is Human Annotation Necessary? Iterative MBR Distillation for Error Span Detection in Machine Translation0
Eyes on Target: Gaze-Aware Object Detection in Egocentric Video0
Survey of Various Fuzzy and Uncertain Decision-Making Methods0
EvoIQA - Explaining Image Distortions with Evolved White-Box Logic0
Multi-Agent LLMs for Generating Research Limitations0
DeFRiS: Silo-Cooperative IoT Applications Scheduling via Decentralized Federated Reinforcement Learning0
Panoramic Affordance Prediction1
Exemplar Diffusion: Improving Medical Object Detection with Opportunistic LabelsCode0
IFNSO: Iteration-Free Newton-Schulz OrthogonalizationCode0
VGGT-Long: Chunk it, Loop it, Align it -- Pushing VGGT's Limits on Kilometer-scale Long RGB SequencesCode0
HieraRS: A Hierarchical Segmentation Paradigm for Remote Sensing Enabling Multi-Granularity Interpretation and Cross-Domain TransferCode0
RESCUE: Retrieval Augmented Secure Code GenerationCode0
PAT: Accelerating LLM Decoding via Prefix-Aware Attention with Resource Efficient Multi-Tile KernelCode0
How (Mis)calibrated is Your Federated CLIP and What To Do About It?Code0
AdapterTune: Zero-Initialized Low-Rank Adapters for Frozen Vision TransformersCode0
Show:102550
← PrevPage 116 of 13232Next →