SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1200112050 of 661570 papers

TitleStatusHype
Real-time tightly coupled GNSS and IMU integration via Factor Graph Optimization0
Role-Aware Conditional Inference for Spatiotemporal Ecosystem Carbon Flux Prediction0
Trade-offs in Ensembling, Merging and Routing Among Parameter-Efficient Experts0
SafeCRS: Personalized Safety Alignment for LLM-Based Conversational Recommender Systems0
Online Learnability of Chain-of-Thought Verifiers: Soundness and Completeness Trade-offs0
RAG-X: Systematic Diagnosis of Retrieval-Augmented Generation for Medical Question Answering0
Tucano 2 Cool: Better Open Source LLMs for Portuguese0
PinCLIP: Large-scale Foundational Multimodal Representation at Pinterest0
Real-time loosely coupled GNSS and IMU integration via Factor Graph Optimization0
Modeling Cross-vision Synergy for Unified Large Vision Model0
Build, Judge, Optimize: A Blueprint for Continuous Improvement of Multi-Agent Consumer Assistants0
Confidence-aware Monocular Depth Estimation for Minimally Invasive Surgery0
Transport Clustering: Solving Low-Rank Optimal Transport via Clustering0
Spectrum Shortage for Radio Sensing? Leveraging Ambient 5G Signals for Human Activity Detection0
ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer0
Hazard-Aware Traffic Scene Graph Generation0
Controllable Generative Sandbox for Causal Inference0
Social Norm Reasoning in Multimodal Language Models: An Evaluation0
SENTINEL: Stagewise Integrity Verification for Pipeline Parallel Decentralized Training0
Infinite dimensional generative sensing0
From Local Matches to Global Masks: Novel Instance Detection in Open-World Scenes0
Joint Training Across Multiple Activation Sparsity Regimes0
Safety Verification of Wait-Only Non-Blocking Broadcast Protocols0
Learning to Weigh Waste: A Physics-Informed Multimodal Fusion Framework and Large-Scale Dataset for Commercial and Industrial Applications0
From Complex Dynamics to DynFormer: Rethinking Transformers for PDEs0
PRIVATEEDIT: A Privacy-Preserving Pipeline for Face-Centric Generative Image EditingCode0
Designing UNICORN: a Unified Benchmark for Imaging in Computational Pathology, Radiology, and Natural Language0
TagaVLM: Topology-Aware Global Action Reasoning for Vision-Language Navigation0
TC-Padé: Trajectory-Consistent Padé Approximation for Diffusion Acceleration0
Navigating with Annealing Guidance Scale in Diffusion Space0
MiM-DiT: MoE in MoE with Diffusion Transformers for All-in-One Image Restoration0
Graph Homomorphism Distortion: A Metric to Distinguish Them All and in the Latent Space Bind Them0
Chain of World: World Model Thinking in Latent Motion1
Generative adversarial imitation learning for robot swarms: Learning from human demonstrations and trained policies0
ScribeTokens: Fixed-Vocabulary Tokenization of Digital Ink0
Implicit Bias in Deep Linear Discriminant Analysis0
Impact of Localization Errors on Label Quality for Online HD Map Construction0
[Re] FairDICE: A Gap Between Theory And Practice0
ChemFlow:A Hierarchical Neural Network for Multiscale Representation Learning in Chemical Mixtures0
Even Faster Kernel Matrix Linear Algebra via Density Estimation0
Linear Model Extraction via Factual and Counterfactual Queries0
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization2
Thermodynamic Regulation of Finite-Time Gibbs Training in Energy-Based Models: A Restricted Boltzmann Machine Study0
PhyPrompt: RL-based Prompt Refinement for Physically Plausible Text-to-Video Generation0
Boosted Trees on a Diet: Compact Models for Resource-Constrained Devices0
No Text Needed: Forecasting MT Quality and Inequity from Fertility and Metadata0
FAST: Topology-Aware Frequency-Domain Distribution Matching for Coreset Selection0
MA-CoNav: A Master-Slave Multi-Agent Framework with Hierarchical Collaboration and Dual-Level Reflection for Long-Horizon Embodied VLN0
Can LLMs Discern the Traits Influencing Your Preferences? Evaluating Personality-Driven Preference Alignment in LLMs0
Training-Free Multi-Concept Image Editing0
Show:102550
← PrevPage 241 of 13232Next →