SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 1551-1600 of 659,983 papers

Title | Status | Hype
RiboSphere: Learning Unified and Efficient Representations of RNA Structures | | 0
UniBioTransfer: A Unified Framework for Multiple Biometrics Transfer | | 0
HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning | | 0
OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework | | 0
Heavy-Tailed and Long-Range Dependent Noise in Stochastic Approximation: A Finite-Time Analysis | | 0
PolicySim: An LLM-Based Agent Social Simulation Sandbox for Proactive Policy Optimization | | 0
Ensembles-based Feature Guided Analysis | | 0
GravCal: Single-Image Calibration of IMU Gravity Priors with Per-Sample Confidence | | 0
Model Selection and Parameter Estimation of Multi-dimensional Gaussian Mixture Model | | 0
CS-MUNet: A Channel-Spatial Dual-Stream Mamba Network for Multi-Organ Segmentation | | 0
The Residual Stream Is All You Need: On the Redundancy of the KV Cache in Transformer Inference | Code | 0
Toward High-Fidelity Visual Reconstruction: From EEG-Based Conditioned Generation to Joint-Modal Guided Rebuilding | | 0
Structured Prompting for Arabic Essay Proficiency: A Trait-Centric Evaluation Approach | | 0
Scale-Dependent Radial Geometry and Metric Mismatch in Wasserstein Propagation for Reverse Diffusion | | 0
Making Video Models Adhere to User Intent with Minor Adjustments | | 0
ATHENA: Adaptive Test-Time Steering for Improving Count Fidelity in Diffusion Models | | 0
GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems | | 0
WorldAgents: Can Foundation Image Models be Agents for 3D World Models? | | 0
3D Gaussian Splatting with Self-Constrained Priors for High Fidelity Surface Reconstruction | | 0
Ontology-Based Knowledge Modeling and Uncertainty-Aware Outdoor Air Quality Assessment Using Weighted Interval Type-2 Fuzzy Logic | | 0
TSegAgent: Zero-Shot Tooth Segmentation via Geometry-Aware Vision-Language Agents | | 0
Diminishing Returns in Expanding Generative Models and Godel-Tarski-Lob Limits | | 0
DataProphet: Demystifying Supervision Data Generalization in Multimodal LLMs | | 0
A Unified Phase-native Computational Principle Governs Hippocampal Spike Timing and Neural Coding | | 0
Minimax and Adaptive Covariance Matrix Estimation under Differential Privacy | | 0
EvoTaxo: Building and Evolving Taxonomy from Social Media Streams | | 0
Learning from Similarity/Dissimilarity and Pairwise Comparison | | 0
LoopRPT: Reinforcement Pre-Training for Looped Language Models | | 0
Stepwise: Neuro-Symbolic Proof Search for Automated Systems Verification | | 0
FedRG: Unleashing the Representation Geometry for Federated Learning with Noisy Clients | | 0
PerformRecast: Expression and Head Pose Disentanglement for Portrait Video Editing | | 0
PoC: Performance-oriented Context Compression for Large Language Models via Performance Prediction | | 0
A two-step sequential approach for hyperparameter selection in finite context models | | 0
MOSS-TTSD: Text to Spoken Dialogue Generation | | 0
FedPDPO: Federated Personalized Direct Preference Optimization for Large Language Model Alignment | | 0
Dual Path Attribution: Efficient Attribution for SwiGLU-Transformers through Layer-Wise Target Propagation | | 0
Rethinking Ground Truth: A Case Study on Human Label Variation in MLLM Benchmarking | | 0
PhysNeXt: Next-Generation Dual-Branch Structured Attention Fusion Network for Remote Photoplethysmography Measurement | | 0
Uncertainty-aware Prototype Learning with Variational Inference for Few-shot Point Cloud Segmentation | | 0
Growing Networks with Autonomous Pruning | | 0
PCSTracker: Long-Term Scene Flow Estimation for Point Cloud Sequences | | 0
FREAK: A Fine-grained Hallucination Evaluation Benchmark for Advanced MLLMs | | 0
FlashCap: Millisecond-Accurate Human Motion Capture via Flashing LEDs and Event-Based Vision | | 0
Neither Here Nor There: Cross-Lingual Representation Dynamics of Code-Mixed Text in Multilingual Encoders | | 0
Template-based Object Detection Using a Foundation Model | | 0
Evaluating Image Editing with LLMs: A Comprehensive Benchmark and Intermediate-Layer Probing Approach | | 0
Embodied Science: Closing the Discovery Loop with Agentic Embodied AI | | 0
Learning Hierarchical Orthogonal Prototypes for Generalized Few-Shot 3D Point Cloud Segmentation | | 0
Decoupled Sensitivity-Consistency Learning for Weakly Supervised Video Anomaly Detection | Code | 0
From Plausibility to Verifiability: Risk-Controlled Generative OCR for Vision-Language Models | | 0
Page 32 of 13200