SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 15601–15650 of 474278 papers

Title | Status | Hype
A Review of the Long Horizon Forecasting Problem in Time Series Analysis | Code | 0
The Synthetic Mirror -- Synthetic Data at the Age of Agentic AI | - | 0
Serving Large Language Models on Huawei CloudMatrix384 | - | 0
Using Neurogram Similarity Index Measure (NSIM) to Model Hearing Loss and Cochlear Neural Degeneration | - | 0
Bridging Data-Driven and Physics-Based Models: A Consensus Multi-Model Kalman Filter for Robust Vehicle State Estimation | - | 0
SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition | - | 0
Homeostatic Coupling for Prosocial Behavior | - | 0
MaskPro: Linear-Space Probabilistic Learning for Strict (N:M)-Sparsity on Large Language Models | Code | 0
SciSage: A Multi-Agent Framework for High-Quality Scientific Survey Generation | - | 0
Differentially Private Bilevel Optimization: Efficient Algorithms with Near-Optimal Rates | - | 0
PDEfuncta: Spectrally-Aware Neural Representation for PDE Solution Modeling | - | 0
KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills | - | 0
Zero-shot denoising via neural compression: Theoretical and algorithmic framework | Code | 0
iDiT-HOI: Inpainting-based Hand Object Interaction Reenactment via Video Diffusion Transformer | - | 0
Magnetoencephalography (MEG) Based Non-Invasive Chinese Speech Decoding | Code | 0
BeyondRPC: A Contrastive and Augmentation-Driven Framework for Robust Point Cloud Understanding | Code | 0
TCANet: A Temporal Convolutional Attention Network for Motor Imagery EEG Decoding | Code | 1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Code | 1
Focusing on Tracks for Online Multi-Object Tracking | Code | 2
MM-R5: MultiModal Reasoning-Enhanced ReRanker via Reinforcement Learning for Document Retrieval | Code | 0
Structural feature enhanced transformer for fine-grained image recognition | - | 0
CORONA: A Coarse-to-Fine Framework for Graph-based Recommendation with Large Language Models | - | 0
RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-Checking | Code | 0
Is your batch size the problem? Revisiting the Adam-SGD gap in language modeling | - | 0
Quantizing Small-Scale State-Space Models for Edge AI | - | 0
Beyond Sin-Squared Error: Linear-Time Entrywise Uncertainty Quantification for Streaming PCA | - | 0
A Transfer Learning Framework for Multilayer Networks via Model Averaging | - | 0
Interpretable Causal Representation Learning for Biological Data in the Pathway Space | - | 0
Understanding the Effect of Knowledge Graph Extraction Error on Downstream Graph Analyses: A Case Study on Affiliation Graphs | - | 0
From Ground to Sky: Architectures, Applications, and Challenges Shaping Low-Altitude Wireless Networks | - | 0
Automated Heuristic Design for Unit Commitment Using Large Language Models | - | 0
Instantaneous Failure, Repair and Mobility Rates for Markov Reliability Systems: A Wind-Farm application | - | 0
Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction | - | 0
Efficient Star Distillation Attention Network for Lightweight Image Super-Resolution | - | 0
Deploying and Evaluating Multiple Deep Learning Models on Edge Devices for Diabetic Retinopathy Detection | - | 0
Wasserstein-Barycenter Consensus for Cooperative Multi-Agent Reinforcement Learning | - | 0
ECLIP: Energy-efficient and Practical Co-Location of ML Inference on Spatially Partitioned GPUs | - | 0
Adaptive Multi-resolution Hash-Encoding Framework for INR-based Dental CBCT Reconstruction with Truncated FOV | - | 0
Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis | - | 0
Less Conservative Adaptive Gain-scheduling Control for Continuous-time Systems with Polytopic Uncertainties | - | 0
Behavioral Generative Agents for Energy Operations | - | 0
Step-by-Step Reasoning Attack: Revealing 'Erased' Knowledge in Large Language Models | - | 0
Second Order State Hallucinations for Adversarial Attack Mitigation in Formation Control of Multi-Agent Systems | - | 0
The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries | - | 0
Detecting Narrative Shifts through Persistent Structures: A Topological Analysis of Media Discourse | - | 0
FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation | Code | 3
SPIRE: Conditional Personalization for Federated Diffusion Generative Models | - | 0
A Gradient Meta-Learning Joint Optimization for Beamforming and Antenna Position in Pinching-Antenna Systems | - | 0
OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics | Code | 4
Cross-Domain Conditional Diffusion Models for Time Series Imputation | Code | 0
Page 313 of 9486