SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 5251-5275 of 177,340 papers

Title | Status | Hype
Unifying Unsupervised Graph-Level Anomaly Detection and Out-of-Distribution Detection: A Benchmark | Code | 2
Dilated Neighborhood Attention Transformer | Code | 2
UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models | Code | 2
SEAL: Steerable Reasoning Calibration of Large Language Models for Free | Code | 2
LightGNN: Simple Graph Neural Network for Recommendation | Code | 2
Edicho: Consistent Image Editing in the Wild | Code | 2
SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model | Code | 2
Real-Time Fitness Exercise Classification and Counting from Video Frames | Code | 2
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning | Code | 2
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization | Code | 2
RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQL | Code | 2
FinBERT-QA: Financial Question Answering with pre-trained BERT Language Models | Code | 2
Iterative Methods for Vecchia-Laplace Approximations for Latent Gaussian Process Models | Code | 2
LitSearch: A Retrieval Benchmark for Scientific Literature Search | Code | 2
xPatch: Dual-Stream Time Series Forecasting with Exponential Seasonal-Trend Decomposition | Code | 2
Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models | Code | 2
Auto-Encoded Supervision for Perceptual Image Super-Resolution | Code | 2
VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment | Code | 2
Learning Spatio-Temporal Dynamics for Trajectory Recovery via Time-Aware Transformer | Code | 2
JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework | Code | 2
Squeezed Attention: Accelerating Long Context Length LLM Inference | Code | 2
FAdam: Adam is a natural gradient optimizer using diagonal empirical Fisher information | Code | 2
Adaptive Dual-domain Learning for Underwater Image Enhancement | Code | 2
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Code | 2
FATE-LLM: A Industrial Grade Federated Learning Framework for Large Language Models | Code | 2