SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 77017750 of 661570 papers

TitleStatusHype
Heterogeneous Multi-Robot Reinforcement LearningCode2
FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence InferenceCode2
TRADES: Generating Realistic Market Simulations with Diffusion ModelsCode2
Learning to Compress Prompts with Gist TokensCode2
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language ModelsCode2
Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image TranslationCode2
Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A SurveyCode2
LongReward: Improving Long-context Large Language Models with AI FeedbackCode2
LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language ModelsCode2
Conformal Symplectic Optimization for Stable Reinforcement LearningCode2
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMsCode2
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image GenerationCode2
Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic GraspingCode2
Unifying Unsupervised Graph-Level Anomaly Detection and Out-of-Distribution Detection: A BenchmarkCode2
Dilated Neighborhood Attention TransformerCode2
UniGen: A Unified Framework for Textual Dataset Generation Using Large Language ModelsCode2
SEAL: Steerable Reasoning Calibration of Large Language Models for FreeCode2
LightGNN: Simple Graph Neural Network for RecommendationCode2
Edicho: Consistent Image Editing in the WildCode2
SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language ModelCode2
Real-Time Fitness Exercise Classification and Counting from Video FramesCode2
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction TuningCode2
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length GeneralizationCode2
RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQLCode2
FinBERT-QA: Financial Question Answering with pre-trained BERT Language ModelsCode2
Iterative Methods for Vecchia-Laplace Approximations for Latent Gaussian Process ModelsCode2
LitSearch: A Retrieval Benchmark for Scientific Literature SearchCode2
xPatch: Dual-Stream Time Series Forecasting with Exponential Seasonal-Trend DecompositionCode2
Efficient Spatially Sparse Inference for Conditional GANs and Diffusion ModelsCode2
Auto-Encoded Supervision for Perceptual Image Super-ResolutionCode2
VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality AssessmentCode2
Learning Spatio-Temporal Dynamics for Trajectory Recovery via Time-Aware TransformerCode2
JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation FrameworkCode2
Squeezed Attention: Accelerating Long Context Length LLM InferenceCode2
FAdam: Adam is a natural gradient optimizer using diagonal empirical Fisher informationCode2
Adaptive Dual-domain Learning for Underwater Image EnhancementCode2
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion ModelsCode2
FATE-LLM: A Industrial Grade Federated Learning Framework for Large Language ModelsCode2
Slim attention: cut your context memory in half without loss of accuracy -- K-cache is all you need for MHACode2
Monocular Lane Detection Based on Deep Learning: A SurveyCode2
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic SegmentationCode2
PRformer: Pyramidal Recurrent Transformer for Multivariate Time Series ForecastingCode2
Diffusion Model Quantization: A ReviewCode2
CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware TrainingCode2
A Self-Supervised Descriptor for Image Copy DetectionCode2
CFDBench: A Large-Scale Benchmark for Machine Learning Methods in Fluid DynamicsCode2
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language ModelsCode2
ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and BeyondCode2
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single TransformerCode2
Neural Discrete Representation LearningCode2
Show:102550
← PrevPage 155 of 13232Next →