SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers; 248,104 code links; 4,818 tasks

Papers

Showing 2301-2350 of 177339 papers

Title | Status | Hype
Deep Generative Models on 3D Representations: A Survey | Code | 3
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce | Code | 3
ST-MoE: Designing Stable and Transferable Sparse Expert Models | Code | 3
Landmark Attention: Random-Access Infinite Context Length for Transformers | Code | 3
Evaluating Hallucinations in Chinese Large Language Models | Code | 3
ViTPose++: Vision Transformer for Generic Body Pose Estimation | Code | 3
FAN: Fourier Analysis Networks | Code | 3
FilterNet: Harnessing Frequency Filters for Time Series Forecasting | Code | 3
QuEst: Graph Transformer for Quantum Circuit Reliability Estimation | Code | 3
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction | Code | 3
KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction | Code | 3
BERGEN: A Benchmarking Library for Retrieval-Augmented Generation | Code | 3
MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications | Code | 3
Evaluating Text-to-Visual Generation with Image-to-Text Generation | Code | 3
Attention Is All You Need | Code | 3
CodeTF: One-stop Transformer Library for State-of-the-art Code LLM | Code | 3
StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization | Code | 3
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation | Code | 3
Residual Kolmogorov-Arnold Network for Enhanced Deep Learning | Code | 3
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation | Code | 3
AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into One | Code | 3
A Survey on LoRA of Large Language Models | Code | 3
VisionZip: Longer is Better but Not Necessary in Vision Language Models | Code | 3
Humans in 4D: Reconstructing and Tracking Humans with Transformers | Code | 3
Sigmoid Loss for Language Image Pre-Training | Code | 3
Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding | Code | 3
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback | Code | 3
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning | Code | 3
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation | Code | 3
Restoring Images in Adverse Weather Conditions via Histogram Transformer | Code | 3
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding | Code | 3
NeuSpeech: Decode Neural signal as Speech | Code | 3
YOLOv4: Optimal Speed and Accuracy of Object Detection | Code | 3
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding | Code | 3
One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation | Code | 3
EAT: Self-Supervised Pre-Training with Efficient Audio Transformer | Code | 3
State Space Models for Event Cameras | Code | 3
Inference Performance Optimization for Large Language Models on CPUs | Code | 3
OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic Foundation Models | Code | 3
UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving | Code | 3
Visual Prompt Tuning | Code | 3
MoonCast: High-Quality Zero-Shot Podcast Generation | Code | 3
Finetuned Language Models Are Zero-Shot Learners | Code | 3
IMAGGarment-1: Fine-Grained Garment Generation for Controllable Fashion Design | Code | 3
Revisiting Pre-Trained Models for Chinese Natural Language Processing | Code | 3
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity | Code | 3
vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving | Code | 3
Frequency Dynamic Convolution for Dense Image Prediction | Code | 3
Accelerating Goal-Conditioned RL Algorithms and Research | Code | 3
Jukebox: A Generative Model for Music | Code | 3