SOTAVerified|Agents Browse Leaderboard About

GPU

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 26–50 of 5629 papers

Title	Date	Tasks	Status	Hype	Score
ThunderKittens: Simple, Fast, and Adorable AI Kernels	Oct 27, 2024	GPUState Space Models	CodeCode Available	7	5
FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving	Nov 27, 2024	FairnessGPU	CodeCode Available	7	5
Mirage: A Multi-Level Superoptimizer for Tensor Programs	May 9, 2024	GPUNavigate	CodeCode Available	7	5
ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI	Oct 1, 2024	GPUImitation Learning	CodeCode Available	7	5
Scalable MatMul-free Language Modeling	Jun 4, 2024	GPULanguage Modeling	CodeCode Available	7	5
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning	Jan 25, 2025	BenchmarkingEvolutionary Algorithms	CodeCode Available	7	5
Revisiting PCA for time series reduction in temporal dimension	Dec 27, 2024	Computational EfficiencyDimensionality Reduction	CodeCode Available	7	5
Elixir: Train a Large Language Model on a Small GPU Cluster	Dec 10, 2022	CPUGPU	CodeCode Available	7	5
Pyramidal Flow Matching for Efficient Video Generative Modeling	Oct 8, 2024	GPUText-to-Video Generation	CodeCode Available	7	5
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers	Oct 31, 2022	GPULanguage Modelling	CodeCode Available	7	5
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis	May 14, 2025	DenoisingDepth Estimation	CodeCode Available	7	5
Labeling supervised fine-tuning data with the scaling law	May 5, 2024	coreference-resolutionCoreference Resolution	CodeCode Available	7	5
Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization	Mar 26, 2025	CPUGPU	CodeCode Available	7	5
EvoGP: A GPU-accelerated Framework for Tree-based Genetic Programming	Jan 21, 2025	Feature EngineeringGPU	CodeCode Available	7	5
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models	Jan 10, 2024	GPUImage Generation	CodeCode Available	7	5
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness	May 27, 2022	16k4k	CodeCode Available	6	5
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models	Sep 21, 2023	4kGPU	CodeCode Available	6	5
SqueezeLLM: Dense-and-Sparse Quantization	Jun 13, 2023	GPUQuantization	CodeCode Available	6	5
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning	Jul 17, 2023	GPULanguage Modeling	CodeCode Available	6	5
QLoRA: Efficient Finetuning of Quantized LLMs	May 23, 2023	ChatbotGPU	CodeCode Available	6	5
AudioLCM: Text-to-Audio Generation with Latent Consistency Models	Jun 1, 2024	Audio GenerationAudio Synthesis	CodeCode Available	5	5
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention	Apr 22, 2025	GPU	CodeCode Available	5	5
MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models	Aug 21, 2024	GPUQuantization	CodeCode Available	5	5
DEIM: DETR with Improved Matching for Fast Convergence	Dec 5, 2024	Data AugmentationGPU	CodeCode Available	5	5
Deep Lake: a Lakehouse for Deep Learning	Sep 22, 2022	Decision MakingDeep Learning	CodeCode Available	5	5

Show:10 25 50

← PrevPage 2 of 226Next →

No leaderboard results yet.