SOTAVerified|Agents Browse Leaderboard About Blog

GPU

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–125 of 5629 papers

Title	Date	Tasks	Status	Hype
Theseus: A Library for Differentiable Nonlinear Optimization	Jul 19, 2022	GPU	CodeCode Available	4
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals	Jul 18, 2024	Experimental DesignGPU	CodeCode Available	4
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation	Jun 4, 2024	Face SwappingGPU	CodeCode Available	4
OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit	May 12, 2025	GPUPrivacy Preserving	CodeCode Available	4
On Scaling Up 3D Gaussian Splatting Training	Jun 26, 2024	3DGS3D Reconstruction	CodeCode Available	4
Multi-head Temporal Latent Attention	May 19, 2025	GPUspeech-recognition	CodeCode Available	4
Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training	Oct 28, 2021	Deep LearningGPU	CodeCode Available	4
Building reliable sim driving agents by scaling self-play	Feb 20, 2025	Autonomous VehiclesBenchmarking	CodeCode Available	4
High-Resolution Image Synthesis with Latent Diffusion Models	Dec 20, 2021	DenoisingGPU	CodeCode Available	4
Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints	Apr 15, 2025	GPUInference Optimization	CodeCode Available	4
DeepSpeed Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale	Jun 30, 2022	CPUGPU	CodeCode Available	4
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts	Oct 9, 2024	GPUMixture-of-Experts	CodeCode Available	4
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float	Apr 15, 2025	CPUGPU	CodeCode Available	4
FedML Parrot: A Scalable Federated Learning System via Heterogeneity-aware Scheduling on Sequential and Hierarchical Training	Mar 3, 2023	Federated LearningGPU	CodeCode Available	4
FFCV: Accelerating Training by Removing Data Bottlenecks	Jun 21, 2023	CPUGPU	CodeCode Available	4
Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion	Jan 27, 2023	GPUImage Generation	CodeCode Available	4
EvoX: A Distributed GPU-accelerated Framework for Scalable Evolutionary Computation	Jan 29, 2023	GPUNavigate	CodeCode Available	4
JAX-Fluids 2.0: Towards HPC for Differentiable CFD of Compressible Two-phase Flows	Feb 7, 2024	GPU	CodeCode Available	4
fastai: A Layered API for Deep Learning	Feb 11, 2020	Deep LearningGPU	CodeCode Available	4
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference	Oct 6, 2023	GPUImage Generation	CodeCode Available	4
4D Gaussian Splatting for Real-Time Dynamic Scene Rendering	Oct 12, 2023	Dynamic ReconstructionGPU	CodeCode Available	4
Billion-scale similarity search with GPUs	Feb 28, 2017	GPUImage Similarity Search	CodeCode Available	4
Accelerating Visual-Policy Learning through Parallel Differentiable Simulation	May 15, 2025	GPU	CodeCode Available	4
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module	Nov 9, 2023	GPUImage Generation	CodeCode Available	4
Mamba-FETrack: Frame-Event Tracking via State Space Model	Apr 28, 2024	GPUMamba	CodeCode Available	4

Show:10 25 50

← PrevPage 5 of 226Next →

No leaderboard results yet.