SOTAVerified

GPU

Papers

Showing 176200 of 5629 papers

TitleStatusHype
MegaBlocks: Efficient Sparse Training with Mixture-of-ExpertsCode3
A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation ModelsCode3
BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language ModelsCode3
Merlin: A Vision Language Foundation Model for 3D Computed TomographyCode3
mlpack 3: a fast, flexible machine learning libraryCode3
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video UnderstandingCode3
M+: Extending MemoryLLM with Scalable Long-Term MemoryCode3
94% on CIFAR-10 in 3.29 Seconds on a Single GPUCode3
MagicPIG: LSH Sampling for Efficient LLM GenerationCode3
Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context AccurayCode3
Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligenceCode3
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at ScaleCode3
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid ArchitectureCode3
LiteGS: A High-Performance Modular Framework for Gaussian Splatting TrainingCode3
EscherNet: A Generative Model for Scalable View SynthesisCode3
LinFusion: 1 GPU, 1 Minute, 16K ImageCode3
MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI ApplicationsCode3
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache ManagementCode3
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache QuantizationCode3
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated CharactersCode3
Nd-BiMamba2: A Unified Bidirectional Architecture for Multi-Dimensional Data ProcessingCode3
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token SequencesCode3
Data Generation for Hardware-Friendly Post-Training QuantizationCode3
Arctic Inference with Shift Parallelism: Fast and Efficient Open Source Inference System for Enterprise AICode3
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image GenerationCode3
Show:102550
← PrevPage 8 of 226Next →

No leaderboard results yet.