SOTAVerified
GPU Papers

Showing 201–250 of 5629 papers

Title | Status | Hype
Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image | Code | 3
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion | Code | 3
Transformers Can Do Arithmetic with the Right Embeddings | Code | 3
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention | Code | 3
vHeat: Building Vision Models upon Heat Conduction | Code | 3
NGD-SLAM: Towards Real-Time Dynamic SLAM without GPU | Code | 3
vAttention: Dynamic Memory Management for Serving LLMs without PagedAttention | Code | 3
Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services | Code | 3
SnapKV: LLM Knows What You are Looking for Before Generation | Code | 3
MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Code | 3
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding | Code | 3
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding | Code | 3
Allo: A Programming Model for Composable Accelerator Design | Code | 3
BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models | Code | 3
Tensorized NeuroEvolution of Augmenting Topologies for GPU Acceleration | Code | 3
GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA | Code | 3
94% on CIFAR-10 in 3.29 Seconds on a Single GPU | Code | 3
The Unreasonable Ineffectiveness of the Deeper Layers | Code | 3
GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting | Code | 3
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve | Code | 3
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning | Code | 3
TorchCP: A Python Library for Conformal Prediction | Code | 3
BitDelta: Your Fine-Tune May Only Be Worth One Bit | Code | 3
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models | Code | 3
EscherNet: A Generative Model for Scalable View Synthesis | Code | 3
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs | Code | 3
Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces | Code | 3
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization | Code | 3
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design | Code | 3
MoE-Infinity: Efficient MoE Inference on Personal Machines with Sparsity-Aware Expert Cache | Code | 3
Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models | Code | 3
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation | Code | 3
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models | Code | 3
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices | Code | 3
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library | Code | 3
Splatter Image: Ultra-Fast Single-View 3D Reconstruction | Code | 3
S-LoRA: Serving Thousands of Concurrent LoRA Adapters | Code | 3
Punica: Multi-Tenant LoRA Serving | Code | 3
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs | Code | 3
Take the aTrain. Introducing an Interface for the Accessible Transcription of Interviews | Code | 3
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation | Code | 3
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation | Code | 3
nanoT5: A PyTorch Framework for Pre-training and Fine-tuning T5-style Models with Limited Resources | Code | 3
Retentive Network: A Successor to Transformer for Large Language Models | Code | 3
TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement | Code | 3
Fine-Tuning Language Models with Just Forward Passes | Code | 3
Unlimiformer: Long-Range Transformers with Unlimited Length Input | Code | 3
TorchBench: Benchmarking PyTorch with High API Surface Coverage | Code | 3
FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization | Code | 3
EvoTorch: Scalable Evolutionary Computation in Python | Code | 3
Page 5 of 113
