
GPU

Papers


Showing 526–550 of 5629 papers

Title	Date	Tasks	Status	Hype
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models	Feb 19, 2025	GPU, Quantization	Code Available	2
Activation-aware Probe-Query: Effective Key-Value Retrieval for Long-Context LLMs Inference	Feb 19, 2025	GPU, Retrieval	Unverified	0
Astra: Efficient and Money-saving Automatic Parallel Strategies Search on Heterogeneous GPUs	Feb 19, 2025	GPU	Unverified	0
GPU-Friendly Laplacian Texture Blending	Feb 19, 2025	GPU	Unverified	0
YOLOv12: Attention-Centric Real-Time Object Detectors	Feb 18, 2025	GPU, Object	Code Available	7
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading	Feb 18, 2025	Computational Efficiency, CPU	Code Available	2
An Experimental Study of SOTA LiDAR Segmentation Models	Feb 18, 2025	GPU, Motion Compensation	Unverified	0
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation	Feb 18, 2025	Decoder, GPU	Code Available	2
BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference	Feb 18, 2025	GPU, Language Modeling	Unverified	0
GPU Memory Usage Optimization for Backward Propagation in Deep Network Training	Feb 18, 2025	GPU	Unverified	0
SEA: Low-Resource Safety Alignment for Multimodal Large Language Models via Synthetic Embeddings	Feb 18, 2025	GPU, Safety Alignment	Code Available	0
Myna: Masking-Based Contrastive Learning of Musical Representations	Feb 18, 2025	Contrastive Learning, Data Augmentation	Code Available	1
Rotate, Clip, and Partition: Towards W2A4KV4 Quantization by Integrating Rotation and Learnable Non-uniform Quantizer	Feb 17, 2025	GPU, Quantization	Unverified	0
Fate: Fast Edge Inference of Mixture-of-Experts Models via Cross-Layer Gate	Feb 17, 2025	GPU, Mixture-of-Experts	Code Available	0
AdaSplash: Adaptive Sparse Flash Attention	Feb 17, 2025	GPU, Language Modeling	Code Available	1
Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption	Feb 17, 2025	Benchmarking, Code Summarization	Unverified	0
Massively Scaling Explicit Policy-conditioned Value Functions	Feb 17, 2025	Continuous Control	Unverified	0
Real-time Neural Rendering of LiDAR Point Clouds	Feb 17, 2025	GPU, Neural Rendering	Unverified	0
GPU-accelerated Multi-relational Parallel Graph Retrieval for Web-scale Recommendations	Feb 17, 2025	GPU, Metric Learning	Unverified	0
JExplore: Design Space Exploration Tool for Nvidia Jetson Boards	Feb 16, 2025	Benchmarking, GPU	Code Available	0
TPCap: Unlocking Zero-Shot Image Captioning with Trigger-Augmented and Multi-Modal Purification Modules	Feb 16, 2025	GPU, Image Captioning	Unverified	0
CalibQuant: 1-Bit KV Cache Quantization for Multimodal LLMs	Feb 15, 2025	Computational Efficiency, GPU	Code Available	1
An Efficient Large Recommendation Model: Towards a Resource-Optimal Scaling Law	Feb 14, 2025	Feature Compression, GPU	Unverified	0
KernelBench: Can LLMs Write Efficient GPU Kernels?	Feb 14, 2025	GPU	Code Available	4
Efficient solution validation of constraint satisfaction problems on neuromorphic hardware: the case of Sudoku puzzles	Feb 13, 2025	GPU	Code Available	0



No leaderboard results yet.