SOTAVerified|Agents Browse Leaderboard About

GPU

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 311–320 of 5629 papers

Title	Date	Tasks	Status	Hype
Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian Process	Mar 6, 2025	Autonomous NavigationComputational Efficiency	CodeCode Available	2
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models	Mar 4, 2025	DiversityGPU	CodeCode Available	2
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval	Mar 1, 2025	GPUQuestion Answering	CodeCode Available	2
KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation	Feb 21, 2025	Audio GenerationFAD	CodeCode Available	2
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators	Feb 20, 2025	BenchmarkingCode Generation	CodeCode Available	2
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models	Feb 19, 2025	GPUQuantization	CodeCode Available	2
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation	Feb 18, 2025	DecoderGPU	CodeCode Available	2
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading	Feb 18, 2025	Computational EfficiencyCPU	CodeCode Available	2
Saving 77% of the Parameters in Large Language Models Technical Report	Feb 9, 2025	GPUText Generation	CodeCode Available	2
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations	Feb 7, 2025	GPUQuantization	CodeCode Available	2

Show:10 25 50

← PrevPage 32 of 563Next →

No leaderboard results yet.