SOTAVerified

GPU

Papers

Showing 891900 of 5629 papers

TitleStatusHype
QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language ModelsCode1
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language ModelsCode1
No Privacy Left Outside: On the (In-)Security of TEE-Shielded DNN Partition for On-Device MLCode1
Sparse Fine-tuning for Inference Acceleration of Large Language ModelsCode1
Persis: A Persian Font Recognition Pipeline Using Convolutional Neural NetworksCode1
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning ModelsCode1
Surgical Gym: A high-performance GPU-based platform for reinforcement learning with surgical robotsCode1
Model Tells You What to Discard: Adaptive KV Cache Compression for LLMsCode1
Label Supervised LLaMA FinetuningCode1
Training a Large Video Model on a Single Machine in a DayCode1
Show:102550
← PrevPage 90 of 563Next →

No leaderboard results yet.