SOTAVerified

GPU

Papers

Showing 12111220 of 5629 papers

TitleStatusHype
CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMsCode1
LOGO -- Long cOntext aliGnment via efficient preference OptimizationCode1
Dynamic GPU Energy Optimization for Machine Learning Training WorkloadsCode1
DAGER: Exact Gradient Inversion for Large Language ModelsCode1
Dynamic DNNs and Runtime Management for Efficient Inference on Mobile/Embedded DevicesCode1
Transformer TrackingCode1
Dynamic Low-Rank Sparse Adaptation for Large Language ModelsCode1
Efficient Quantized Sparse Matrix Operations on Tensor CoresCode1
LiteTrack: Layer Pruning with Asynchronous Feature Extraction for Lightweight and Efficient Visual TrackingCode1
CrAM: A Compression-Aware MinimizerCode1
Show:102550
← PrevPage 122 of 563Next →

No leaderboard results yet.