SOTAVerified

GPU

Papers

Showing 351360 of 5629 papers

TitleStatusHype
HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic SegmentationCode2
HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level SynthesisCode2
I-BERT: Integer-only BERT QuantizationCode2
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and DetectionCode2
ALBERT: A Lite BERT for Self-supervised Learning of Language RepresentationsCode2
Atom: Low-bit Quantization for Efficient and Accurate LLM ServingCode2
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech SynthesisCode2
InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information RetrievalCode2
INT-FlashAttention: Enabling Flash Attention for INT8 QuantizationCode2
HeadInfer: Memory-Efficient LLM Inference by Head-wise OffloadingCode2
Show:102550
← PrevPage 36 of 563Next →

No leaderboard results yet.