SOTAVerified

GPU

Papers

Showing 38263850 of 5629 papers

TitleStatusHype
ProMoE: Fast MoE-based LLM Serving using Proactive Caching0
PromptGCN: Bridging Subgraph Gaps in Lightweight GCNs0
Prompts to Summaries: Zero-Shot Language-Guided Video Summarization0
Properties on n-dimensional convolution for image deconvolution0
PropNEAT -- Efficient GPU-Compatible Backpropagation over NeuroEvolutionary Augmenting Topology Networks0
Protea: Client Profiling within Federated Systems using Flower0
ProTEA: Programmable Transformer Encoder Acceleration on FPGA0
Protecting Confidentiality, Privacy and Integrity in Collaborative Learning0
ProteinZero: Self-Improving Protein Generation via Online Reinforcement Learning0
ProTrain: Efficient LLM Training via Memory-Aware Techniques0
Providing Meaningful Data Summarizations Using Exemplar-based Clustering in Industry 4.00
Proximal Gradient Descent Unfolding Dense-spatial Spectral-attention Transformer for Compressive Spectral Imaging0
Pruned RNN-T for fast, memory-efficient ASR training0
Prune or quantize? Strategy for Pareto-optimally low-cost and accurate CNN0
Pruning Compact ConvNets for Efficient Inference0
Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation0
Pushing the Limits of Beam Search Decoding for Transducer-based ASR models0
Pushing the Limits of BFP on Narrow Precision LLM Inference0
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs0
PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch0
Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection0
Python Workflows on HPC Systems0
PZnet: Efficient 3D ConvNet Inference on Manycore CPUs0
QDyLoRA: Quantized Dynamic Low-Rank Adaptation for Efficient Large Language Model Tuning0
QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources0
Show:102550
← PrevPage 154 of 226Next →

No leaderboard results yet.