SOTAVerified

GPU

Papers

Showing 52765300 of 5629 papers

TitleStatusHype
ProteinZero: Self-Improving Protein Generation via Online Reinforcement Learning0
ProTrain: Efficient LLM Training via Memory-Aware Techniques0
Providing Meaningful Data Summarizations Using Exemplar-based Clustering in Industry 4.00
Proximal Gradient Descent Unfolding Dense-spatial Spectral-attention Transformer for Compressive Spectral Imaging0
Pruned RNN-T for fast, memory-efficient ASR training0
Prune or quantize? Strategy for Pareto-optimally low-cost and accurate CNN0
Pruning Compact ConvNets for Efficient Inference0
Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation0
Pushing the Limits of Beam Search Decoding for Transducer-based ASR models0
Pushing the Limits of BFP on Narrow Precision LLM Inference0
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs0
PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch0
Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection0
Python Workflows on HPC Systems0
PZnet: Efficient 3D ConvNet Inference on Manycore CPUs0
QDyLoRA: Quantized Dynamic Low-Rank Adaptation for Efficient Large Language Model Tuning0
QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources0
QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach0
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects0
QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime Reconfiguration0
QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation0
QuAILoRA: Quantization-Aware Initialization for LoRA0
Qualities, challenges and future of genetic algorithms: a literature review0
QuantEase: Optimization-based Quantization for Language Models0
Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control0
Show:102550
← PrevPage 212 of 226Next →

No leaderboard results yet.