SOTAVerified

GPU

Papers

Showing 5251-5300 of 5629 papers

| Title | Status | Hype |
| --- | --- | --- |
| PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models | | 0 |
| Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference | | 0 |
| Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving | | 0 |
| Privacy preserving Neural Network Inference on Encrypted Data with GPUs | | 0 |
| Privacy-Preserving Text Classification on BERT Embeddings with Homomorphic Encryption | | 0 |
| Private LoRA Fine-tuning of Open-Source LLMs with Homomorphic Encryption | | 0 |
| PrivateLoRA For Efficient Privacy Preserving LLM | | 0 |
| PrivFT: Private and Fast Text Classification with Homomorphic Encryption | | 0 |
| ProAI: An Efficient Embedded AI Hardware for Automotive Applications -- a Benchmark Study | | 0 |
| Probabilistic Deep Learning using Random Sum-Product Networks | | 0 |
| Probabilistic hypergraph grammars for efficient molecular optimization | | 0 |
| Probabilistic Inference of Simulation Parameters via Parallel Differentiable Simulation | | 0 |
| Probe-based Rapid Hybrid Hyperspectral and Tissue Surface Imaging Aided by Fully Convolutional Networks | | 0 |
| Processing Energy Modeling for Neural Network Based Image Compression | | 0 |
| Productive Reproducible Workflows for DNNs: A Case Study for Industrial Defect Detection | | 0 |
| Profiling based Out-of-core Hybrid Method for Large Neural Networks | | 0 |
| Progressively refined deep joint registration segmentation (ProRSeg) of gastrointestinal organs at risk: Application to MRI and cone-beam CT | | 0 |
| ProMoE: Fast MoE-based LLM Serving using Proactive Caching | | 0 |
| PromptGCN: Bridging Subgraph Gaps in Lightweight GCNs | | 0 |
| Prompts to Summaries: Zero-Shot Language-Guided Video Summarization | | 0 |
| Properties on n-dimensional convolution for image deconvolution | | 0 |
| PropNEAT -- Efficient GPU-Compatible Backpropagation over NeuroEvolutionary Augmenting Topology Networks | | 0 |
| Protea: Client Profiling within Federated Systems using Flower | | 0 |
| ProTEA: Programmable Transformer Encoder Acceleration on FPGA | | 0 |
| Protecting Confidentiality, Privacy and Integrity in Collaborative Learning | | 0 |
| ProteinZero: Self-Improving Protein Generation via Online Reinforcement Learning | | 0 |
| ProTrain: Efficient LLM Training via Memory-Aware Techniques | | 0 |
| Providing Meaningful Data Summarizations Using Exemplar-based Clustering in Industry 4.0 | | 0 |
| Proximal Gradient Descent Unfolding Dense-spatial Spectral-attention Transformer for Compressive Spectral Imaging | | 0 |
| Pruned RNN-T for fast, memory-efficient ASR training | | 0 |
| Prune or quantize? Strategy for Pareto-optimally low-cost and accurate CNN | | 0 |
| Pruning Compact ConvNets for Efficient Inference | | 0 |
| Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation | | 0 |
| Pushing the Limits of Beam Search Decoding for Transducer-based ASR models | | 0 |
| Pushing the Limits of BFP on Narrow Precision LLM Inference | | 0 |
| Puzzle: Distillation-Based NAS for Inference-Optimized LLMs | | 0 |
| PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch | | 0 |
| Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection | | 0 |
| Python Workflows on HPC Systems | | 0 |
| PZnet: Efficient 3D ConvNet Inference on Manycore CPUs | | 0 |
| QDyLoRA: Quantized Dynamic Low-Rank Adaptation for Efficient Large Language Model Tuning | | 0 |
| QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources | | 0 |
| QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach | | 0 |
| QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects | | 0 |
| QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime Reconfiguration | | 0 |
| QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation | | 0 |
| QuAILoRA: Quantization-Aware Initialization for LoRA | | 0 |
| Qualities, challenges and future of genetic algorithms: a literature review | | 0 |
| QuantEase: Optimization-based Quantization for Language Models | | 0 |
| Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control | | 0 |
