SOTAVerified

GPU

Papers

Showing 38013850 of 5629 papers

TitleStatusHype
Practice with Graph-based ANN Algorithms on Sparse Data: Chi-square Two-tower model, HNSW, Sign Cauchy Projections0
PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers0
Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining0
Predicting Efficiency/Effectiveness Trade-offs for Dense vs. Sparse Retrieval Strategy Selection0
Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters0
PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for Planning, Control, and Simulation0
Prediction of GPU Failures Under Deep Learning Workloads0
PRE-NAS: Predictor-assisted Evolutionary Neural Architecture Search0
PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models0
Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference0
Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving0
Privacy preserving Neural Network Inference on Encrypted Data with GPUs0
Privacy-Preserving Text Classification on BERT Embeddings with Homomorphic Encryption0
Private LoRA Fine-tuning of Open-Source LLMs with Homomorphic Encryption0
PrivateLoRA For Efficient Privacy Preserving LLM0
PrivFT: Private and Fast Text Classification with Homomorphic Encryption0
ProAI: An Efficient Embedded AI Hardware for Automotive Applications -- a Benchmark Study0
Probabilistic Deep Learning using Random Sum-Product Networks0
Probabilistic hypergraph grammars for efficient molecular optimization0
Probabilistic Inference of Simulation Parameters via Parallel Differentiable Simulation0
Probe-based Rapid Hybrid Hyperspectral and Tissue Surface Imaging Aided by Fully Convolutional Networks0
Processing Energy Modeling for Neural Network Based Image Compression0
Productive Reproducible Workflows for DNNs: A Case Study for Industrial Defect Detection0
Profiling based Out-of-core Hybrid Method for Large Neural Networks0
Progressively refined deep joint registration segmentation (ProRSeg) of gastrointestinal organs at risk: Application to MRI and cone-beam CT0
ProMoE: Fast MoE-based LLM Serving using Proactive Caching0
PromptGCN: Bridging Subgraph Gaps in Lightweight GCNs0
Prompts to Summaries: Zero-Shot Language-Guided Video Summarization0
Properties on n-dimensional convolution for image deconvolution0
PropNEAT -- Efficient GPU-Compatible Backpropagation over NeuroEvolutionary Augmenting Topology Networks0
Protea: Client Profiling within Federated Systems using Flower0
ProTEA: Programmable Transformer Encoder Acceleration on FPGA0
Protecting Confidentiality, Privacy and Integrity in Collaborative Learning0
ProteinZero: Self-Improving Protein Generation via Online Reinforcement Learning0
ProTrain: Efficient LLM Training via Memory-Aware Techniques0
Providing Meaningful Data Summarizations Using Exemplar-based Clustering in Industry 4.00
Proximal Gradient Descent Unfolding Dense-spatial Spectral-attention Transformer for Compressive Spectral Imaging0
Pruned RNN-T for fast, memory-efficient ASR training0
Prune or quantize? Strategy for Pareto-optimally low-cost and accurate CNN0
Pruning Compact ConvNets for Efficient Inference0
Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation0
Pushing the Limits of Beam Search Decoding for Transducer-based ASR models0
Pushing the Limits of BFP on Narrow Precision LLM Inference0
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs0
PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch0
Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection0
Python Workflows on HPC Systems0
PZnet: Efficient 3D ConvNet Inference on Manycore CPUs0
QDyLoRA: Quantized Dynamic Low-Rank Adaptation for Efficient Large Language Model Tuning0
QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources0
Show:102550
← PrevPage 77 of 113Next →

No leaderboard results yet.