SOTAVerified

GPU

Papers

Showing 18761900 of 5629 papers

TitleStatusHype
Edge-Enabled Real-time Railway Track Segmentation0
immrax: A Parallelizable and Differentiable Toolbox for Interval Analysis and Mixed Monotone Reachability in JAXCode1
A Lightweight FPGA-based IDS-ECU Architecture for Automotive CAN0
Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning0
Exact analytical algorithm for solvent accessible surface area and derivatives in implicit solvent molecular simulations on GPUs0
Towards providing reliable job completion time predictions using PCSCode0
Dynamic DNNs and Runtime Management for Efficient Inference on Mobile/Embedded DevicesCode1
PIN-SLAM: LiDAR SLAM Using a Point-Based Implicit Neural Representation for Achieving Global Map ConsistencyCode4
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space ModelCode2
LoMA: Lossless Compressed Memory Attention0
Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model InferenceCode1
Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language ModelsCode3
TP-Aware Dequantization0
Efficient approximation of Earth Mover's Distance Based on Nearest Neighbor SearchCode0
Beyond Traditional Approaches: Multi-Task Network for Breast Ultrasound DiagnosisCode0
Parameter-Efficient Detoxification with Contrastive Decoding0
E^2-LLM: Efficient and Extreme Length Extension of Large Language Models0
Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part I: Homogeneous Diffusion Inpainting0
Efficient Parallel Data Optimization for Homogeneous Diffusion Inpainting of 4K Images0
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase PredictionCode2
Extreme Compression of Large Language Models via Additive QuantizationCode5
PANDORA: A Parallel Dendrogram Construction Algorithm for Single Linkage Clustering on GPU0
MGARD: A multigrid framework for high-performance, error-controlled data compression and refactoring0
Towards Safe Load Balancing based on Control Barrier Functions and Deep Reinforcement Learning0
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency ModelsCode7
Show:102550
← PrevPage 76 of 226Next →

No leaderboard results yet.