SOTAVerified

GPU

Papers

Showing 2251-2300 of 5629 papers

Title | Status | Hype
Distilled GPT for Source Code Summarization | Code | 0
A Generalization of Continuous Relaxation in Structured Pruning | - | 0
Flexible Techniques for Differentiable Rendering with 3D Gaussians | - | 0
SPEED: Streaming Partition and Parallel Acceleration for Temporal Interaction Graph Embedding | Code | 0
DM-VTON: Distilled Mobile Real-time Virtual Try-On | Code | 1
Staleness-Alleviated Distributed GNN Training via Online Dynamic-Embedding Prediction | - | 0
SoTaNa: The Open-Source Software Development Assistant | Code | 1
Efficient Learned Lossless JPEG Recompression | - | 0
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models | Code | 2
JAX-LOB: A GPU-Accelerated limit order book simulator to unlock large scale reinforcement learning for trading | - | 0
POLCA: Power Oversubscription in LLM Cloud Providers | - | 0
FastSurfer-HypVINN: Automated sub-segmentation of the hypothalamus and adjacent structures on high-resolutional brain MRI | Code | 2
Computational limits to the legibility of the imaged human brain | Code | 0
Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference | Code | 1
A Unified Framework for 3D Point Cloud Visual Grounding | Code | 1
Efficient Benchmarking of Language Models | - | 0
Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks | Code | 1
EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition | Code | 1
High Performance Computing Applied to Logistic Regression: A CPU and GPU Implementation Comparison | Code | 0
GNNPipe: Scaling Deep GNN Training with Pipelined Model Parallelism | - | 0
Unlimited Knowledge Distillation for Action Recognition in the Dark | - | 0
Learning representations by forward-propagating errors | - | 0
MovePose: A High-performance Human Pose Estimation Algorithm on Mobile and Edge Devices | - | 0
Distributed Extra-gradient with Optimal Complexity and Communication Guarantees | Code | 0
GPU Accelerated Color Correction and Frame Warping for Real-time Video Stitching | - | 0
FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs | - | 0
SkinDistilViT: Lightweight Vision Transformer for Skin Lesion Classification | Code | 0
Digital twinning of cardiac electrophysiology models from the surface ECG: a geodesic backpropagation approach | - | 0
Accelerating Generic Graph Neural Networks via Architecture, Compiler, Partition Method Co-Design | - | 0
Symphony: Optimized DNN Model Serving using Deferred Batch Scheduling | - | 0
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning | - | 0
Platypus: Quick, Cheap, and Powerful Refinement of LLMs | Code | 2
SpecTracle: Wearable Facial Motion Tracking from Unobtrusive Peripheral Cameras | - | 0
InTune: Reinforcement Learning-based Data Pipeline Optimization for Deep Recommendation Models | - | 0
When Monte-Carlo Dropout Meets Multi-Exit: Optimizing Bayesian Neural Networks on FPGA | Code | 1
Optimizing transformer-based machine translation model for single GPU training: a hyperparameter ablation study | - | 0
INR-Arch: A Dataflow Architecture and Compiler for Arbitrary-Order Gradient Computations in Implicit Neural Representation Processing | Code | 0
Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU | Code | 0
High-performance Data Management for Whole Slide Image Analysis in Digital Pathology | Code | 1
Real-time FPGA Implementation of CNN-based Distributed Fiber Optic Vibration Event Recognition Method | - | 0
Vector quantization loss analysis in VQGANs: a single-GPU ablation study for image-to-image synthesis | Code | 0
Application-Oriented Benchmarking of Quantum Generative Learning Using QUARK | Code | 1
High-Resolution Cranial Defect Reconstruction by Iterative, Low-Resolution, Point Cloud Completion Transformers | - | 0
Mask Frozen-DETR: High Quality Instance Segmentation with One GPU | - | 0
Communication-Free Distributed GNN Training with Vertex Cut | - | 0
Automatic registration with continuous pose updates for marker-less surgical navigation in spine surgery | - | 0
Exploiting On-chip Heterogeneity of Versal Architecture for GNN Inference Acceleration | - | 0
ES-MVSNet: Efficient Framework for End-to-end Self-supervised Multi-View Stereo | - | 0
Nonconvex optimization for optimum retrieval of the transmission matrix of a multimode fiber | Code | 0
Integrating Homomorphic Encryption and Trusted Execution Technology for Autonomous and Confidential Model Refining in Cloud | - | 0
