SOTAVerified

GPU

Papers

Showing 32013250 of 5629 papers

TitleStatusHype
POLCA: Power Oversubscription in LLM Cloud Providers0
Computational limits to the legibility of the imaged human brainCode0
Efficient Benchmarking of Language Models0
High Performance Computing Applied to Logistic Regression: A CPU and GPU Implementation ComparisonCode0
GNNPipe: Scaling Deep GNN Training with Pipelined Model Parallelism0
Unlimited Knowledge Distillation for Action Recognition in the Dark0
Distributed Extra-gradient with Optimal Complexity and Communication GuaranteesCode0
GPU Accelerated Color Correction and Frame Warping for Real-time Video Stitching0
MovePose: A High-performance Human Pose Estimation Algorithm on Mobile and Edge Devices0
Learning representations by forward-propagating errors0
SkinDistilViT: Lightweight Vision Transformer for Skin Lesion ClassificationCode0
Accelerating Generic Graph Neural Networks via Architecture, Compiler, Partition Method Co-Design0
Digital twinning of cardiac electrophysiology models from the surface ECG: a geodesic backpropagation approach0
FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs0
SpecTracle: Wearable Facial Motion Tracking from Unobtrusive Peripheral Cameras0
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning0
Symphony: Optimized DNN Model Serving using Deferred Batch Scheduling0
InTune: Reinforcement Learning-based Data Pipeline Optimization for Deep Recommendation Models0
INR-Arch: A Dataflow Architecture and Compiler for Arbitrary-Order Gradient Computations in Implicit Neural Representation ProcessingCode0
Optimizing transformer-based machine translation model for single GPU training: a hyperparameter ablation study0
Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMUCode0
Vector quantization loss analysis in VQGANs: a single-GPU ablation study for image-to-image synthesisCode0
Real-time FPGA Implementation of CNN-based Distributed Fiber Optic Vibration Event Recognition Method0
High-Resolution Cranial Defect Reconstruction by Iterative, Low-Resolution, Point Cloud Completion Transformers0
Mask Frozen-DETR: High Quality Instance Segmentation with One GPU0
Communication-Free Distributed GNN Training with Vertex Cut0
Automatic registration with continuous pose updates for marker-less surgical navigation in spine surgery0
Exploiting On-chip Heterogeneity of Versal Architecture for GNN Inference Acceleration0
ES-MVSNet: Efficient Framework for End-to-end Self-supervised Multi-View Stereo0
Nonconvex optimization for optimum retrieval of the transmission matrix of a multimode fiberCode0
Digital Twin Brain: a simulation and assimilation platform for whole human brain0
Integrating Homomorphic Encryption and Trusted Execution Technology for Autonomous and Confidential Model Refining in Cloud0
DiviML: A Module-based Heuristic for Mapping Neural Networks onto Heterogeneous PlatformsCode0
LaFiCMIL: Rethinking Large File Classification from the Perspective of Correlated Multiple Instance Learning0
Interpolation-Split: a data-centric deep learning approach with big interpolated data to boost airway segmentation performance0
Detection of Children Abuse by Voice and Audio Classification by Short-Time Fourier Transform Machine Learning implemented on Nvidia Edge GPU device0
Benchmarking Performance of Deep Learning Model for Material Segmentation on Two HPC Systems0
YOLOBench: Benchmarking Efficient Object Detectors on Embedded SystemsCode0
EasyNet: An Easy Network for 3D Industrial Anomaly Detection0
Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation0
Implementing and Benchmarking the Locally Competitive Algorithm on the Loihi 2 Neuromorphic Processor0
Duet: efficient and scalable hybriD neUral rElation undersTandingCode0
Multi-GPU Approach for Training of Graph ML Models on large CFD Meshes0
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel SimulationCode0
Rail-only: A Low-Cost High-Performance Network for Training LLMs with Trillion Parameters0
An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and CalibrationCode0
Ada3D : Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection0
Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate GradientsCode0
MaGNAS: A Mapping-Aware Graph Neural Architecture Search Framework for Heterogeneous MPSoC Deployment0
DistTGL: Distributed Memory-Based Temporal Graph Neural Network Training0
Show:102550
← PrevPage 65 of 113Next →

No leaderboard results yet.