SOTAVerified

GPU

Papers

Showing 4851–4900 of 5629 papers

| Title | Status | Hype |
| --- | --- | --- |
| Mathematical Vocoder Algorithm : Modified Spectral Inversion for Efficient Neural Speech Synthesis | | 0 |
| Matrix Is All You Need | | 0 |
| MBPTrack: Improving 3D Point Cloud Tracking with Memory Networks and Box Priors | | 0 |
| MCR-DL: Mix-and-Match Communication Runtime for Deep Learning | | 0 |
| MCSD: An Efficient Language Model with Diverse Fusion | | 0 |
| MDHP-Net: Detecting an Emerging Time-exciting Threat in IVN | | 0 |
| Mean-Field Simulation-Based Inference for Cosmological Initial Conditions | | 0 |
| Mechanistic PDE Networks for Discovery of Governing Equations | | 0 |
| MedSAGa: Few-shot Memory Efficient Medical Image Segmentation using Gradient Low-Rank Projection in SAM | | 0 |
| MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes | | 0 |
| MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism | | 0 |
| Memory Analysis on the Training Course of DeepSeek Models | | 0 |
| Memory and Bandwidth are All You Need for Fully Sharded Data Parallel | | 0 |
| Memory-Constrained Semantic Segmentation for Ultra-High Resolution UAV Imagery | | 0 |
| Memory-efficient GAN-based Domain Translation of High Resolution 3D Medical Images | | 0 |
| Memory Efficient Invertible Neural Networks for 3D Photoacoustic Imaging | | 0 |
| Memory-efficient Learning for High-Dimensional MRI Reconstruction | | 0 |
| Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model | | 0 |
| Memory Efficient Patch-based Training for INR-based GANs | | 0 |
| Memory-Efficient Point Cloud Registration via Overlapping Region Sampling | | 0 |
| Memory-Efficient Training for Deep Speaker Embedding Learning in Speaker Verification | | 0 |
| Mini-batch Coresets for Memory-efficient Training of Large Language Models | | 0 |
| Merlin HugeCTR: GPU-accelerated Recommender System Training and Inference | | 0 |
| ME-Switch: A Memory-Efficient Expert Switching Framework for Large Language Models | | 0 |
| Meta Large Language Model Compiler: Foundation Models of Compiler Optimization | | 0 |
| Meta-training with Demonstration Retrieval for Efficient Few-shot Learning | | 0 |
| ME-ViT: A Single-Load Memory-Efficient FPGA Accelerator for Vision Transformers | | 0 |
| MEX: Memory-efficient Approach to Referring Multi-Object Tracking | | 0 |
| mFabric: An Efficient and Scalable Fabric for Mixture-of-Experts Training | | 0 |
| MGARD: A multigrid framework for high-performance, error-controlled data compression and refactoring | | 0 |
| microYOLO: Towards Single-Shot Object Detection on Microcontrollers | | 0 |
| MICS : Multi-steps, Inverse Consistency and Symmetric deep learning registration network | | 0 |
| MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models | | 0 |
| Minimal Interaction Seperated Tuning: A New Paradigm for Visual Adaptation | | 0 |
| Minimal Solutions to Generalized Three-View Relative Pose Problem | | 0 |
| minimax: Efficient Baselines for Autocurricula in JAX | | 0 |
| Minimax Strikes Back | | 0 |
| MiniNet: An extremely lightweight convolutional neural network for real-time unsupervised monocular depth estimation | | 0 |
| Miriam: Exploiting Elastic Kernels for Real-time Multi-DNN Inference on Edge GPU | | 0 |
| MIS-SLAM: Real-time Large Scale Dense Deformable SLAM System in Minimal Invasive Surgery Based on Heterogeneous Computing | | 0 |
| Mitigating Challenges of the Space Environment for Onboard Artificial Intelligence: Design Overview of the Imaging Payload on SpIRIT | | 0 |
| Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation | | 0 |
| Mixed-Precision Embedding Using a Cache | | 0 |
| Mixed Reality Depth Contour Occlusion Using Binocular Similarity Matching and Three-dimensional Contour Optimisation | | 0 |
| Mixed Sparsity Training: Achieving 4× FLOP Reduction for Transformer Pretraining | | 0 |
| Mixture of Experts with Mixture of Precisions for Tuning Quality of Service | | 0 |
| Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness | | 0 |
| ML-driven Hardware Cost Model for MLIR | | 0 |
| MLTCP: Congestion Control for DNN Training | | 0 |
| ML-Triton, A Multi-Level Compilation and Language Extension to Triton GPU Programming | | 0 |

No leaderboard results yet.