GPU

Papers

Showing 3651-3700 of 5629 papers

| Title | Status | Hype |
| --- | --- | --- |
| Optimal Kernel Tuning Parameter Prediction using Deep Sequence Models | | 0 |
| Optimal Piecewise Linear Function Approximation for GPU-based Applications | | 0 |
| Optimal Transport on the Lie Group of Roto-translations | | 0 |
| Optimization and Application of Cloud-based Deep Learning Architecture for Multi-Source Data Prediction | | 0 |
| Bypass Back-propagation: Optimization-based Structural Pruning for Large Language Models via Policy Gradient | | 0 |
| Optimization of Heterogeneous Systems with AI Planning Heuristics and Machine Learning: A Performance and Energy Aware Approach | | 0 |
| Optimized 3D Gaussian Splatting using Coarse-to-Fine Image Frequency Modulation | | 0 |
| Optimized and autonomous machine learning framework for characterizing pores, particles, grains and grain boundaries in microstructural images | | 0 |
| Rail-only: A Low-Cost High-Performance Network for Training LLMs with Trillion Parameters | | 0 |
| Optimizing Anchor-based Detectors for Autonomous Driving Scenes | | 0 |
| Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification | | 0 |
| Optimizing Data Collection in Deep Reinforcement Learning | | 0 |
| Optimizing Distributed Training on Frontier for Large Language Models | | 0 |
| Optimizing LLM Queries in Relational Data Analytics Workloads | | 0 |
| Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs | | 0 |
| Optimizing Mixture-of-Experts Inference Time Combining Model Deployment and Communication Scheduling | | 0 |
| Optimizing Multi-GPU Parallelization Strategies for Deep Learning Training | | 0 |
| Optimizing the Linear Fascicle Evaluation Algorithm for Multi-Core and Many-Core Systems | | 0 |
| Optimizing transformer-based machine translation model for single GPU training: a hyperparameter ablation study | | 0 |
| Optimizing VarLiNGAM for Scalable and Efficient Time Series Causal Discovery | | 0 |
| Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitation | | 0 |
| Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression | | 0 |
| Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training | | 0 |
| Order-sensitive Neural Constituency Parsing | | 0 |
| Orders-of-magnitude speedup in atmospheric chemistry modeling through neural network-based emulation | | 0 |
| Organ Segmentation From Full-size CT Images Using Memory-Efficient FCN | | 0 |
| ORIGAMI: A Heterogeneous Split Architecture for In-Memory Acceleration of Learning | | 0 |
| ORStereo: Occlusion-Aware Recurrent Stereo Matching for 4K-Resolution Images | | 0 |
| Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads | | 0 |
| OSSID: Online Self-Supervised Instance Detection by (and for) Pose Estimation | | 0 |
| One-shot Ultra-high-Resolution Generative Adversarial Network That Synthesizes 16K Images On A Single GPU | | 0 |
| OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models | | 0 |
| OUTCOMES: Rapid Under-sampling Optimization achieves up to 50% improvements in reconstruction accuracy for multi-contrast MRI sequences | | 0 |
| Out-of-Core GPU Gradient Boosting | | 0 |
| Out-of-Core Surface Reconstruction via Global TGV Minimization | | 0 |
| Out-of-core Training for Extremely Large-Scale Neural Networks With Adaptive Window-Based Scheduling | | 0 |
| P4O: Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization | | 0 |
| Pack and Detect: Fast Object Detection in Videos Using Region-of-Interest Packing | | 0 |
| PackMamba: Efficient Processing of Variable-Length Sequences in Mamba training | | 0 |
| PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer | | 0 |
| PAFNet: An Efficient Anchor-Free Object Detector Guidance | | 0 |
| Paged Attention Meets FlexAttention: Unlocking Long-Context Efficiency in Deployed Inference | | 0 |
| PaGraph: Scaling GNN Training on Large Graphs via Computation-aware Caching and Partitioning | | 0 |
| Paleoinspired Vision: From Exploring Colour Vision Evolution to Inspiring Camera Design | | 0 |
| PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms | | 0 |
| Pan-Cancer Diagnostic Consensus Through Searching Archival Histopathology Images Using Artificial Intelligence | | 0 |
| PANDORA: A Parallel Dendrogram Construction Algorithm for Single Linkage Clustering on GPU | | 0 |
| Pan-LUT: Efficient Pan-sharpening via Learnable Look-Up Tables | | 0 |
| ParaGraph: Weighted Graph Representation for Performance Optimization of HPC Kernels | | 0 |
| Parallel 3DPIFCM Algorithm for Noisy Brain MRI Images | | 0 |

No leaderboard results yet.