SOTAVerified

GPU

Papers

Showing 2001-2050 of 5629 papers

Title | Status | Hype
mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs | Code | 2
The GPU Phase Folding and Deep Learning Method for Detecting Exoplanet Transits | - | 0
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation | Code | 4
Jellyfish: A Large Language Model for Data Preprocessing | - | 0
FlashAvatar: High-fidelity Head Avatar with Efficient Gaussian Embedding | - | 0
Slice3D: Multi-Slice, Occlusion-Revealing, Single View 3D Reconstruction | - | 0
Virtual reservoir acceleration for CPU and GPU: Case study for coupled spin-torque oscillator reservoir | Code | 0
Minuet: Accelerating 3D Sparse Convolutions on GPUs | Code | 1
Label Delay in Online Continual Learning | - | 0
CoLLiE: Collaborative Training of Large Language Models in an Efficient Way | Code | 2
Optimized Parallelization of Boundary Integral Poisson-Boltzmann Solvers | Code | 0
A Simple Video Segmenter by Tracking Objects Along Axial Trajectories | Code | 1
Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding | Code | 1
Multi-scale Iterative Refinement towards Robust and Versatile Molecular Docking | - | 0
HiPA: Enabling One-Step Text-to-Image Diffusion Models via High-Frequency-Promoting Adaptation | - | 0
GNNFlow: A Distributed Framework for Continuous Temporal GNN Learning on Dynamic Graphs | Code | 1
Fast and Efficient 2-bit LLM Inference on GPU: 2/4/16-bit in a Weight Matrix with Asynchronous Dequantization | - | 0
SplitNeRF: Split Sum Approximation Neural Field for Joint Geometry, Illumination, and Material Estimation | Code | 0
Compressing the Backward Pass of Large-Scale Neural Architectures by Structured Activation Pruning | - | 0
vTrain: A Simulation Framework for Evaluating Cost-effective and Compute-optimal Large Language Model Training | Code | 1
Animatable 3D Gaussian: Fast and High-Quality Reconstruction of Multiple Human Avatars | Code | 1
SpotServe: Serving Generative Large Language Models on Preemptible Instances | Code | 1
An Ensemble of 2.5D ResUnet Based Models for Segmentation for Kidney and Masses | - | 0
GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions | - | 0
XLB: A differentiable massively parallel lattice Boltzmann library in Python | Code | 2
A GPU-based Hydrodynamic Simulator with Boid Interactions | Code | 0
Wavelength-multiplexed Multi-mode EUV Reflection Ptychography based on Automatic-Differentiation | - | 0
SySMOL: Co-designing Algorithms and Hardware for Neural Networks with Heterogeneous Precisions | - | 0
PrivateLoRA For Efficient Privacy Preserving LLM | - | 0
Volumetric Reconstruction Resolves Off-Resonance Artifacts in Static and Dynamic PROPELLER MRI | Code | 0
NeutronOrch: Rethinking Sample-based GNN Training under CPU-GPU Heterogeneous Environments | - | 0
Vast TVB parameter space exploration: A Modular Framework for Accelerating the Multi-Scale Simulation of Human Brain Dynamics | - | 0
A Survey of Serverless Machine Learning Model Inference | - | 0
Scalable CP Decomposition for Tensor Learning using GPU Tensor Cores | - | 0
Learning to Fly in Seconds | Code | 2
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model | Code | 2
ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization | Code | 1
All-to-all reconfigurability with sparse and higher-order Ising machines | Code | 0
Descriptor and Word Soups: Overcoming the Parameter Efficiency Accuracy Tradeoff for Out-of-Distribution Few-shot Learning | Code | 0
Mobile-Seed: Joint Semantic Segmentation and Boundary Detection for Mobile Robots | Code | 1
minimax: Efficient Baselines for Autocurricula in JAX | - | 0
HPCNeuroNet: Advancing Neuromorphic Audio Signal Processing with Transformer-Enhanced Spiking Neural Networks | - | 0
Quantum-Enhanced Support Vector Machine for Large-Scale Stellar Classification with GPU Acceleration | - | 0
Energy efficiency in Edge TPU vs. embedded GPU for computer-aided medical imaging segmentation and classification | - | 0
Towards Perturbation-Induced Static Pivoting on GPU-Based Linear Solvers | - | 0
PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction | - | 0
Practical cross-sensor color constancy using a dual-mapping strategy | - | 0
LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning | Code | 1
Zero redundancy distributed learning with differential privacy | - | 0
Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis | Code | 0
Page 41 of 113
