SOTAVerified

GPU

Papers

Showing 626650 of 5629 papers

TitleStatusHype
Revisiting Ensemble Methods for Stock Trading and Crypto Trading Tasks at ACM ICAIF FinRL Contest 2023-20240
No More Sliding Window: Efficient 3D Medical Image Segmentation with Differentiable Top-k Patch Sampling0
FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts Models0
Good things come in small packages: Should we build AI clusters with Lite-GPUs?0
PixelBrax: Learning Continuous Control from Pixels End-to-End on the GPUCode0
The Streaming Batch Model for Efficient and Fault-Tolerant Heterogeneous Execution0
FASP: Fast and Accurate Structured Pruning of Large Language Models0
Resource-Constrained Federated Continual Learning: What Does Matter?0
GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping0
Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement0
Towards Lightweight Time Series Forecasting: a Patch-wise Transformer with Weak Data Enriching0
Keras Sig: Efficient Path Signature Computation on GPU in Keras 30
Physics-Informed Latent Neural Operator for Real-time Predictions of Complex Physical Systems0
CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement LearningCode1
Hierarchical Autoscaling for Large Language Model Serving with Chiron0
A User's Guide to KSig: GPU-Accelerated Computation of the Signature KernelCode2
Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-ResolutionCode2
Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlappingCode1
Ultra Memory-Efficient On-FPGA Training of Transformers via Tensor-Compressed Optimization0
Towards Early Prediction of Self-Supervised Speech Model Performance0
TakuNet: an Energy-Efficient CNN for Real-Time Inference on Embedded UAV systems in Emergency Response ScenariosCode2
MS-Temba : Multi-Scale Temporal Mamba for Efficient Temporal Action DetectionCode1
Benchmarking Rotary Position Embeddings for Automatic Speech Recognition0
EXION: Exploiting Inter- and Intra-Iteration Output Sparsity for Diffusion Models0
Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters0
Show:102550
← PrevPage 26 of 226Next →

No leaderboard results yet.