SOTAVerified

GPU

Papers

Showing 326350 of 5629 papers

TitleStatusHype
GPT Carry-On: Training Foundation Model for Customization Could Be Simple, Scalable and Affordable0
A Comparison of Deep Learning Methods for Cell Detection in Digital CytologyCode0
CRYSIM: Prediction of Symmetric Structures of Large Crystals with GPU-based Ising MachinesCode0
Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching0
GPU-accelerated Evolutionary Many-objective Optimization Using Tensorized NSGA-IIICode3
Nonuniform-Tensor-Parallelism: Mitigating GPU failure impact for Scaled-up LLM Training0
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE InferenceCode2
HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention ModelingCode1
SmolVLM: Redefining small and efficient multimodal models0
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home ClustersCode0
Leveraging State Space Models in Long Range Genomics0
Weak-for-Strong: Training Weak Meta-Agent to Harness Strong ExecutorsCode2
Scaling Graph Neural Networks for Particle Track ReconstructionCode1
Stereo-LiDAR Fusion by Semi-Global Matching With Discrete Disparity-Matching Cost and SemidensificationCode0
Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models0
SLOs-Serve: Optimized Serving of Multi-SLO LLMs0
DeepOHeat-v1: Efficient Operator Learning for Fast and Trustworthy Thermal Simulation and Optimization in 3D-IC DesignCode0
Meta-DAN: towards an efficient prediction strategy for page-level handwritten text recognitionCode1
HeterMoE: Efficient Training of Mixture-of-Experts Models on Heterogeneous GPUs0
Accurate GPU Memory Prediction for Deep Learning Jobs through Dynamic Analysis0
Scaling Video-Language Models to 10K Frames via Hierarchical Differential DistillationCode2
MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism0
Incorporating the ChEES Criterion into Sequential Monte Carlo Samplers0
GPTAQ: Efficient Finetuning-Free Quantization for Asymmetric CalibrationCode2
A Truncated Newton Method for Optimal TransportCode0
Show:102550
← PrevPage 14 of 226Next →

No leaderboard results yet.