SOTAVerified

GPU

Papers

Showing 18511875 of 5629 papers

TitleStatusHype
Anant-Net: Breaking the Curse of Dimensionality with Scalable and Interpretable Neural Surrogate for High-Dimensional PDEs0
NBF at SemEval-2025 Task 5: Light-Burst Attention Enhanced System for Multilingual Subject Recommendation0
Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving0
Quantitative Analysis of Performance Drop in DeepSeek Model QuantizationCode0
RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference0
QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach0
Sparfels: Fast Reconstruction from Sparse Unposed Imagery0
A UNet Model for Accelerated Preprocessing of CRISM Hyperspectral Data for Mineral Identification on Mars0
Phantora: Live GPU Cluster Simulation for Machine Learning System Performance Estimation0
Feature Optimization for Time Series Forecasting via Novel Randomized Uphill Climbing0
Efficient On-Chip Implementation of 4D Radar-Based 3D Object Detection on Hailo-8L0
Aggregating empirical evidence from data strategy studies: a case on model quantization0
Sionna RT: Technical Report0
Towards Easy and Realistic Network Infrastructure Testing for Large-scale Machine Learning0
TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language ModelsCode0
Efficient Domain-adaptive Continual Pretraining for the Process Industry in the German Language0
semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage0
FlashOverlap: A Lightweight Design for Efficiently Overlapping Communication and Computation0
Accelerating Mixture-of-Experts Training with Adaptive Expert Replication0
NSFlow: An End-to-End FPGA Framework with Scalable Dataflow Architecture for Neuro-Symbolic AI0
GPU accelerated program synthesis: Enumerate semantics, not syntax!0
Generative Models for Fast Simulation of Cherenkov Detectors at the Electron-Ion ColliderCode0
The Big Send-off: High Performance Collectives on GPU-based Supercomputers0
L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference0
Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification0
Show:102550
← PrevPage 75 of 226Next →

No leaderboard results yet.