| FT K-means: A High-Performance K-means on GPU with Fault Tolerance | Aug 2, 2024 | Code GenerationGPU | CodeCode Available | 0 |
| The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines | Aug 2, 2024 | GPUHyperparameter Optimization | —Unverified | 0 |
| Adaptive Two-Stage Cloud Resource Scaling via Hierarchical Multi-Indicator Forecasting and Bayesian Decision-Making | Aug 2, 2024 | Cloud ComputingDecision Making | CodeCode Available | 1 |
| Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research | Aug 1, 2024 | CPUGPU | —Unverified | 0 |
| Data-Driven Traffic Simulation for an Intersection in a Metropolis | Aug 1, 2024 | GPUTrajectory Forecasting | —Unverified | 0 |
| Conformal Trajectory Prediction with Multi-View Data Integration in Cooperative Driving | Aug 1, 2024 | Conformal PredictionData Integration | CodeCode Available | 1 |
| Towards Scalable GPU-Accelerated SNN Training via Temporal Fusion | Aug 1, 2024 | GPUNavigate | CodeCode Available | 0 |
| Finch: Prompt-guided Key-Value Cache Compression | Jul 31, 2024 | GPULanguage Modeling | —Unverified | 0 |
| GPU-based data processing for speeding-up correlation plenoptic imaging | Jul 30, 2024 | GPU | —Unverified | 0 |
| Pruning Large Language Models with Semi-Structural Adaptive Sparse Training | Jul 30, 2024 | GPUKnowledge Distillation | CodeCode Available | 1 |