| Quantum Annealing based Power Grid Partitioning for Parallel Simulation | Aug 7, 2024 | CPUGPU | —Unverified | 0 |
| Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitation | Aug 7, 2024 | GPUQuestion Answering | —Unverified | 0 |
| PackMamba: Efficient Processing of Variable-Length Sequences in Mamba training | Aug 7, 2024 | GPUMamba | —Unverified | 0 |
| L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization | Aug 6, 2024 | GPUQuantization | —Unverified | 0 |
| A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighborhood Search | Aug 6, 2024 | BlockingGPU | —Unverified | 0 |
| SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving | Aug 5, 2024 | GPU | —Unverified | 0 |
| VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking | Aug 5, 2024 | 3D Single Object TrackingGPU | —Unverified | 0 |
| PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance | Aug 4, 2024 | GPUImage Generation | —Unverified | 0 |
| FT K-means: A High-Performance K-means on GPU with Fault Tolerance | Aug 2, 2024 | Code GenerationGPU | CodeCode Available | 0 |
| The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines | Aug 2, 2024 | GPUHyperparameter Optimization | —Unverified | 0 |
| Data-Driven Traffic Simulation for an Intersection in a Metropolis | Aug 1, 2024 | GPUTrajectory Forecasting | —Unverified | 0 |
| Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research | Aug 1, 2024 | CPUGPU | —Unverified | 0 |
| Towards Scalable GPU-Accelerated SNN Training via Temporal Fusion | Aug 1, 2024 | GPUNavigate | CodeCode Available | 0 |
| Finch: Prompt-guided Key-Value Cache Compression | Jul 31, 2024 | GPULanguage Modeling | —Unverified | 0 |
| ThinK: Thinner Key Cache by Query-Driven Pruning | Jul 30, 2024 | GPUQuantization | —Unverified | 0 |
| NeuroSEM: A hybrid framework for simulating multiphysics problems by coupling PINNs and spectral elements | Jul 30, 2024 | CPUGPU | CodeCode Available | 0 |
| Toward Efficient Permutation for Hierarchical N:M Sparsity on GPUs | Jul 30, 2024 | GPU | —Unverified | 0 |
| GPU-based data processing for speeding-up correlation plenoptic imaging | Jul 30, 2024 | GPU | —Unverified | 0 |
| ByteCheckpoint: A Unified Checkpointing System for Large Foundation Model Development | Jul 29, 2024 | GPU | —Unverified | 0 |
| Simply Trainable Nearest Neighbour Machine Translation with GPU Inference | Jul 29, 2024 | Domain AdaptationGPU | —Unverified | 0 |
| SAPG: Split and Aggregate Policy Gradients | Jul 29, 2024 | Decision MakingGPU | —Unverified | 0 |
| Graphite: A Graph-based Extreme Multi-Label Short Text Classifier for Keyphrase Recommendation | Jul 29, 2024 | GPUtext-classification | —Unverified | 0 |
| Mini-batch Coresets for Memory-efficient Training of Large Language Models | Jul 28, 2024 | GPUNetwork Pruning | —Unverified | 0 |
| WindsorML: High-Fidelity Computational Fluid Dynamics Dataset For Automotive Aerodynamics | Jul 27, 2024 | GPU | —Unverified | 0 |
| NARVis: Neural Accelerated Rendering for Real-Time Scientific Point Cloud Visualization | Jul 26, 2024 | GPU | —Unverified | 0 |