| Design of a Quality Management System based on the EU Artificial Intelligence Act | Aug 8, 2024 | Document AIGPU | CodeCode Available | 0 |
| Sparse Spiking Neural-like Membrane Systems on Graphics Processing Units | Aug 8, 2024 | GPU | CodeCode Available | 0 |
| Understanding the Performance and Estimating the Cost of LLM Fine-Tuning | Aug 8, 2024 | GPUMixture-of-Experts | CodeCode Available | 0 |
| Optimization-Driven Adaptive Experimentation | Aug 8, 2024 | GPUThompson Sampling | —Unverified | 0 |
| Arctic-TILT. Business Document Understanding at Sub-Billion Scale | Aug 8, 2024 | document understandingGPU | —Unverified | 0 |
| Quantum Annealing based Power Grid Partitioning for Parallel Simulation | Aug 7, 2024 | CPUGPU | —Unverified | 0 |
| Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters | Aug 7, 2024 | GPU | CodeCode Available | 2 |
| PackMamba: Efficient Processing of Variable-Length Sequences in Mamba training | Aug 7, 2024 | GPUMamba | —Unverified | 0 |
| Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware Acceleration | Aug 7, 2024 | GPUIntrusion Detection | CodeCode Available | 1 |
| Image-to-LaTeX Converter for Mathematical Formulas and Text | Aug 7, 2024 | DecoderGPU | CodeCode Available | 1 |
| Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitation | Aug 7, 2024 | GPUQuestion Answering | —Unverified | 0 |
| Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation | Aug 7, 2024 | GPUQuantization | CodeCode Available | 1 |
| L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization | Aug 6, 2024 | GPUQuantization | —Unverified | 0 |
| A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighborhood Search | Aug 6, 2024 | BlockingGPU | —Unverified | 0 |
| SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving | Aug 5, 2024 | GPU | —Unverified | 0 |
| VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking | Aug 5, 2024 | 3D Single Object TrackingGPU | —Unverified | 0 |
| RECE: Reduced Cross-Entropy Loss for Large-Catalogue Sequential Recommenders | Aug 5, 2024 | GPURecommendation Systems | CodeCode Available | 1 |
| PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance | Aug 4, 2024 | GPUImage Generation | —Unverified | 0 |
| Deep Patch Visual SLAM | Aug 3, 2024 | GPUVisual Odometry | CodeCode Available | 4 |
| GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS | Aug 2, 2024 | GPUNavigate | CodeCode Available | 4 |
| FT K-means: A High-Performance K-means on GPU with Fault Tolerance | Aug 2, 2024 | Code GenerationGPU | CodeCode Available | 0 |
| The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines | Aug 2, 2024 | GPUHyperparameter Optimization | —Unverified | 0 |
| Adaptive Two-Stage Cloud Resource Scaling via Hierarchical Multi-Indicator Forecasting and Bayesian Decision-Making | Aug 2, 2024 | Cloud ComputingDecision Making | CodeCode Available | 1 |
| Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research | Aug 1, 2024 | CPUGPU | —Unverified | 0 |
| Data-Driven Traffic Simulation for an Intersection in a Metropolis | Aug 1, 2024 | GPUTrajectory Forecasting | —Unverified | 0 |