| Sparse Spiking Neural-like Membrane Systems on Graphics Processing Units | Aug 8, 2024 | GPU | CodeCode Available | 0 |
| Understanding the Performance and Estimating the Cost of LLM Fine-Tuning | Aug 8, 2024 | GPUMixture-of-Experts | CodeCode Available | 0 |
| Design of a Quality Management System based on the EU Artificial Intelligence Act | Aug 8, 2024 | Document AIGPU | CodeCode Available | 0 |
| Optimization-Driven Adaptive Experimentation | Aug 8, 2024 | GPUThompson Sampling | —Unverified | 0 |
| Arctic-TILT. Business Document Understanding at Sub-Billion Scale | Aug 8, 2024 | document understandingGPU | —Unverified | 0 |
| Quantum Annealing based Power Grid Partitioning for Parallel Simulation | Aug 7, 2024 | CPUGPU | —Unverified | 0 |
| Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters | Aug 7, 2024 | GPU | CodeCode Available | 2 |
| PackMamba: Efficient Processing of Variable-Length Sequences in Mamba training | Aug 7, 2024 | GPUMamba | —Unverified | 0 |
| Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitation | Aug 7, 2024 | GPUQuestion Answering | —Unverified | 0 |
| Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware Acceleration | Aug 7, 2024 | GPUIntrusion Detection | CodeCode Available | 1 |
| Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation | Aug 7, 2024 | GPUQuantization | CodeCode Available | 1 |
| Image-to-LaTeX Converter for Mathematical Formulas and Text | Aug 7, 2024 | DecoderGPU | CodeCode Available | 1 |
| L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization | Aug 6, 2024 | GPUQuantization | —Unverified | 0 |
| A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighborhood Search | Aug 6, 2024 | BlockingGPU | —Unverified | 0 |
| SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving | Aug 5, 2024 | GPU | —Unverified | 0 |
| VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking | Aug 5, 2024 | 3D Single Object TrackingGPU | —Unverified | 0 |
| RECE: Reduced Cross-Entropy Loss for Large-Catalogue Sequential Recommenders | Aug 5, 2024 | GPURecommendation Systems | CodeCode Available | 1 |
| PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance | Aug 4, 2024 | GPUImage Generation | —Unverified | 0 |
| Deep Patch Visual SLAM | Aug 3, 2024 | GPUVisual Odometry | CodeCode Available | 4 |
| GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS | Aug 2, 2024 | GPUNavigate | CodeCode Available | 4 |
| FT K-means: A High-Performance K-means on GPU with Fault Tolerance | Aug 2, 2024 | Code GenerationGPU | CodeCode Available | 0 |
| The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines | Aug 2, 2024 | GPUHyperparameter Optimization | —Unverified | 0 |
| Adaptive Two-Stage Cloud Resource Scaling via Hierarchical Multi-Indicator Forecasting and Bayesian Decision-Making | Aug 2, 2024 | Cloud ComputingDecision Making | CodeCode Available | 1 |
| Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research | Aug 1, 2024 | CPUGPU | —Unverified | 0 |
| Data-Driven Traffic Simulation for an Intersection in a Metropolis | Aug 1, 2024 | GPUTrajectory Forecasting | —Unverified | 0 |
| Conformal Trajectory Prediction with Multi-View Data Integration in Cooperative Driving | Aug 1, 2024 | Conformal PredictionData Integration | CodeCode Available | 1 |
| Towards Scalable GPU-Accelerated SNN Training via Temporal Fusion | Aug 1, 2024 | GPUNavigate | CodeCode Available | 0 |
| Finch: Prompt-guided Key-Value Cache Compression | Jul 31, 2024 | GPULanguage Modeling | —Unverified | 0 |
| GPU-based data processing for speeding-up correlation plenoptic imaging | Jul 30, 2024 | GPU | —Unverified | 0 |
| CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning | Jul 30, 2024 | Contrastive LearningDiagnostic | CodeCode Available | 1 |
| Toward Efficient Permutation for Hierarchical N:M Sparsity on GPUs | Jul 30, 2024 | GPU | —Unverified | 0 |
| Palu: Compressing KV-Cache with Low-Rank Projection | Jul 30, 2024 | GPUQuantization | CodeCode Available | 2 |
| NeuroSEM: A hybrid framework for simulating multiphysics problems by coupling PINNs and spectral elements | Jul 30, 2024 | CPUGPU | CodeCode Available | 0 |
| ThinK: Thinner Key Cache by Query-Driven Pruning | Jul 30, 2024 | GPUQuantization | —Unverified | 0 |
| OmniBal: Towards Fast Instruct-tuning for Vision-Language Models via Omniverse Computation Balance | Jul 30, 2024 | GPU | CodeCode Available | 1 |
| Pruning Large Language Models with Semi-Structural Adaptive Sparse Training | Jul 30, 2024 | GPUKnowledge Distillation | CodeCode Available | 1 |
| Graphite: A Graph-based Extreme Multi-Label Short Text Classifier for Keyphrase Recommendation | Jul 29, 2024 | GPUtext-classification | —Unverified | 0 |
| SAPG: Split and Aggregate Policy Gradients | Jul 29, 2024 | Decision MakingGPU | —Unverified | 0 |
| ByteCheckpoint: A Unified Checkpointing System for Large Foundation Model Development | Jul 29, 2024 | GPU | —Unverified | 0 |
| Practical Video Object Detection via Feature Selection and Aggregation | Jul 29, 2024 | feature selectionGPU | CodeCode Available | 3 |
| Simply Trainable Nearest Neighbour Machine Translation with GPU Inference | Jul 29, 2024 | Domain AdaptationGPU | —Unverified | 0 |
| Mini-batch Coresets for Memory-efficient Training of Large Language Models | Jul 28, 2024 | GPUNetwork Pruning | —Unverified | 0 |
| WindsorML: High-Fidelity Computational Fluid Dynamics Dataset For Automotive Aerodynamics | Jul 27, 2024 | GPU | —Unverified | 0 |
| NARVis: Neural Accelerated Rendering for Real-Time Scientific Point Cloud Visualization | Jul 26, 2024 | GPU | —Unverified | 0 |
| Textile Anomaly Detection: Evaluation of the State-of-the-Art for Automated Quality Inspection of Carpet | Jul 26, 2024 | Anomaly DetectionCPU | —Unverified | 0 |
| HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors | Jul 26, 2024 | Depth EstimationGPU | CodeCode Available | 2 |
| HG-PIPE: Vision Transformer Acceleration with Hybrid-Grained Pipeline | Jul 25, 2024 | GPU | —Unverified | 0 |
| Keep the Cost Down: A Review on Methods to Optimize LLM' s KV-Cache Consumption | Jul 25, 2024 | GPU | CodeCode Available | 0 |
| SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention | Jul 23, 2024 | Code GenerationGPU | —Unverified | 0 |
| ESOD: Efficient Small Object Detection on High-Resolution Images | Jul 23, 2024 | GPUObject | CodeCode Available | 2 |