| FreeRide: Harvesting Bubbles in Pipeline Parallelism | Sep 11, 2024 | GPULanguage Modeling | —Unverified | 0 |
| InstructSing: High-Fidelity Singing Voice Generation via Instructing Yourself | Sep 10, 2024 | GPU | —Unverified | 0 |
| GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction | Sep 10, 2024 | 3DGSGPU | —Unverified | 0 |
| Enhancing Sequential Recommendations through Multi-Perspective Reflections and Iteration | Sep 10, 2024 | Collaborative FilteringGPU | —Unverified | 0 |
| CoDiCast: Conditional Diffusion Model for Global Weather Prediction with Uncertainty Quantification | Sep 9, 2024 | Computational EfficiencyDenoising | CodeCode Available | 0 |
| Scalable Multitask Learning Using Gradient-based Estimation of Task Affinity | Sep 9, 2024 | GPUMulti-Label Classification | CodeCode Available | 0 |
| Optimizing VarLiNGAM for Scalable and Efficient Time Series Causal Discovery | Sep 9, 2024 | Causal DiscoveryGPU | —Unverified | 0 |
| Resource-Efficient Generative AI Model Deployment in Mobile Edge Networks | Sep 9, 2024 | GPU | —Unverified | 0 |
| TriplePlay: Enhancing Federated Learning with CLIP for Non-IID Data and Resource Efficiency | Sep 9, 2024 | FairnessFederated Learning | —Unverified | 0 |
| ELMS: Elasticized Large Language Models On Mobile Devices | Sep 8, 2024 | GPULanguage Modelling | —Unverified | 0 |
| InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference | Sep 8, 2024 | Edge-computingGPU | —Unverified | 0 |
| From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems | Sep 8, 2024 | Audio TaggingEvent Detection | —Unverified | 0 |
| MultiCounter: Multiple Action Agnostic Repetition Counting in Untrimmed Videos | Sep 6, 2024 | GPURepetitive Action Counting | —Unverified | 0 |
| Confidential Computing on NVIDIA Hopper GPUs: A Performance Benchmark Study | Sep 6, 2024 | CPUGPU | —Unverified | 0 |
| mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding | Sep 5, 2024 | document understandingGPU | —Unverified | 0 |
| LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones | Sep 5, 2024 | CPUGPU | CodeCode Available | 1 |
| Hardware Acceleration of LLMs: A comprehensive survey and comparison | Sep 5, 2024 | GPUSurvey | —Unverified | 0 |
| LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-Resolution | Sep 5, 2024 | GPUImage Super-Resolution | CodeCode Available | 1 |
| Differentiable Discrete Event Simulation for Queuing Network Control | Sep 5, 2024 | GPUReinforcement Learning (RL) | —Unverified | 0 |
| ISO: Overlap of Computation and Communication within Seqenence For LLM Inference | Sep 4, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Hallucination Detection in LLMs: Fast and Memory-Efficient Fine-Tuned Models | Sep 4, 2024 | GPUHallucination | CodeCode Available | 0 |
| AdvSecureNet: A Python Toolkit for Adversarial Machine Learning | Sep 4, 2024 | GPU | CodeCode Available | 0 |
| Accelerating Large Language Model Training with Hybrid GPU-based Compression | Sep 4, 2024 | GPULanguage Modeling | —Unverified | 0 |
| LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid Architecture | Sep 4, 2024 | GPUMamba | CodeCode Available | 3 |
| LinFusion: 1 GPU, 1 Minute, 16K Image | Sep 3, 2024 | 16kCausal Inference | CodeCode Available | 3 |