| UpDLRM: Accelerating Personalized Recommendation using Real-World PIM Architecture | Jun 20, 2024 | CPUGPU | —Unverified | 0 |
| GPU-Accelerated DCOPF using Gradient-Based Optimization | Jun 19, 2024 | CPUGPU | CodeCode Available | 0 |
| Sparse High Rank Adapters | Jun 19, 2024 | CPUGPU | —Unverified | 0 |
| MCSD: An Efficient Language Model with Diverse Fusion | Jun 18, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Contraction rates for conjugate gradient and Lanczos approximate posteriors in Gaussian process regression | Jun 18, 2024 | GPU | —Unverified | 0 |
| Under the Hood of Tabular Data Generation Models: Benchmarks with Extensive Tuning | Jun 18, 2024 | GPUHyperparameter Optimization | —Unverified | 0 |
| What Operations can be Performed Directly on Compressed Arrays, and with What Error? | Jun 17, 2024 | GPU | —Unverified | 0 |
| Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference | Jun 17, 2024 | CPUGPU | —Unverified | 0 |
| Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network | Jun 17, 2024 | CPUData Augmentation | CodeCode Available | 0 |
| VideoLLM-online: Online Video Large Language Model for Streaming Video | Jun 17, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead | Jun 17, 2024 | GPUModel Compression | —Unverified | 0 |
| Optimized Speculative Sampling for GPU Hardware Accelerators | Jun 16, 2024 | Automatic Speech RecognitionGPU | CodeCode Available | 0 |
| CancerLLM: A Large Language Model in Cancer Domain | Jun 15, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Bypass Back-propagation: Optimization-based Structural Pruning for Large Language Models via Policy Gradient | Jun 15, 2024 | GPUNetwork Pruning | —Unverified | 0 |
| A Training-free Sub-quadratic Cost Transformer Model Serving Framework With Hierarchically Pruned Attention | Jun 14, 2024 | GPUQuestion Answering | —Unverified | 0 |
| PixRO: Pixel-Distributed Rotational Odometry with Gaussian Belief Propagation | Jun 14, 2024 | CPUGPU | —Unverified | 0 |
| Deep Symbolic Optimization for Combinatorial Optimization: Accelerating Node Selection by Discovering Potential Heuristics | Jun 14, 2024 | Combinatorial OptimizationCPU | CodeCode Available | 0 |
| Practical offloading for fine-tuning LLM on commodity GPU via learned sparse projectors | Jun 14, 2024 | CPUGPU | CodeCode Available | 0 |
| Cognitively Inspired Energy-Based World Models | Jun 13, 2024 | GPU | —Unverified | 0 |
| ME-Switch: A Memory-Efficient Expert Switching Framework for Large Language Models | Jun 13, 2024 | Code Generationdomain classification | —Unverified | 0 |
| LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks | Jun 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| WonderWorld: Interactive 3D Scene Generation from a Single Image | Jun 13, 2024 | Depth EstimationGPU | —Unverified | 0 |
| Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation | Jun 13, 2024 | GPUImage Generation | —Unverified | 0 |
| Modeling Ambient Scene Dynamics for Free-view Synthesis | Jun 13, 2024 | 3DGSGPU | —Unverified | 0 |
| XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning | Jun 13, 2024 | GPUIn-Context Learning | CodeCode Available | 0 |