| UpDLRM: Accelerating Personalized Recommendation using Real-World PIM Architecture | Jun 20, 2024 | CPUGPU | —Unverified | 0 |
| Sparse High Rank Adapters | Jun 19, 2024 | CPUGPU | —Unverified | 0 |
| GPU-Accelerated DCOPF using Gradient-Based Optimization | Jun 19, 2024 | CPUGPU | CodeCode Available | 0 |
| Under the Hood of Tabular Data Generation Models: Benchmarks with Extensive Tuning | Jun 18, 2024 | GPUHyperparameter Optimization | —Unverified | 0 |
| Contraction rates for conjugate gradient and Lanczos approximate posteriors in Gaussian process regression | Jun 18, 2024 | GPU | —Unverified | 0 |
| MCSD: An Efficient Language Model with Diverse Fusion | Jun 18, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network | Jun 17, 2024 | CPUData Augmentation | CodeCode Available | 0 |
| Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference | Jun 17, 2024 | CPUGPU | —Unverified | 0 |
| What Operations can be Performed Directly on Compressed Arrays, and with What Error? | Jun 17, 2024 | GPU | —Unverified | 0 |
| VideoLLM-online: Online Video Large Language Model for Streaming Video | Jun 17, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead | Jun 17, 2024 | GPUModel Compression | —Unverified | 0 |
| Optimized Speculative Sampling for GPU Hardware Accelerators | Jun 16, 2024 | Automatic Speech RecognitionGPU | CodeCode Available | 0 |
| CancerLLM: A Large Language Model in Cancer Domain | Jun 15, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Bypass Back-propagation: Optimization-based Structural Pruning for Large Language Models via Policy Gradient | Jun 15, 2024 | GPUNetwork Pruning | —Unverified | 0 |
| A Training-free Sub-quadratic Cost Transformer Model Serving Framework With Hierarchically Pruned Attention | Jun 14, 2024 | GPUQuestion Answering | —Unverified | 0 |
| Deep Symbolic Optimization for Combinatorial Optimization: Accelerating Node Selection by Discovering Potential Heuristics | Jun 14, 2024 | Combinatorial OptimizationCPU | CodeCode Available | 0 |
| Practical offloading for fine-tuning LLM on commodity GPU via learned sparse projectors | Jun 14, 2024 | CPUGPU | CodeCode Available | 0 |
| PixRO: Pixel-Distributed Rotational Odometry with Gaussian Belief Propagation | Jun 14, 2024 | CPUGPU | —Unverified | 0 |
| Modeling Ambient Scene Dynamics for Free-view Synthesis | Jun 13, 2024 | 3DGSGPU | —Unverified | 0 |
| Cognitively Inspired Energy-Based World Models | Jun 13, 2024 | GPU | —Unverified | 0 |
| Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation | Jun 13, 2024 | GPUImage Generation | —Unverified | 0 |
| LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks | Jun 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| ME-Switch: A Memory-Efficient Expert Switching Framework for Large Language Models | Jun 13, 2024 | Code Generationdomain classification | —Unverified | 0 |
| WonderWorld: Interactive 3D Scene Generation from a Single Image | Jun 13, 2024 | Depth EstimationGPU | —Unverified | 0 |
| XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning | Jun 13, 2024 | GPUIn-Context Learning | CodeCode Available | 0 |
| ProTrain: Efficient LLM Training via Memory-Aware Techniques | Jun 12, 2024 | CPUGPU | —Unverified | 0 |
| ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models | Jun 12, 2024 | GPU | —Unverified | 0 |
| GraphFM: A Comprehensive Benchmark for Graph Foundation Model | Jun 12, 2024 | GPUGraph Neural Network | CodeCode Available | 0 |
| VoxNeuS: Enhancing Voxel-Based Neural Surface Reconstruction via Gradient Interpolation | Jun 11, 2024 | GPUSurface Reconstruction | —Unverified | 0 |
| PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models | Jun 11, 2024 | CPUGPU | —Unverified | 0 |
| Benchmarking and Boosting Radiology Report Generation for 3D High-Resolution Medical Images | Jun 11, 2024 | BenchmarkingGPU | —Unverified | 0 |
| Sustainable self-supervised learning for speech representations | Jun 11, 2024 | GPUSelf-Supervised Learning | —Unverified | 0 |
| Label-Looping: Highly Efficient Decoding for Transducers | Jun 10, 2024 | GPUspeech-recognition | —Unverified | 0 |
| Enhancing Large-Scale AI Training Efficiency: The C4 Solution for Real-Time Anomaly Detection and Communication Optimization | Jun 7, 2024 | Anomaly DetectionGPU | —Unverified | 0 |
| Speed of Light Exact Greedy Decoding for RNN-T Speech Recognition Models on GPU | Jun 6, 2024 | GPUspeech-recognition | —Unverified | 0 |
| ReDistill: Residual Encoded Distillation for Peak Memory Reduction | Jun 6, 2024 | DenoisingGPU | —Unverified | 0 |
| Quality-Diversity with Limited Resources | Jun 6, 2024 | DiversityGPU | CodeCode Available | 0 |
| Global Parameterization-based Texture Space Optimization | Jun 6, 2024 | GPU | —Unverified | 0 |
| Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity | Jun 5, 2024 | GPUQuantization | —Unverified | 0 |
| Searching Priors Makes Text-to-Video Synthesis Better | Jun 5, 2024 | GPU | —Unverified | 0 |
| A Flexible Recursive Network for Video Stereo Matching Based on Residual Estimation | Jun 5, 2024 | GPUStereo Matching | CodeCode Available | 0 |
| A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection | Jun 5, 2024 | Anomaly DetectionBenchmarking | —Unverified | 0 |
| A Study of Optimizations for Fine-tuning Large Language Models | Jun 4, 2024 | GPU | —Unverified | 0 |
| Speeding up Policy Simulation in Supply Chain RL | Jun 4, 2024 | GPU | —Unverified | 0 |
| CE-NAS: An End-to-End Carbon-Efficient Neural Architecture Search Framework | Jun 3, 2024 | GPUNeural Architecture Search | —Unverified | 0 |
| GPU-Accelerated Rule Evaluation and Evolution | Jun 3, 2024 | Explainable artificial intelligenceGPU | —Unverified | 0 |
| D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models | Jun 3, 2024 | GPUMath | —Unverified | 0 |
| OLoRA: Orthonormal Low-Rank Adaptation of Large Language Models | Jun 3, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Advancing Supervised Local Learning Beyond Classification with Long-term Feature Bank | Jun 1, 2024 | GPUimage-classification | —Unverified | 0 |
| Multi-Objective Neural Architecture Search by Learning Search Space Partitions | Jun 1, 2024 | Bayesian OptimizationGPU | —Unverified | 0 |