| Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Jun 25, 2024 | GPU, image-classification | Code Available | 1 |
| Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving | Jun 24, 2024 | CPU, GPU | Code Available | 7 |
| GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism | Jun 24, 2024 | GPU | Unverified | 0 |
| Video-Infinity: Distributed Long Video Generation | Jun 24, 2024 | GPU, Video Generation | Unverified | 0 |
| MLAAN: Scaling Supervised Local Learning with Multilaminar Leap Augmented Auxiliary Network | Jun 24, 2024 | GPU | Code Available | 0 |
| Hardware-Aware Neural Dropout Search for Reliable Uncertainty Prediction on FPGA | Jun 23, 2024 | Decision Making, GPU | Code Available | 0 |
| LaneSegNet Design Study | Jun 22, 2024 | Autonomous Vehicles, Decoder | Unverified | 0 |
| MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression | Jun 21, 2024 | GPU, Language Modeling | Code Available | 2 |
| GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation | Jun 21, 2024 | 3D Generation, GPU | Code Available | 2 |
| Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGA | Jun 20, 2024 | Autonomous Driving, CPU | Code Available | 1 |
| ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning | Jun 20, 2024 | GPU, Video Generation | Code Available | 0 |
| Consistency Models Made Easy | Jun 20, 2024 | Computational Efficiency, GPU | Code Available | 3 |
| UpDLRM: Accelerating Personalized Recommendation using Real-World PIM Architecture | Jun 20, 2024 | CPU, GPU | Unverified | 0 |
| CE-SSL: Computation-Efficient Semi-Supervised Learning for ECG-based Cardiovascular Diseases Detection | Jun 20, 2024 | Computational Efficiency, Electrocardiography (ECG) | Code Available | 1 |
| GPU-Accelerated DCOPF using Gradient-Based Optimization | Jun 19, 2024 | CPU, GPU | Code Available | 0 |
| VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models | Jun 19, 2024 | GPU, Language Modeling | Code Available | 3 |
| Sparse High Rank Adapters | Jun 19, 2024 | CPU, GPU | Unverified | 0 |
| Under the Hood of Tabular Data Generation Models: Benchmarks with Extensive Tuning | Jun 18, 2024 | GPU, Hyperparameter Optimization | Unverified | 0 |
| LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation | Jun 18, 2024 | GPU, Natural Language Understanding | Code Available | 1 |
| MCSD: An Efficient Language Model with Diverse Fusion | Jun 18, 2024 | GPU, Language Modeling | Unverified | 0 |
| Contraction rates for conjugate gradient and Lanczos approximate posteriors in Gaussian process regression | Jun 18, 2024 | GPU | Unverified | 0 |
| Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead | Jun 17, 2024 | GPU, Model Compression | Unverified | 0 |
| Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network | Jun 17, 2024 | CPU, Data Augmentation | Code Available | 0 |
| Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference | Jun 17, 2024 | CPU, GPU | Unverified | 0 |
| Duoduo CLIP: Efficient 3D Understanding with Multi-View Images | Jun 17, 2024 | GPU, Object | Code Available | 2 |
| VideoLLM-online: Online Video Large Language Model for Streaming Video | Jun 17, 2024 | GPU, Language Modeling | Unverified | 0 |
| What Operations can be Performed Directly on Compressed Arrays, and with What Error? | Jun 17, 2024 | GPU | Unverified | 0 |
| Optimized Speculative Sampling for GPU Hardware Accelerators | Jun 16, 2024 | Automatic Speech Recognition, GPU | Code Available | 0 |
| CancerLLM: A Large Language Model in Cancer Domain | Jun 15, 2024 | GPU, Language Modeling | Unverified | 0 |
| Bypass Back-propagation: Optimization-based Structural Pruning for Large Language Models via Policy Gradient | Jun 15, 2024 | GPU, Network Pruning | Unverified | 0 |
| IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization | Jun 15, 2024 | GPU, Image Manipulation | Code Available | 3 |
| A GPU-accelerated Large-scale Simulator for Transportation System Optimization Benchmarking | Jun 15, 2024 | Benchmarking, GPU | Code Available | 1 |
| GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion | Jun 14, 2024 | 3D Generation, GPU | Code Available | 2 |
| Deep Symbolic Optimization for Combinatorial Optimization: Accelerating Node Selection by Discovering Potential Heuristics | Jun 14, 2024 | Combinatorial Optimization, CPU | Code Available | 0 |
| PixRO: Pixel-Distributed Rotational Odometry with Gaussian Belief Propagation | Jun 14, 2024 | CPU, GPU | Unverified | 0 |
| A Training-free Sub-quadratic Cost Transformer Model Serving Framework With Hierarchically Pruned Attention | Jun 14, 2024 | GPU, Question Answering | Unverified | 0 |
| Practical offloading for fine-tuning LLM on commodity GPU via learned sparse projectors | Jun 14, 2024 | CPU, GPU | Code Available | 0 |
| Coralai: Intrinsic Evolution of Embodied Neural Cellular Automata Ecosystems | Jun 14, 2024 | Diversity, GPU | Code Available | 1 |
| Cognitively Inspired Energy-Based World Models | Jun 13, 2024 | GPU | Unverified | 0 |
| Optimal Kernel Orchestration for Tensor Programs with Korch | Jun 13, 2024 | Diversity, GPU | Code Available | 1 |
| LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks | Jun 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Modeling Ambient Scene Dynamics for Free-view Synthesis | Jun 13, 2024 | 3DGS, GPU | Unverified | 0 |
| AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image Deblurring | Jun 13, 2024 | Deblurring, Decoder | Code Available | 3 |
| Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation | Jun 13, 2024 | GPU, Image Generation | Unverified | 0 |
| XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning | Jun 13, 2024 | GPU, In-Context Learning | Code Available | 0 |
| Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs | Jun 13, 2024 | Benchmarking, GPU | Code Available | 2 |
| WonderWorld: Interactive 3D Scene Generation from a Single Image | Jun 13, 2024 | Depth Estimation, GPU | Unverified | 0 |
| COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing | Jun 13, 2024 | Denoising, GPU | Code Available | 1 |
| ME-Switch: A Memory-Efficient Expert Switching Framework for Large Language Models | Jun 13, 2024 | Code Generation, domain classification | Unverified | 0 |
| ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models | Jun 12, 2024 | GPU | Unverified | 0 |