| FreeRide: Harvesting Bubbles in Pipeline Parallelism | Sep 11, 2024 | GPULanguage Modeling | —Unverified | 0 |
| InstructSing: High-Fidelity Singing Voice Generation via Instructing Yourself | Sep 10, 2024 | GPU | —Unverified | 0 |
| GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction | Sep 10, 2024 | 3DGSGPU | —Unverified | 0 |
| Enhancing Sequential Recommendations through Multi-Perspective Reflections and Iteration | Sep 10, 2024 | Collaborative FilteringGPU | —Unverified | 0 |
| CoDiCast: Conditional Diffusion Model for Global Weather Prediction with Uncertainty Quantification | Sep 9, 2024 | Computational EfficiencyDenoising | CodeCode Available | 0 |
| Scalable Multitask Learning Using Gradient-based Estimation of Task Affinity | Sep 9, 2024 | GPUMulti-Label Classification | CodeCode Available | 0 |
| TriplePlay: Enhancing Federated Learning with CLIP for Non-IID Data and Resource Efficiency | Sep 9, 2024 | FairnessFederated Learning | —Unverified | 0 |
| Optimizing VarLiNGAM for Scalable and Efficient Time Series Causal Discovery | Sep 9, 2024 | Causal DiscoveryGPU | —Unverified | 0 |
| Resource-Efficient Generative AI Model Deployment in Mobile Edge Networks | Sep 9, 2024 | GPU | —Unverified | 0 |
| ELMS: Elasticized Large Language Models On Mobile Devices | Sep 8, 2024 | GPULanguage Modelling | —Unverified | 0 |
| InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference | Sep 8, 2024 | Edge-computingGPU | —Unverified | 0 |
| From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems | Sep 8, 2024 | Audio TaggingEvent Detection | —Unverified | 0 |
| MultiCounter: Multiple Action Agnostic Repetition Counting in Untrimmed Videos | Sep 6, 2024 | GPURepetitive Action Counting | —Unverified | 0 |
| Confidential Computing on NVIDIA Hopper GPUs: A Performance Benchmark Study | Sep 6, 2024 | CPUGPU | —Unverified | 0 |
| mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding | Sep 5, 2024 | document understandingGPU | —Unverified | 0 |
| Hardware Acceleration of LLMs: A comprehensive survey and comparison | Sep 5, 2024 | GPUSurvey | —Unverified | 0 |
| Differentiable Discrete Event Simulation for Queuing Network Control | Sep 5, 2024 | GPUReinforcement Learning (RL) | —Unverified | 0 |
| LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-Resolution | Sep 5, 2024 | GPUImage Super-Resolution | CodeCode Available | 1 |
| LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones | Sep 5, 2024 | CPUGPU | CodeCode Available | 1 |
| ISO: Overlap of Computation and Communication within Seqenence For LLM Inference | Sep 4, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Hallucination Detection in LLMs: Fast and Memory-Efficient Fine-Tuned Models | Sep 4, 2024 | GPUHallucination | CodeCode Available | 0 |
| LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid Architecture | Sep 4, 2024 | GPUMamba | CodeCode Available | 3 |
| AdvSecureNet: A Python Toolkit for Adversarial Machine Learning | Sep 4, 2024 | GPU | CodeCode Available | 0 |
| Accelerating Large Language Model Training with Hybrid GPU-based Compression | Sep 4, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Toward Capturing Genetic Epistasis From Multivariate Genome-Wide Association Studies Using Mixed-Precision Kernel Ridge Regression | Sep 3, 2024 | CPUGPU | —Unverified | 0 |
| LinFusion: 1 GPU, 1 Minute, 16K Image | Sep 3, 2024 | 16kCausal Inference | CodeCode Available | 3 |
| GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Sep 3, 2024 | 3DGSGPU | —Unverified | 0 |
| Compressing VAE-Based Out-of-Distribution Detectors for Embedded Deployment | Sep 2, 2024 | CPUGPU | —Unverified | 0 |
| TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval | Sep 2, 2024 | GPURetrieval | CodeCode Available | 1 |
| Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation | Sep 2, 2024 | GPU | CodeCode Available | 2 |
| Enhancing Privacy in Federated Learning: Secure Aggregation for Real-World Healthcare Applications | Sep 2, 2024 | CPUFederated Learning | CodeCode Available | 2 |
| VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges | Sep 2, 2024 | GPUMVBench | —Unverified | 0 |
| OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model | Sep 2, 2024 | GPUVideo Generation | —Unverified | 0 |
| Accelerating Hybrid Agent-Based Models and Fuzzy Cognitive Maps: How to Combine Agents who Think Alike? | Sep 1, 2024 | Community DetectionGPU | —Unverified | 0 |
| LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models | Aug 31, 2024 | 8kGPU | CodeCode Available | 2 |
| ContextVLM: Zero-Shot and Few-Shot Context Understanding for Autonomous Driving using Vision Language Models | Aug 30, 2024 | Autonomous DrivingGPU | —Unverified | 0 |
| VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers | Aug 30, 2024 | GPUImage Generation | —Unverified | 0 |
| Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer | Aug 30, 2024 | GPULanguage Modeling | —Unverified | 0 |
| MemLong: Memory-Augmented Retrieval for Long Text Modeling | Aug 30, 2024 | 4kDecoder | CodeCode Available | 2 |
| H-SGANet: Hybrid Sparse Graph Attention Network for Deformable Medical Image Registration | Aug 29, 2024 | Deformable Medical Image RegistrationGPU | —Unverified | 0 |
| TinyTNAS: GPU-Free, Time-Bound, Hardware-Aware Neural Architecture Search for TinyML Time Series Classification | Aug 29, 2024 | CPUDiagnostic | CodeCode Available | 1 |
| 3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability | Aug 28, 2024 | Arithmetic ReasoningGPU | CodeCode Available | 0 |
| Conan-embedding: General Text Embedding with More and Better Negative Samples | Aug 28, 2024 | Contrastive LearningGPU | —Unverified | 0 |
| microYOLO: Towards Single-Shot Object Detection on Microcontrollers | Aug 28, 2024 | GPUObject | —Unverified | 0 |
| InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation | Aug 28, 2024 | Cell SegmentationGPU | CodeCode Available | 3 |
| SCAN-Edge: Finding MobileNet-speed Hybrid Networks for Diverse Edge Devices via Hardware-Aware Evolutionary Search | Aug 27, 2024 | CPUGPU | —Unverified | 0 |
| GPU-Accelerated Counterfactual Regret Minimization | Aug 27, 2024 | counterfactualGPU | CodeCode Available | 1 |
| OctFusion: Octree-based Diffusion Models for 3D Shape Generation | Aug 27, 2024 | 3D Generation3D Shape Generation | CodeCode Available | 3 |
| Text-guided Foundation Model Adaptation for Long-Tailed Medical Image Classification | Aug 27, 2024 | DiagnosticGPU | —Unverified | 0 |
| The Mamba in the Llama: Distilling and Accelerating Hybrid Models | Aug 27, 2024 | GPULanguage Modeling | CodeCode Available | 3 |