| S3D: A Simple and Cost-Effective Self-Speculative Decoding Scheme for Low-Memory GPUs | May 30, 2024 | GPUQuantization | —Unverified | 0 |
| Knowledge Graph Tuning: Real-time Large Language Model Personalization based on Human Feedback | May 30, 2024 | GPUKnowledge Graphs | —Unverified | 0 |
| STAT: Shrinking Transformers After Training | May 29, 2024 | DecoderGPU | —Unverified | 0 |
| MoNDE: Mixture of Near-Data Experts for Large-Scale Sparse Models | May 29, 2024 | DecoderGPU | —Unverified | 0 |
| Contrastive-Adversarial and Diffusion: Exploring pre-training and fine-tuning strategies for sulcal identification | May 29, 2024 | Contrastive LearningDenoising | —Unverified | 0 |
| Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World Clusters | May 28, 2024 | GPULanguage Modeling | CodeCode Available | 0 |
| Cycle-YOLO: A Efficient and Robust Framework for Pavement Damage Detection | May 28, 2024 | GPU | —Unverified | 0 |
| Coupled Mamba: Enhanced Multi-modal Fusion with Coupled State Space Model | May 28, 2024 | GPUMamba | —Unverified | 0 |
| Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training | May 27, 2024 | DecoderGPU | —Unverified | 0 |
| Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings | May 27, 2024 | Domain AdaptationGPU | —Unverified | 0 |
| SWAT: Scalable and Efficient Window Attention-based Transformers Acceleration on FPGAs | May 27, 2024 | GPU | —Unverified | 0 |
| TrojFM: Resource-efficient Backdoor Attacks against Very Large Foundation Models | May 27, 2024 | Backdoor AttackGPU | CodeCode Available | 0 |
| CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy | May 27, 2024 | GPUSimultaneous Localization and Mapping | —Unverified | 0 |
| GPU Based Differential Evolution: New Insights and Comparative Study | May 26, 2024 | GPU | —Unverified | 0 |
| The devil is in discretization discrepancy. Robustifying Differentiable NAS with Single-Stage Searching Protocol | May 26, 2024 | GPUNeural Architecture Search | —Unverified | 0 |
| Apply Distributed CNN on Genomics to accelerate Transcription-Factor TAL1 Motif Prediction | May 25, 2024 | Deep LearningGPU | —Unverified | 0 |
| HETHUB: A Distributed Training System with Heterogeneous Cluster for Large-Scale Models | May 25, 2024 | GPU | —Unverified | 0 |
| A GPU-Accelerated Bi-linear ADMM Algorithm for Distributed Sparse Machine Learning | May 25, 2024 | GPUregression | —Unverified | 0 |
| LUCIE: A Lightweight Uncoupled ClImate Emulator with long-term stability and physical consistency for O(1000)-member ensembles | May 25, 2024 | GPU | CodeCode Available | 0 |
| Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity | May 24, 2024 | GPU | —Unverified | 0 |
| ARVideo: Autoregressive Pretraining for Self-Supervised Video Representation Learning | May 24, 2024 | GPURepresentation Learning | —Unverified | 0 |
| CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization | May 23, 2024 | Code GenerationGPU | CodeCode Available | 0 |
| LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models | May 23, 2024 | Computational EfficiencyDecoder | —Unverified | 0 |
| Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras | May 23, 2024 | 2kGPU | —Unverified | 0 |
| MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | May 23, 2024 | Action RecognitionAction Segmentation | —Unverified | 0 |
| Fast Bayesian Inference for Neutrino Non-Standard Interactions at Dark Matter Direct Detection Experiments | May 23, 2024 | Bayesian InferenceGPU | CodeCode Available | 0 |
| HoverFast: an accurate, high-throughput, clinically deployable nuclear segmentation tool for brightfield digital pathology images | May 22, 2024 | GPUKnowledge Distillation | —Unverified | 0 |
| Adversarial Training of Two-Layer Polynomial and ReLU Activation Networks via Convex Optimization | May 22, 2024 | GPU | CodeCode Available | 0 |
| ReCycle: Resilient Training of Large DNNs using Pipeline Adaptation | May 22, 2024 | GPU | —Unverified | 0 |
| Personalized Residuals for Concept-Driven Text-to-Image Generation | May 21, 2024 | GPUImage Generation | —Unverified | 0 |
| Parallelization of the K-Means Algorithm with Applications to Big Data Clustering | May 20, 2024 | ClusteringGPU | —Unverified | 0 |
| ENOVA: Autoscaling towards Cost-effective and Stable Serverless LLM Serving | May 17, 2024 | DiversityGPU | —Unverified | 0 |
| Specialising and Analysing Instruction-Tuned and Byte-Level Language Models for Organic Reaction Prediction | May 17, 2024 | Chemical Reaction PredictionDecoder | —Unverified | 0 |
| IGOT: Information Gain Optimized Tokenizer on Domain Adaptive Pretraining | May 16, 2024 | Domain AdaptationGPU | —Unverified | 0 |
| Challenges in Deploying Long-Context Transformers: A Theoretical Peak Performance Analysis | May 14, 2024 | 4kGPU | —Unverified | 0 |
| Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach | May 14, 2024 | GPUreinforcement-learning | —Unverified | 0 |
| Do Bayesian imaging methods report trustworthy probabilities? | May 13, 2024 | DenoisingGPU | —Unverified | 0 |
| Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis | May 13, 2024 | GPUTexture Synthesis | —Unverified | 0 |
| Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation | May 13, 2024 | GPU | —Unverified | 0 |
| Sparse Sampling is All You Need for Fast Wrong-way Cycling Detection in CCTV Videos | May 12, 2024 | AllGPU | —Unverified | 0 |
| Input Snapshots Fusion for Scalable Discrete Dynamic Graph Nerual Networks | May 11, 2024 | DenoisingGPU | —Unverified | 0 |
| SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models | May 10, 2024 | GPUQuantization | —Unverified | 0 |
| Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection | May 10, 2024 | Autonomous DrivingGPU | —Unverified | 0 |
| Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering | May 10, 2024 | GPUNeRF | —Unverified | 0 |
| You Only Cache Once: Decoder-Decoder Architectures for Language Models | May 8, 2024 | DecoderGPU | CodeCode Available | 0 |
| SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems | May 7, 2024 | CPUGPU | CodeCode Available | 0 |
| A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields | May 7, 2024 | GPUobject-detection | —Unverified | 0 |
| DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid | May 7, 2024 | GPUIndoor Scene Reconstruction | —Unverified | 0 |
| Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression | May 7, 2024 | GPUImage Compression | —Unverified | 0 |
| KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization | May 7, 2024 | GPULanguage Modeling | —Unverified | 0 |