| Token-wise Influential Training Data Retrieval for Large Language Models | May 20, 2024 | CPUGPU | CodeCode Available | 1 |
| SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model | May 20, 2024 | Audio ClassificationGPU | CodeCode Available | 2 |
| Hybrid CNN-Transformer Architecture for Efficient Large-Scale Video Snapshot Compressive Imaging | May 19, 2024 | GPU | CodeCode Available | 1 |
| Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries | May 19, 2024 | 6D Pose EstimationGPU | CodeCode Available | 2 |
| MAMCA -- Optimal on Accuracy and Efficiency for Automatic Modulation Classification with Extended Signal Length | May 18, 2024 | DenoisingGPU | CodeCode Available | 2 |
| ENOVA: Autoscaling towards Cost-effective and Stable Serverless LLM Serving | May 17, 2024 | DiversityGPU | —Unverified | 0 |
| Specialising and Analysing Instruction-Tuned and Byte-Level Language Models for Organic Reaction Prediction | May 17, 2024 | Chemical Reaction PredictionDecoder | —Unverified | 0 |
| IGOT: Information Gain Optimized Tokenizer on Domain Adaptive Pretraining | May 16, 2024 | Domain AdaptationGPU | —Unverified | 0 |
| HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models | May 16, 2024 | GPULanguage Modelling | CodeCode Available | 1 |
| Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model | May 15, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| The Developing Human Connectome Project: A Fast Deep Learning-based Pipeline for Neonatal Cortical Surface Reconstruction | May 14, 2024 | GPUSurface Reconstruction | CodeCode Available | 1 |
| Computation-Aware Kalman Filtering and Smoothing | May 14, 2024 | GPU | CodeCode Available | 1 |
| Challenges in Deploying Long-Context Transformers: A Theoretical Peak Performance Analysis | May 14, 2024 | 4kGPU | —Unverified | 0 |
| Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach | May 14, 2024 | GPUreinforcement-learning | —Unverified | 0 |
| No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding | May 14, 2024 | Action DetectionGPU | CodeCode Available | 1 |
| Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis | May 13, 2024 | GPUTexture Synthesis | —Unverified | 0 |
| Do Bayesian imaging methods report trustworthy probabilities? | May 13, 2024 | DenoisingGPU | —Unverified | 0 |
| Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation | May 13, 2024 | GPU | —Unverified | 0 |
| NGD-SLAM: Towards Real-Time Dynamic SLAM without GPU | May 12, 2024 | CPUDeep Learning | CodeCode Available | 3 |
| Differentiable Model Scaling using Differentiable Topk | May 12, 2024 | GPUimage-classification | CodeCode Available | 1 |
| Sparse Sampling is All You Need for Fast Wrong-way Cycling Detection in CCTV Videos | May 12, 2024 | AllGPU | —Unverified | 0 |
| Input Snapshots Fusion for Scalable Discrete Dynamic Graph Nerual Networks | May 11, 2024 | DenoisingGPU | —Unverified | 0 |
| SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models | May 10, 2024 | GPUQuantization | —Unverified | 0 |
| Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering | May 10, 2024 | GPUNeRF | —Unverified | 0 |
| Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection | May 10, 2024 | Autonomous DrivingGPU | —Unverified | 0 |