| PointMamba: A Simple State Space Model for Point Cloud Analysis | Feb 16, 2024 | GPUMamba | CodeCode Available | 4 |
| Evaluating Neural Radiance Fields (NeRFs) for 3D Plant Geometry Reconstruction in Field Conditions | Feb 15, 2024 | 3D ReconstructionGPU | —Unverified | 0 |
| ME-ViT: A Single-Load Memory-Efficient FPGA Accelerator for Vision Transformers | Feb 15, 2024 | GPU | —Unverified | 0 |
| QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference | Feb 15, 2024 | GPUQuantization | CodeCode Available | 2 |
| Efficient Language Adaptive Pre-training: Extending State-of-the-Art Large Language Models for Polish | Feb 15, 2024 | DecoderGPU | —Unverified | 0 |
| BitDelta: Your Fine-Tune May Only Be Worth One Bit | Feb 15, 2024 | GPU | CodeCode Available | 3 |
| Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment | Feb 15, 2024 | GPUReinforcement Learning (RL) | CodeCode Available | 1 |
| TinyCL: An Efficient Hardware Architecture for Continual Learning on Autonomous Systems | Feb 15, 2024 | Continual LearningGPU | —Unverified | 0 |
| Active Disruption Avoidance and Trajectory Design for Tokamak Ramp-downs with Neural Differential Equations and Reinforcement Learning | Feb 14, 2024 | GPU | —Unverified | 0 |
| Listening to Multi-talker Conversations: Modular and End-to-end Perspectives | Feb 14, 2024 | GPUspeaker-diarization | —Unverified | 0 |
| MLTCP: Congestion Control for DNN Training | Feb 14, 2024 | GPU | —Unverified | 0 |
| DisGNet: A Distance Graph Neural Network for Forward Kinematics Learning of Gough-Stewart Platform | Feb 14, 2024 | GPUGraph Neural Network | CodeCode Available | 0 |
| MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech | Feb 14, 2024 | DecoderGPU | —Unverified | 0 |
| HyCubE: Efficient Knowledge Hypergraph 3D Circular Convolutional Embedding | Feb 14, 2024 | GPUhypergraph embedding | —Unverified | 0 |
| Stochastic Spiking Attention: Accelerating Attention with Stochastic Computing in Spiking Networks | Feb 14, 2024 | GPU | —Unverified | 0 |
| Graph Feature Preprocessor: Real-time Subgraph-based Feature Extraction for Financial Crime Detection | Feb 13, 2024 | CPUGPU | —Unverified | 0 |
| Multi-Level GNN Preconditioner for Solving Large Scale Problems | Feb 13, 2024 | GPU | —Unverified | 0 |
| Accelerating Distributed Deep Learning using Lossless Homomorphic Compression | Feb 12, 2024 | Computational EfficiencyCPU | CodeCode Available | 0 |
| The I/O Complexity of Attention, or How Optimal is Flash Attention? | Feb 12, 2024 | GPU | —Unverified | 0 |
| Anchor-based Large Language Models | Feb 12, 2024 | Computational EfficiencyDecoder | CodeCode Available | 1 |
| Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT | Feb 12, 2024 | BenchmarkingChunking | —Unverified | 0 |
| Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute Systems | Feb 12, 2024 | GPUobject-detection | CodeCode Available | 0 |
| Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models | Feb 10, 2024 | CPUGPU | CodeCode Available | 3 |
| Cardiac ultrasound simulation for autonomous ultrasound navigation | Feb 9, 2024 | DiagnosticGPU | —Unverified | 0 |
| On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model Inference | Feb 9, 2024 | GPULanguage Modeling | CodeCode Available | 2 |