| MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors | May 2, 2024 | 3D Object Captioning3D Object Classification | CodeCode Available | 2 |
| Streamlining Image Editing with Layered Diffusion Brushes | May 1, 2024 | AttributeDenoising | —Unverified | 0 |
| Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge | May 1, 2024 | GPU | CodeCode Available | 0 |
| A Comprehensive Survey of Dynamic Graph Neural Networks: Models, Frameworks, Benchmarks, Experiments and Challenges | May 1, 2024 | GPU | —Unverified | 0 |
| Extending Llama-3's Context Ten-Fold Overnight | Apr 30, 2024 | 8kGPU | —Unverified | 0 |
| Bypassing Skip-Gram Negative Sampling: Dimension Regularization as a More Efficient Alternative for Graph Embeddings | Apr 30, 2024 | GPUGraph Embedding | —Unverified | 0 |
| MicroDreamer: Efficient 3D Generation in 20 Seconds by Score-based Iterative Reconstruction | Apr 30, 2024 | 3D Generation3D Reconstruction | CodeCode Available | 2 |
| GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting | Apr 30, 2024 | 3D GenerationGPU | —Unverified | 0 |
| LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report | Apr 29, 2024 | GPUparameter-efficient fine-tuning | CodeCode Available | 1 |
| HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis | Apr 29, 2024 | CPUEdge-computing | CodeCode Available | 2 |
| Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism | Apr 29, 2024 | document understandingGPU | CodeCode Available | 0 |
| CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception | Apr 29, 2024 | Data VisualizationDecision Making | CodeCode Available | 1 |
| Mamba-FETrack: Frame-Event Tracking via State Space Model | Apr 28, 2024 | GPUMamba | CodeCode Available | 4 |
| Deep Learning for Low-Latency, Quantum-Ready RF Sensing | Apr 27, 2024 | CPUDeep Learning | —Unverified | 0 |
| Child Speech Recognition in Human-Robot Interaction: Problem Solved? | Apr 26, 2024 | GPUspeech-recognition | —Unverified | 0 |
| Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection | Apr 26, 2024 | Classify murmursGPU | —Unverified | 0 |
| Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services | Apr 25, 2024 | GPU | CodeCode Available | 3 |
| NeRF-XL: Scaling NeRFs with Multiple GPUs | Apr 24, 2024 | GPUNeRF | —Unverified | 0 |
| GPU-RANC: A CUDA Accelerated Simulation Framework for Neuromorphic Architectures | Apr 24, 2024 | GPU | —Unverified | 0 |
| CORM: Cache Optimization with Recent Message for Large Language Model Inference | Apr 24, 2024 | GPULanguage Modeling | —Unverified | 0 |
| BASS: Batched Attention-optimized Speculative Sampling | Apr 24, 2024 | GPUHumanEval | —Unverified | 0 |
| CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method | Apr 23, 2024 | DenoisingGPU | CodeCode Available | 1 |
| CNN-Based Equalization for Communications: Achieving Gigabit Throughput with a Flexible FPGA Hardware Architecture | Apr 22, 2024 | GPUQuantization | —Unverified | 0 |
| Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity | Apr 22, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| Apodotiko: Enabling Efficient Serverless Federated Learning in Heterogeneous Environments | Apr 22, 2024 | CPUFederated Learning | —Unverified | 0 |