| MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors | May 2, 2024 | 3D Object Captioning, 3D Object Classification | Code Available | 2 |
| Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge | May 1, 2024 | GPU | Code Available | 0 |
| A Comprehensive Survey of Dynamic Graph Neural Networks: Models, Frameworks, Benchmarks, Experiments and Challenges | May 1, 2024 | GPU | Unverified | 0 |
| Streamlining Image Editing with Layered Diffusion Brushes | May 1, 2024 | Attribute, Denoising | Unverified | 0 |
| Extending Llama-3's Context Ten-Fold Overnight | Apr 30, 2024 | 8k, GPU | Unverified | 0 |
| Bypassing Skip-Gram Negative Sampling: Dimension Regularization as a More Efficient Alternative for Graph Embeddings | Apr 30, 2024 | GPU, Graph Embedding | Unverified | 0 |
| GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting | Apr 30, 2024 | 3D Generation, GPU | Unverified | 0 |
| MicroDreamer: Efficient 3D Generation in 20 Seconds by Score-based Iterative Reconstruction | Apr 30, 2024 | 3D Generation, 3D Reconstruction | Code Available | 2 |
| LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report | Apr 29, 2024 | GPU, parameter-efficient fine-tuning | Code Available | 1 |
| HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis | Apr 29, 2024 | CPU, Edge-computing | Code Available | 2 |
| Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism | Apr 29, 2024 | document understanding, GPU | Code Available | 0 |
| CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception | Apr 29, 2024 | Data Visualization, Decision Making | Code Available | 1 |
| Mamba-FETrack: Frame-Event Tracking via State Space Model | Apr 28, 2024 | GPU, Mamba | Code Available | 4 |
| Deep Learning for Low-Latency, Quantum-Ready RF Sensing | Apr 27, 2024 | CPU, Deep Learning | Unverified | 0 |
| Child Speech Recognition in Human-Robot Interaction: Problem Solved? | Apr 26, 2024 | GPU, speech-recognition | Unverified | 0 |
| Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection | Apr 26, 2024 | Classify murmurs, GPU | Unverified | 0 |
| Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services | Apr 25, 2024 | GPU | Code Available | 3 |
| NeRF-XL: Scaling NeRFs with Multiple GPUs | Apr 24, 2024 | GPU, NeRF | Unverified | 0 |
| BASS: Batched Attention-optimized Speculative Sampling | Apr 24, 2024 | GPU, HumanEval | Unverified | 0 |
| CORM: Cache Optimization with Recent Message for Large Language Model Inference | Apr 24, 2024 | GPU, Language Modeling | Unverified | 0 |
| GPU-RANC: A CUDA Accelerated Simulation Framework for Neuromorphic Architectures | Apr 24, 2024 | GPU | Unverified | 0 |
| CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method | Apr 23, 2024 | Denoising, GPU | Code Available | 1 |
| CNN-Based Equalization for Communications: Achieving Gigabit Throughput with a Flexible FPGA Hardware Architecture | Apr 22, 2024 | GPU, Quantization | Unverified | 0 |
| Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity | Apr 22, 2024 | GPU, Language Modeling | Code Available | 1 |
| SnapKV: LLM Knows What You are Looking for Before Generation | Apr 22, 2024 | 16k, GPU | Code Available | 3 |
| Apodotiko: Enabling Efficient Serverless Federated Learning in Heterogeneous Environments | Apr 22, 2024 | CPU, Federated Learning | Unverified | 0 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Apr 22, 2024 | Common Sense Reasoning, GPU | Code Available | 3 |
| STROOBnet Optimization via GPU-Accelerated Proximal Recurrence Strategies | Apr 22, 2024 | GPU | Unverified | 0 |
| GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | Apr 22, 2024 | GPU, Motion Generation | Unverified | 0 |
| Accelerating Image Generation with Sub-path Linear Approximation Model | Apr 22, 2024 | Denoising, GPU | Unverified | 0 |
| Turbo-CF: Matrix Decomposition-Free Graph Filtering for Fast Recommendation | Apr 22, 2024 | Collaborative Filtering, GPU | Code Available | 0 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Apr 21, 2024 | GPU, Language Modeling | Code Available | 1 |
| On-board classification of underwater images using hybrid classical-quantum CNN based method | Apr 19, 2024 | Autonomous Vehicles, GPU | Unverified | 0 |
| Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms | Apr 19, 2024 | GPU | Unverified | 0 |
| Scalable Data Assimilation with Message Passing | Apr 19, 2024 | Bayesian Inference, GPU | Code Available | 0 |
| RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation | Apr 18, 2024 | GPU, RAG | Unverified | 0 |
| Warped Time Series Anomaly Detection | Apr 18, 2024 | Anomaly Detection, Dynamic Time Warping | Unverified | 0 |
| Partial Large Kernel CNNs for Efficient Super-Resolution | Apr 18, 2024 | Computational Efficiency, GPU | Code Available | 2 |
| TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding | Apr 18, 2024 | GPU | Code Available | 3 |
| FastFace: Fast-converging Scheduler for Large-scale Face Recognition Training with One GPU | Apr 17, 2024 | Face Recognition, GPU | Code Available | 0 |
| LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs | Apr 16, 2024 | Decoder, GPU | Code Available | 1 |
| Shears: Unstructured Sparsity with Neural Low-rank Adapter Search | Apr 16, 2024 | GPU, Neural Architecture Search | Unverified | 0 |
| SparseDM: Toward Sparse Efficient Diffusion Models | Apr 16, 2024 | GPU, Video Generation | Unverified | 0 |
| Interpolating neural network: A novel unification of machine learning and interpolation theory | Apr 16, 2024 | GPU, Physical Simulations | Code Available | 1 |
| Label merge-and-split: A graph-colouring approach for memory-efficient brain parcellation | Apr 16, 2024 | GPU, Segmentation | Unverified | 0 |
| Insight Gained from Migrating a Machine Learning Model to Intelligence Processing Units | Apr 16, 2024 | GPU | Unverified | 0 |
| Optimal Kernel Tuning Parameter Prediction using Deep Sequence Models | Apr 15, 2024 | GPU, Parameter Prediction | Unverified | 0 |
| Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition | Apr 15, 2024 | Computational Efficiency, GPU | Code Available | 0 |
| LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism | Apr 15, 2024 | GPU | Code Available | 2 |
| Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model | Apr 15, 2024 | GPU, Image Generation | Unverified | 0 |