| DeepSeek-V3 Technical Report | Dec 27, 2024 | GPULanguage Modeling | CodeCode Available | 16 |
| MBQ: Modality-Balanced Quantization for Large Vision-Language Models | Dec 27, 2024 | GPUQuantization | CodeCode Available | 2 |
| Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference | Dec 25, 2024 | CPUGPU | —Unverified | 0 |
| GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network | Dec 24, 2024 | GPUgraph construction | CodeCode Available | 1 |
| KunServe: Efficient Parameter-centric Memory Management for LLM Serving | Dec 24, 2024 | GPULanguage Modeling | —Unverified | 0 |
| GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference | Dec 23, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Broadband Ground Motion Synthesis by Diffusion Model with Minimal Condition | Dec 23, 2024 | GPUMotion Synthesis | —Unverified | 0 |
| Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing | Dec 23, 2024 | ArabicMMLUDialect Identification | CodeCode Available | 1 |
| Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling | Dec 23, 2024 | 3DGSGPU | —Unverified | 0 |
| Power- and Fragmentation-aware Online Scheduling for GPU Datacenters | Dec 23, 2024 | CPUGPU | CodeCode Available | 0 |
| CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction | Dec 23, 2024 | 3DGSGPU | —Unverified | 0 |
| Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry Locality | Dec 21, 2024 | GPU | CodeCode Available | 1 |
| Lillama: Large Language Models Compression via Low-Rank Feature Distillation | Dec 21, 2024 | GPUMamba | —Unverified | 0 |
| Less is More: Towards Green Code Large Language Models via Unified Structural Pruning | Dec 20, 2024 | Computational EfficiencyGPU | —Unverified | 0 |
| CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up | Dec 20, 2024 | 8kGPU | CodeCode Available | 3 |
| WebLLM: A High-Performance In-Browser LLM Inference Engine | Dec 20, 2024 | CPUGPU | CodeCode Available | 11 |
| MUSTER: Longitudinal Deformable Registration by Composition of Consecutive Deformations | Dec 19, 2024 | GPUImage Registration | CodeCode Available | 0 |
| Taming the Memory Beast: Strategies for Reliable ML Training on Kubernetes | Dec 19, 2024 | GPUManagement | —Unverified | 0 |
| IDOL: Instant Photorealistic 3D Human Creation from a Single Image | Dec 19, 2024 | GPU | —Unverified | 0 |
| HashAttention: Semantic Sparsity for Faster Inference | Dec 19, 2024 | GPUSemantic Similarity | —Unverified | 0 |
| DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation | Dec 19, 2024 | 3D GenerationDenoising | —Unverified | 0 |
| SqueezeMe: Efficient Gaussian Avatars for VR | Dec 19, 2024 | DecoderGPU | —Unverified | 0 |
| Channel Merging: Preserving Specialization for Merged Experts | Dec 18, 2024 | Code GenerationGPU | —Unverified | 0 |
| Comparative Analysis of YOLOv9, YOLOv10 and RT-DETR for Real-Time Weed Detection | Dec 18, 2024 | CPUGPU | —Unverified | 0 |
| SocialED: A Python Library for Social Event Detection | Dec 18, 2024 | CPUEvent Detection | CodeCode Available | 4 |
| Language verY Rare for All | Dec 18, 2024 | AllDecoder | —Unverified | 0 |
| Crabs: Consuming Resource via Auto-generation for LLM-DoS Attack under Black-box Settings | Dec 18, 2024 | GPU | CodeCode Available | 1 |
| ArchesWeather & ArchesWeatherGen: a deterministic and generative model for efficient ML weather forecasting | Dec 17, 2024 | GPUWeather Forecasting | CodeCode Available | 2 |
| What is YOLOv6? A Deep Insight into the Object Detection Model | Dec 17, 2024 | GPUobject-detection | —Unverified | 0 |
| Echo: Simulating Distributed Training At Scale | Dec 17, 2024 | GPU | —Unverified | 0 |
| Three Things to Know about Deep Metric Learning | Dec 17, 2024 | GPUImage Retrieval | —Unverified | 0 |
| Exploring AI-Enabled Cybersecurity Frameworks: Deep-Learning Techniques, GPU Support, and Future Enhancements | Dec 17, 2024 | Deep LearningGPU | —Unverified | 0 |
| DAOP: Data-Aware Offloading and Predictive Pre-Calculation for Efficient MoE Inference | Dec 16, 2024 | CPUGPU | CodeCode Available | 0 |
| Accelerating Sparse Graph Neural Networks with Tensor Core Optimization | Dec 16, 2024 | Computational EfficiencyGPU | —Unverified | 0 |
| FinLoRA: Finetuning Quantized Financial Large Language Models Using Low-Rank Adaptation | Dec 16, 2024 | GPUInformation Retrieval | —Unverified | 0 |
| What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study | Dec 16, 2024 | GPU | —Unverified | 0 |
| Formulations and scalability of neural network surrogates in nonlinear optimization problems | Dec 16, 2024 | GPU | —Unverified | 0 |
| Ultra-High-Definition Dynamic Multi-Exposure Image Fusion via Infinite Pixel Learning | Dec 16, 2024 | GPULarge Language Model | —Unverified | 0 |
| GS-ProCams: Gaussian Splatting-based Projector-Camera Systems | Dec 16, 2024 | GPUNeRF | —Unverified | 0 |
| Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation | Dec 16, 2024 | Deep Reinforcement LearningGPU | —Unverified | 0 |
| PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Dec 16, 2024 | 3D Reconstruction4k | CodeCode Available | 3 |
| Dynamic Graph Attention Networks for Travel Time Distribution Prediction in Urban Arterial Roads | Dec 15, 2024 | counterfactualGPU | —Unverified | 0 |
| NITRO: LLM Inference on Intel Laptop NPUs | Dec 15, 2024 | CPUGPU | CodeCode Available | 1 |
| Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation | Dec 15, 2024 | GPUMamba | CodeCode Available | 1 |
| Advancing Vehicle Plate Recognition: Multitasking Visual Language Models with VehiclePaliGemma | Dec 14, 2024 | GPULicense Plate Recognition | —Unverified | 0 |
| KVDirect: Distributed Disaggregated LLM Inference | Dec 13, 2024 | GPUScheduling | —Unverified | 0 |
| HashEvict: A Pre-Attention KV Cache Eviction Strategy using Locality-Sensitive Hashing | Dec 13, 2024 | GPUMultiple-choice | —Unverified | 0 |
| Real-time Identity Defenses against Malicious Personalization of Diffusion Models | Dec 13, 2024 | CPUGPU | CodeCode Available | 1 |
| SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians | Dec 13, 2024 | GPUObject Localization | —Unverified | 0 |
| Toy-GS: Assembling Local Gaussians for Precisely Rendering Large-Scale Free Camera Trajectories | Dec 13, 2024 | GPU | —Unverified | 0 |