| Kozax: Flexible and Scalable Genetic Programming in JAX | Feb 5, 2025 | GPU | Code Available | 1 |
| Work-Efficient Parallel Non-Maximum Suppression Kernels | Feb 1, 2025 | GPU, object-detection | Code Available | 1 |
| Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posterior | Jan 31, 2025 | GPU | Code Available | 1 |
| Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models | Jan 31, 2025 | GPU, Quantization | Code Available | 1 |
| Return of the Encoder: Maximizing Parameter Efficiency for SLMs | Jan 27, 2025 | Computational Efficiency, CPU | Code Available | 1 |
| CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning | Jan 14, 2025 | Deep Reinforcement Learning, GPU | Code Available | 1 |
| Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping | Jan 11, 2025 | GPU, Large Language Model | Code Available | 1 |
| MS-Temba : Multi-Scale Temporal Mamba for Efficient Temporal Action Detection | Jan 10, 2025 | Action Detection, GPU | Code Available | 1 |
| LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations | Jan 5, 2025 | GPU | Code Available | 1 |
| RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar | Jan 4, 2025 | 3D Object Detection, 3D Object Detection (RoI) | Code Available | 1 |
| Mamba4D: Efficient 4D Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | Jan 1, 2025 | Action Recognition, Action Segmentation | Code Available | 1 |
| Lightweight G-YOLOv11: Advancing Efficient Fracture Detection in Pediatric Wrist X-rays | Dec 31, 2024 | Fracture detection, GPU | Code Available | 1 |
| GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network | Dec 24, 2024 | GPU, graph construction | Code Available | 1 |
| Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing | Dec 23, 2024 | ArabicMMLU, Dialect Identification | Code Available | 1 |
| Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry Locality | Dec 21, 2024 | GPU | Code Available | 1 |
| Crabs: Consuming Resource via Auto-generation for LLM-DoS Attack under Black-box Settings | Dec 18, 2024 | GPU | Code Available | 1 |
| Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation | Dec 15, 2024 | GPU, Mamba | Code Available | 1 |
| NITRO: LLM Inference on Intel Laptop NPUs | Dec 15, 2024 | CPU, GPU | Code Available | 1 |
| Real-time Identity Defenses against Malicious Personalization of Diffusion Models | Dec 13, 2024 | CPU, GPU | Code Available | 1 |
| EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation | Dec 11, 2024 | Decoder, GPU | Code Available | 1 |
| MCP-MedSAM: A Powerful Lightweight Medical Segment Anything Model Trained with a Single GPU in Just One Day | Dec 8, 2024 | GPU, Image Segmentation | Code Available | 1 |
| Transformers Can Navigate Mazes With Multi-Step Prediction | Dec 6, 2024 | GPU, Language Modeling | Code Available | 1 |
| p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay | Dec 5, 2024 | Decoder, GPU | Code Available | 1 |
| Beyond [cls]: Exploring the true potential of Masked Image Modeling representations | Dec 4, 2024 | GPU, Self-Supervised Learning | Code Available | 1 |
| VISION-XL: High Definition Video Inverse Problem Solver using Latent Image Diffusion Models | Nov 29, 2024 | Deblurring, GPU | Code Available | 1 |