| Prediction of Effective Elastic Moduli of Rocks using Graph Neural Networks | Oct 30, 2023 | GPU | Code Available | 1 |
| DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object Detection | Oct 30, 2023 | Denoising, GPU | Code Available | 1 |
| SiDA-MoE: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts Models | Oct 29, 2023 | GPU, Mixture-of-Experts | Code Available | 1 |
| LLMSTEP: LLM proofstep suggestions in Lean | Oct 27, 2023 | CPU, GPU | Code Available | 1 |
| RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs | Oct 25, 2023 | GPU, Language Modeling | Code Available | 1 |
| Metrically Scaled Monocular Depth Estimation through Sparse Priors for Underwater Robots | Oct 25, 2023 | CPU, Decoder | Code Available | 1 |
| LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery | Oct 24, 2023 | GPU, Language Modeling | Code Available | 1 |
| CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource Languages | Oct 20, 2023 | Diversity, GPU | Code Available | 1 |
| CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation | Oct 19, 2023 | 2k, GPU | Code Available | 1 |
| MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient | Oct 17, 2023 | 3D Object Detection, GPU | Code Available | 1 |
| DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in Conversations | Oct 17, 2023 | Benchmarking, Emotion Recognition | Code Available | 1 |
| TRANSOM: An Efficient Fault-Tolerant System for Training LLMs | Oct 16, 2023 | Anomaly Detection, GPU | Code Available | 1 |
| ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion | Oct 16, 2023 | Depth Estimation, Depth Prediction | Code Available | 1 |
| Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models | Oct 15, 2023 | CPU, GPU | Code Available | 1 |
| G10: Enabling An Efficient Unified GPU Memory and Storage Architecture with Smart Tensor Migrations | Oct 13, 2023 | Deep Learning, GPU | Code Available | 1 |
| QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language Models | Oct 13, 2023 | Computational Efficiency, GPU | Code Available | 1 |
| QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models | Oct 12, 2023 | GPU, Quantization | Code Available | 1 |
| No Privacy Left Outside: On the (In-)Security of TEE-Shielded DNN Partition for On-Device ML | Oct 11, 2023 | GPU, Inference Attack | Code Available | 1 |
| Sparse Fine-tuning for Inference Acceleration of Large Language Models | Oct 10, 2023 | CPU, GPU | Code Available | 1 |
| Persis: A Persian Font Recognition Pipeline Using Convolutional Neural Networks | Oct 8, 2023 | Binarization, CPU | Code Available | 1 |
| GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models | Oct 8, 2023 | GPU, Reinforcement Learning (RL) | Code Available | 1 |
| Surgical Gym: A high-performance GPU-based platform for reinforcement learning with surgical robots | Oct 7, 2023 | Deep Reinforcement Learning, GPU | Code Available | 1 |
| Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs | Oct 3, 2023 | GPU | Code Available | 1 |
| Label Supervised LLaMA Finetuning | Oct 2, 2023 | GPU, named-entity-recognition | Code Available | 1 |
| Training a Large Video Model on a Single Machine in a Day | Sep 28, 2023 | Action Recognition, CPU | Code Available | 1 |