| A Lightweight Inception Boosted U-Net Neural Network for Routability Prediction | Feb 7, 2024 | Avg, CPU | Code Available | 1 |
| Pruner: A Speculative Exploration Mechanism to Accelerate Tensor Program Tuning | Feb 4, 2024 | GPU, Transfer Learning | Code Available | 1 |
| Structure-Aware E(3)-Invariant Molecular Conformer Aggregation Networks | Feb 3, 2024 | GPU, Molecular Property Prediction | Code Available | 1 |
| InferCept: Efficient Intercept Support for Augmented Large Language Model Inference | Feb 2, 2024 | GPU, Language Modeling | Code Available | 1 |
| HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy | Jan 26, 2024 | GPU, parameter-efficient fine-tuning | Code Available | 1 |
| InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction | Jan 23, 2024 | 3D Semantic Occupancy Prediction, Autonomous Driving | Code Available | 1 |
| immrax: A Parallelizable and Differentiable Toolbox for Interval Analysis and Mixed Monotone Reachability in JAX | Jan 21, 2024 | Computational Efficiency, GPU | Code Available | 1 |
| Dynamic DNNs and Runtime Management for Efficient Inference on Mobile/Embedded Devices | Jan 17, 2024 | Dynamic neural networks, GPU | Code Available | 1 |
| Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference | Jan 16, 2024 | GPU, Mixture-of-Experts | Code Available | 1 |
| CAVIAR: Co-simulation of 6G Communications, 3D Scenarios and AI for Digital Twins | Jan 6, 2024 | Autonomous Vehicles, Benchmarking | Code Available | 1 |
| TinyPredNet: A Lightweight Framework for Satellite Image Sequence Prediction | Jan 1, 2024 | Decoder, GPU | Code Available | 1 |
| Resource-Efficient Transformer Pruning for Finetuning of Large Models | Jan 1, 2024 | GPU, Natural Language Understanding | Code Available | 1 |
| City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web | Dec 27, 2023 | 3D Scene Reconstruction, GPU | Code Available | 1 |
| ZO-AdaMU Optimizer: Adapting Perturbation by the Momentum and Uncertainty in Zeroth-order Optimization | Dec 23, 2023 | GPU | Code Available | 1 |
| Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving | Dec 19, 2023 | Autonomous Driving, GPU | Code Available | 1 |
| Enhancing predictive capabilities in fusion burning plasmas through surrogate-based optimization in core transport solvers | Dec 19, 2023 | GPU, Prediction | Code Available | 1 |
| Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models | Dec 19, 2023 | GPU | Code Available | 1 |
| Opara: Exploiting Operator Parallelism for Expediting DNN Inference on GPUs | Dec 16, 2023 | GPU, Scheduling | Code Available | 1 |
| Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models | Dec 15, 2023 | Benchmarking, Code Summarization | Code Available | 1 |
| Data-Efficient Multimodal Fusion on a Single GPU | Dec 15, 2023 | GPU, Image Retrieval | Code Available | 1 |
| MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training | Dec 14, 2023 | GPU | Code Available | 1 |
| Memory-Efficient Reversible Spiking Neural Networks | Dec 13, 2023 | GPU | Code Available | 1 |
| EZ-CLIP: Efficient Zeroshot Video Action Recognition | Dec 13, 2023 | Action Recognition, GPU | Code Available | 1 |
| DTL: Disentangled Transfer Learning for Visual Recognition | Dec 13, 2023 | GPU, Transfer Learning | Code Available | 1 |
| Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI | Dec 13, 2023 | Diversity, GPU | Code Available | 1 |