| Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition | Apr 15, 2024 | Computational EfficiencyGPU | CodeCode Available | 0 |
| LoopAnimate: Loopable Salient Object Animation | Apr 14, 2024 | GPUObject | —Unverified | 0 |
| CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models | Apr 12, 2024 | GPU | CodeCode Available | 1 |
| Detecting AI-Generated Images via CLIP | Apr 12, 2024 | GPU | —Unverified | 0 |
| Reducing the Barriers to Entry for Foundation Model Training | Apr 12, 2024 | GPU | —Unverified | 0 |
| Mitigating Challenges of the Space Environment for Onboard Artificial Intelligence: Design Overview of the Imaging Payload on SpIRIT | Apr 12, 2024 | Edge-computingGPU | —Unverified | 0 |
| Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models | Apr 11, 2024 | GPUIn-Context Learning | —Unverified | 0 |
| JetMoE: Reaching Llama2 Performance with 0.1M Dollars | Apr 11, 2024 | GPUMixture-of-Experts | CodeCode Available | 4 |
| Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding | Apr 10, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| YOLO based Ocean Eddy Localization with AWS SageMaker | Apr 10, 2024 | GPUManagement | —Unverified | 0 |
| Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic | Apr 10, 2024 | GPU | CodeCode Available | 2 |
| PIM-Opt: Demystifying Distributed Optimization Algorithms on a Real-World Processing-In-Memory System | Apr 10, 2024 | CPUDistributed Optimization | CodeCode Available | 0 |
| GCV-Turbo: End-to-end Acceleration of GNN-based Computer Vision Tasks on FPGA | Apr 10, 2024 | CPUGPU | —Unverified | 0 |
| FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models | Apr 9, 2024 | FairnessGPU | CodeCode Available | 1 |
| LATUP-Net: A Lightweight 3D Attention U-Net with Parallel Convolutions for Brain Tumor Segmentation | Apr 9, 2024 | Brain Tumor SegmentationGPU | —Unverified | 0 |
| LIPT: Latency-aware Image Processing Transformer | Apr 9, 2024 | DenoisingGPU | CodeCode Available | 1 |
| ApproxDARTS: Differentiable Neural Architecture Search with Approximate Multipliers | Apr 8, 2024 | GPUNeural Architecture Search | —Unverified | 0 |
| MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding | Apr 8, 2024 | GPUMultiple-choice | CodeCode Available | 3 |
| Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models | Apr 8, 2024 | GPUMixture-of-Experts | —Unverified | 0 |
| Allo: A Programming Model for Composable Accelerator Design | Apr 7, 2024 | GPUHigh-Level Synthesis | CodeCode Available | 3 |
| Tensorized Ant Colony Optimization for GPU Acceleration | Apr 7, 2024 | CPUGPU | CodeCode Available | 1 |
| Data Stream Sampling with Fuzzy Task Boundaries and Noisy Labels | Apr 7, 2024 | Continual LearningFairness | CodeCode Available | 0 |
| GNNBENCH: Fair and Productive Benchmarking for Single-GPU GNN System | Apr 5, 2024 | BenchmarkingGPU | —Unverified | 0 |
| Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization | Apr 4, 2024 | GPULanguage Modeling | CodeCode Available | 0 |
| OmniGS: Fast Radiance Field Reconstruction using Omnidirectional Gaussian Splatting | Apr 4, 2024 | GPU | CodeCode Available | 2 |
| Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures | Apr 3, 2024 | CPUGPU | CodeCode Available | 2 |
| BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models | Apr 3, 2024 | GPUMath | CodeCode Available | 3 |
| GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU | Apr 3, 2024 | GPUGraph Neural Network | CodeCode Available | 1 |
| GPU-Accelerated RSF Level Set Evolution for Large-Scale Microvascular Segmentation | Apr 3, 2024 | GPUSemantic Segmentation | —Unverified | 0 |
| Tensorized NeuroEvolution of Augmenting Topologies for GPU Acceleration | Apr 2, 2024 | Computational EfficiencyGPU | CodeCode Available | 3 |
| Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration | Apr 2, 2024 | AllDecoder | CodeCode Available | 2 |
| Accelerating Transformer Pre-training with 2:4 Sparsity | Apr 2, 2024 | GPU | CodeCode Available | 2 |
| IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT | Apr 2, 2024 | GPUparameter-efficient fine-tuning | CodeCode Available | 1 |
| GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA | Apr 1, 2024 | GPUMultiobjective Optimization | CodeCode Available | 3 |
| LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization | Apr 1, 2024 | Action LocalizationGPU | —Unverified | 0 |
| Towards Label-Efficient Human Matting: A Simple Baseline for Weakly Semi-Supervised Trimap-Free Human Matting | Apr 1, 2024 | Domain GeneralizationGPU | CodeCode Available | 0 |
| Enhancing Reasoning Capacity of SLM using Cognitive Enhancement | Apr 1, 2024 | GPULanguage Modelling | —Unverified | 0 |
| GAMA-IR: Global Additive Multidimensional Averaging for Fast Image Restoration | Mar 31, 2024 | DeblurringDenoising | —Unverified | 0 |
| Grid Diffusion Models for Text-to-Video Generation | Mar 30, 2024 | GPUImage Generation | —Unverified | 0 |
| DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference | Mar 30, 2024 | GPU | —Unverified | 0 |
| 94% on CIFAR-10 in 3.29 Seconds on a Single GPU | Mar 30, 2024 | GPU | CodeCode Available | 3 |
| FetalDiffusion: Pose-Controllable 3D Fetal MRI Synthesis with Conditional Diffusion Model | Mar 29, 2024 | GPUPose Estimation | —Unverified | 0 |
| Shallow Cross-Encoders for Low-Latency Retrieval | Mar 29, 2024 | CPUGPU | CodeCode Available | 0 |
| Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs | Mar 29, 2024 | CPUGPU | —Unverified | 0 |
| Efficient Modulation for Vision Networks | Mar 29, 2024 | GPU | CodeCode Available | 2 |
| Parallel Implementations Assessment of a Spatial-Spectral Classifier for Hyperspectral Clinical Applications | Mar 28, 2024 | GPUMedical Diagnosis | —Unverified | 0 |
| Siamese Vision Transformers are Scalable Audio-visual Learners | Mar 28, 2024 | Contrastive LearningGPU | CodeCode Available | 1 |
| Bespoke Large Language Models for Digital Triage Assistance in Mental Health Care | Mar 28, 2024 | GPU | —Unverified | 0 |
| Jamba: A Hybrid Transformer-Mamba Language Model | Mar 28, 2024 | GPULanguage Modeling | CodeCode Available | 0 |
| Debiasing Cardiac Imaging with Controlled Latent Diffusion Models | Mar 28, 2024 | DenoisingGPU | CodeCode Available | 0 |