| Efficient and generalizable nested Fourier-DeepONet for three-dimensional geological carbon sequestration | Sep 25, 2024 | GPU | CodeCode Available | 0 |
| CNN Mixture-of-Depths | Sep 25, 2024 | Computational EfficiencyCPU | —Unverified | 0 |
| INT-FlashAttention: Enabling Flash Attention for INT8 Quantization | Sep 25, 2024 | GPUQuantization | CodeCode Available | 2 |
| Textless NLP -- Zero Resource Challenge with Low Resource Compute | Sep 24, 2024 | Acoustic Unit DiscoveryGPU | —Unverified | 0 |
| CAD: Memory Efficient Convolutional Adapter for Segment Anything | Sep 24, 2024 | DecoderGPU | CodeCode Available | 1 |
| A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation | Sep 24, 2024 | GPUMulti-Task Learning | —Unverified | 0 |
| Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed | Sep 24, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| dnaGrinder: a lightweight and high-capacity genomic foundation model | Sep 24, 2024 | DecoderGPU | —Unverified | 0 |
| PipeFill: Using GPUs During Bubbles in Pipeline-parallel LLM Training | Sep 23, 2024 | 8kGPU | —Unverified | 0 |
| TextToon: Real-Time Text Toonify Head Avatar from Single Video | Sep 23, 2024 | Contrastive LearningGPU | —Unverified | 0 |
| Efficient Tabular Data Preprocessing of ML Pipelines | Sep 23, 2024 | CPUGPU | —Unverified | 0 |
| Benchmarking Edge AI Platforms for High-Performance ML Inference | Sep 23, 2024 | BenchmarkingCPU | —Unverified | 0 |
| FastGL: A GPU-Efficient Framework for Accelerating Sampling-Based GNN Training at Large Scale | Sep 23, 2024 | GPU | CodeCode Available | 1 |
| A Realistic Simulation Framework for Analog/Digital Neuromorphic Architectures | Sep 23, 2024 | Edge-computingGPU | —Unverified | 0 |
| Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding | Sep 22, 2024 | Anomaly DetectionGPU | CodeCode Available | 4 |
| FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs | Sep 21, 2024 | CPUGPU | —Unverified | 0 |
| ProTEA: Programmable Transformer Encoder Acceleration on FPGA | Sep 21, 2024 | GPUMachine Translation | —Unverified | 0 |
| Drift to Remember | Sep 21, 2024 | GPUimage-classification | —Unverified | 0 |
| On Importance of Pruning and Distillation for Efficient Low Resource NLP | Sep 21, 2024 | Document ClassificationGPU | —Unverified | 0 |
| Optimizing RLHF Training for Large Language Models with Stage Fusion | Sep 20, 2024 | GPU | —Unverified | 0 |
| Occupancy-Based Dual Contouring | Sep 20, 2024 | 3D ReconstructionGPU | CodeCode Available | 2 |
| Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention | Sep 19, 2024 | GPURecommendation Systems | —Unverified | 0 |
| 3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt | Sep 19, 2024 | 3DGSGPU | CodeCode Available | 3 |
| Graph Convolutional Neural Networks as Surrogate Models for Climate Simulation | Sep 19, 2024 | GPUUncertainty Quantification | —Unverified | 0 |
| Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization | Sep 19, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| Impact of ML Optimization Tactics on Greener Pre-Trained ML Models | Sep 19, 2024 | GPUimage-classification | —Unverified | 0 |
| CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs | Sep 19, 2024 | GPU | CodeCode Available | 1 |
| Efficient Low-Resolution Face Recognition via Bridge Distillation | Sep 18, 2024 | CPUDataset Distillation | —Unverified | 0 |
| User-friendly Foundation Model Adapters for Multivariate Time Series Classification | Sep 18, 2024 | Dimensionality ReductionGPU | —Unverified | 0 |
| Bundle Adjustment in the Eager Mode | Sep 18, 2024 | Deep LearningGPU | —Unverified | 0 |
| Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Sep 18, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| Less Memory Means smaller GPUs: Backpropagation with Compressed Activations | Sep 18, 2024 | GPU | —Unverified | 0 |
| Mamba Fusion: Learning Actions Through Questioning | Sep 17, 2024 | Action AnticipationAction Recognition | CodeCode Available | 0 |
| Can Graph Reordering Speed Up Graph Neural Network Training? An Experimental Study | Sep 17, 2024 | CPUGPU | CodeCode Available | 0 |
| RenderWorld: World Model with Self-Supervised 3D Label | Sep 17, 2024 | Autonomous DrivingGPU | —Unverified | 0 |
| Early Detection of Coronary Heart Disease Using Hybrid Quantum Machine Learning Approach | Sep 17, 2024 | GPUQuantum Machine Learning | —Unverified | 0 |
| MARCA: Mamba Accelerator with ReConfigurable Architecture | Sep 16, 2024 | CPUGPU | —Unverified | 0 |
| RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval | Sep 16, 2024 | CPUGPU | CodeCode Available | 2 |
| One-Shot Learning for Pose-Guided Person Image Synthesis in the Wild | Sep 15, 2024 | GPUImage Generation | CodeCode Available | 1 |
| LLM-Powered Ensemble Learning for Paper Source Tracing: A GPU-Free Approach | Sep 14, 2024 | Ensemble LearningGPU | CodeCode Available | 0 |
| Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution | Sep 14, 2024 | GPUMamba | —Unverified | 0 |
| Accurate and Fast Estimation of Temporal Motifs using Path Sampling | Sep 13, 2024 | GPUGraph Mining | CodeCode Available | 0 |
| Using Convolutional Neural Networks for Denoising and Deblending of Marine Seismic Data | Sep 13, 2024 | DenoisingGPU | —Unverified | 0 |
| SwinGS: Sliding Window Gaussian Splatting for Volumetric Video Streaming with Arbitrary Length | Sep 12, 2024 | 3DGSGPU | —Unverified | 0 |
| Super Monotonic Alignment Search | Sep 12, 2024 | CPUGPU | CodeCode Available | 2 |
| Self-Supervised Learning of Iterative Solvers for Constrained Optimization | Sep 12, 2024 | GPUSelf-Supervised Learning | —Unverified | 0 |
| Improve Machine Learning carbon footprint using Nvidia GPU and Mixed Precision training for classification models -- Part I | Sep 12, 2024 | BenchmarkingCPU | CodeCode Available | 0 |
| Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU | Sep 11, 2024 | Autonomous DrivingGPU | —Unverified | 0 |
| ENACT: Entropy-based Clustering of Attention Input for Improving the Computational Performance of Object Detection Transformers | Sep 11, 2024 | GPUobject-detection | CodeCode Available | 0 |
| A Cost-Aware Approach to Adversarial Robustness in Neural Networks | Sep 11, 2024 | Adversarial RobustnessGPU | —Unverified | 0 |