| Automated Quality Control System for Canned Tuna Production using Artificial Vision | Oct 8, 2024 | GPUOptical Character Recognition (OCR) | —Unverified | 0 |
| CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation | Oct 7, 2024 | GPUMachine Translation | —Unverified | 0 |
| PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms | Oct 5, 2024 | BenchmarkingGPU | —Unverified | 0 |
| Fast Object Detection with a Machine Learning Edge Device | Oct 5, 2024 | Autonomous NavigationCPU | —Unverified | 0 |
| Dolphin: A Programmable Framework for Scalable Neurosymbolic Learning | Oct 4, 2024 | CPUDeep Learning | —Unverified | 0 |
| Compute Or Load KV Cache? Why Not Both? | Oct 4, 2024 | GPU | —Unverified | 0 |
| LoRC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy | Oct 4, 2024 | GPULow-rank compression | —Unverified | 0 |
| Learning from Offline Foundation Features with Tensor Augmentations | Oct 3, 2024 | GPU | —Unverified | 0 |
| Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network | Oct 3, 2024 | GPUReal-Time Semantic Segmentation | —Unverified | 0 |
| Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping | Oct 3, 2024 | GPUMixture-of-Experts | —Unverified | 0 |
| An Efficient Inference Frame for SMLM (Single-Molecule Localization Microscopy) | Oct 3, 2024 | Deep LearningGPU | CodeCode Available | 0 |
| Online Energy Optimization in GPUs: A Multi-Armed Bandit Approach | Oct 3, 2024 | energy managementGPU | CodeCode Available | 0 |
| Contextual Document Embeddings | Oct 3, 2024 | Contrastive LearningDocument Embedding | —Unverified | 0 |
| LLMCO2: Advancing Accurate Carbon Footprint Prediction for LLM Inferences | Oct 3, 2024 | GPUGraph Neural Network | —Unverified | 0 |
| Replacement Learning: Training Vision Tasks with Fewer Learnable Parameters | Oct 2, 2024 | GPU | —Unverified | 0 |
| A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts | Oct 2, 2024 | 4kGPU | —Unverified | 0 |
| VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings | Oct 2, 2024 | GPUGraph Attention | —Unverified | 0 |
| FlashMask: Efficient and Rich Mask Extension of FlashAttention | Oct 2, 2024 | Computational EfficiencyGPU | —Unverified | 0 |
| Scalable and Consistent Graph Neural Networks for Distributed Mesh-based Data-driven Modeling | Oct 2, 2024 | GPUGraph Neural Network | —Unverified | 0 |
| ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving | Oct 2, 2024 | BenchmarkingDocument Summarization | —Unverified | 0 |
| ROK Defense M&S in the Age of Hyperscale AI: Concepts, Challenges, and Future Directions | Oct 1, 2024 | Decision MakingGPU | —Unverified | 0 |
| Lotus: learning-based online thermal and latency variation management for two-stage detectors on edge devices | Oct 1, 2024 | CPUDeep Reinforcement Learning | CodeCode Available | 0 |
| MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards | Oct 1, 2024 | GPUMixture-of-Experts | —Unverified | 0 |
| Characterizing and Efficiently Accelerating Multimodal Generation Model Inference | Sep 30, 2024 | GPUmultimodal generation | —Unverified | 0 |
| HEADS-UP: Head-Mounted Egocentric Dataset for Trajectory Prediction in Blind Assistance Systems | Sep 30, 2024 | GPUPrediction | —Unverified | 0 |
| Simulation-based inference with the Python Package sbijax | Sep 28, 2024 | Bayesian InferenceCPU | —Unverified | 0 |
| Gradient-free Decoder Inversion in Latent Diffusion Models | Sep 27, 2024 | DecoderDenoising | —Unverified | 0 |
| TensorSocket: Shared Data Loading for Deep Learning Training | Sep 27, 2024 | Computational EfficiencyCPU | —Unverified | 0 |
| Input-Dependent Power Usage in GPUs | Sep 26, 2024 | GPU | CodeCode Available | 0 |
| Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores | Sep 26, 2024 | GPUManagement | —Unverified | 0 |
| DRL-STNet: Unsupervised Domain Adaptation for Cross-modality Medical Image Segmentation via Disentangled Representation Learning | Sep 26, 2024 | Domain AdaptationGPU | —Unverified | 0 |
| Behaviour4All: in-the-wild Facial Behaviour Analysis Toolkit | Sep 26, 2024 | Action Unit DetectionArousal Estimation | —Unverified | 0 |
| CNN Mixture-of-Depths | Sep 25, 2024 | Computational EfficiencyCPU | —Unverified | 0 |
| Efficient and generalizable nested Fourier-DeepONet for three-dimensional geological carbon sequestration | Sep 25, 2024 | GPU | CodeCode Available | 0 |
| FusionANNS: An Efficient CPU/GPU Cooperative Processing Architecture for Billion-scale Approximate Nearest Neighbor Search | Sep 25, 2024 | Collaborative FilteringCPU | —Unverified | 0 |
| A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation | Sep 24, 2024 | GPUMulti-Task Learning | —Unverified | 0 |
| dnaGrinder: a lightweight and high-capacity genomic foundation model | Sep 24, 2024 | DecoderGPU | —Unverified | 0 |
| Textless NLP -- Zero Resource Challenge with Low Resource Compute | Sep 24, 2024 | Acoustic Unit DiscoveryGPU | —Unverified | 0 |
| Efficient Tabular Data Preprocessing of ML Pipelines | Sep 23, 2024 | CPUGPU | —Unverified | 0 |
| PipeFill: Using GPUs During Bubbles in Pipeline-parallel LLM Training | Sep 23, 2024 | 8kGPU | —Unverified | 0 |
| Benchmarking Edge AI Platforms for High-Performance ML Inference | Sep 23, 2024 | BenchmarkingCPU | —Unverified | 0 |
| TextToon: Real-Time Text Toonify Head Avatar from Single Video | Sep 23, 2024 | Contrastive LearningGPU | —Unverified | 0 |
| A Realistic Simulation Framework for Analog/Digital Neuromorphic Architectures | Sep 23, 2024 | Edge-computingGPU | —Unverified | 0 |
| FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs | Sep 21, 2024 | CPUGPU | —Unverified | 0 |
| Drift to Remember | Sep 21, 2024 | GPUimage-classification | —Unverified | 0 |
| ProTEA: Programmable Transformer Encoder Acceleration on FPGA | Sep 21, 2024 | GPUMachine Translation | —Unverified | 0 |
| On Importance of Pruning and Distillation for Efficient Low Resource NLP | Sep 21, 2024 | Document ClassificationGPU | —Unverified | 0 |
| Optimizing RLHF Training for Large Language Models with Stage Fusion | Sep 20, 2024 | GPU | —Unverified | 0 |
| Graph Convolutional Neural Networks as Surrogate Models for Climate Simulation | Sep 19, 2024 | GPUUncertainty Quantification | —Unverified | 0 |
| Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention | Sep 19, 2024 | GPURecommendation Systems | —Unverified | 0 |