| Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time | Mar 27, 2025 | CPU, GPU | Unverified | 0 |
| Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization | Mar 26, 2025 | CPU, GPU | Code Available | 7 |
| PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch | Mar 25, 2025 | CPU, GPU | Unverified | 0 |
| Adaptive Machine Learning for Resource-Constrained Environments | Mar 24, 2025 | CPU | Code Available | 0 |
| PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction | Mar 21, 2025 | CPU, Document Layout Analysis | Code Available | 9 |
| V-Seek: Accelerating LLM Reasoning on Open-hardware Server-class RISC-V Platforms | Mar 21, 2025 | CPU, GPU | Unverified | 0 |
| SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs | Mar 20, 2025 | CPU, GPU | Unverified | 0 |
| Design and Implementation of an FPGA-Based Hardware Accelerator for Transformer | Mar 20, 2025 | CPU, High-Level Synthesis | Code Available | 1 |
| BurTorch: Revisiting Training from First Principles by Coupling Autodiff, Math Optimization, and Systems | Mar 18, 2025 | CPU, Math | Code Available | 0 |
| Audio Compression using Periodic Gabor with Biorthogonal Exchange: Implementation Using the Zak Transform | Mar 17, 2025 | Audio Compression, CPU | Unverified | 0 |
| ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory | Mar 16, 2025 | CPU, GPU | Code Available | 3 |
| Robust Learning-Based Sparse Recovery for Device Activity Detection in Grant-Free Random Access Cell-Free Massive MIMO: Enhancing Resilience to Impairments | Mar 13, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge | Mar 12, 2025 | CPU, GPU | Unverified | 0 |
| Are We There Yet? A Measurement Study of Efficiency for LLM Applications on Mobile Devices | Mar 10, 2025 | CPU, GPU | Unverified | 0 |
| Efficient Neural Clause-Selection Reinforcement | Mar 10, 2025 | Automated Theorem Proving, CPU | Unverified | 0 |
| HGO-YOLO: Advancing Anomaly Behavior Detection with Hierarchical Features and Lightweight Optimized Detection | Mar 10, 2025 | CPU, object-detection | Unverified | 0 |
| Coordinated Energy-Trajectory Economic Model Predictive Control for Autonomous Surface Vehicles under Disturbances | Mar 10, 2025 | CPU, Model Predictive Control | Unverified | 0 |
| Spillover effects between climate policy uncertainty, energy markets, and food markets: A time-frequency analysis | Mar 9, 2025 | CPU | Unverified | 0 |
| The impact of external uncertainties on the extreme return connectedness between food, fossil energy, and clean energy markets | Mar 9, 2025 | CPU, GPR | Unverified | 0 |
| LapSum -- One Method to Differentiate Them All: Ranking, Sorting and Top-k Selection | Mar 8, 2025 | All, CPU | Unverified | 0 |
| Real-Time Semantic Segmentation of Aerial Images Using an Embedded U-Net: A Comparison of CPU, GPU, and FPGA Workflows | Mar 7, 2025 | CPU, GPU | Unverified | 0 |
| Deterministic Global Optimization of the Acquisition Function in Bayesian Optimization: To Do or Not To Do? | Mar 5, 2025 | Bayesian Optimization, CPU | Unverified | 0 |
| Partial Convolution Meets Visual Attention | Mar 5, 2025 | CPU, GPU | Unverified | 0 |
| Benchmarking Dynamic SLO Compliance in Distributed Computing Continuum Systems | Mar 5, 2025 | Benchmarking, CPU | Code Available | 0 |
| DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting | Mar 4, 2025 | Computational Efficiency, CPU | Code Available | 1 |
| CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory | Mar 4, 2025 | CPU, GPU | Unverified | 0 |
| Evaluation of adaptive sampling methods in scenario generation for virtual safety impact assessment of pre-crash safety systems | Mar 2, 2025 | Computational Efficiency, CPU | Unverified | 0 |
| DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting | Mar 2, 2025 | CPU, GPU | Code Available | 1 |
| Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking | Mar 1, 2025 | CPU, GPU | Code Available | 1 |
| AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks | Feb 28, 2025 | CPU, GPU | Unverified | 0 |
| TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval | Feb 28, 2025 | CPU, GPU | Unverified | 0 |
| AutoHete: An Automatic and Efficient Heterogeneous Training System for LLMs | Feb 27, 2025 | CPU, GPU | Code Available | 0 |
| LLMs Have Rhythm: Fingerprinting Large Language Models Using Inter-Token Times and Network Traffic Analysis | Feb 27, 2025 | CPU, GPU | Unverified | 0 |
| Striving for Faster and Better: A One-Layer Architecture with Auto Re-parameterization for Low-Light Image Enhancement | Feb 27, 2025 | Computational Efficiency, CPU | Code Available | 0 |
| LightFC-X: Lightweight Convolutional Tracker for RGB-X Tracking | Feb 25, 2025 | CPU | Code Available | 1 |
| SparseTransX: Efficient Training of Translation-Based Knowledge Graph Embeddings Using Sparse Matrix Operations | Feb 24, 2025 | CPU, GPU | Code Available | 0 |
| A Universal Framework for Compressing Embeddings in CTR Prediction | Feb 21, 2025 | Click-Through Rate Prediction, Contrastive Learning | Code Available | 0 |
| Distributed U-net model and Image Segmentation for Lung Cancer Detection | Feb 20, 2025 | CPU, Federated Learning | Unverified | 0 |
| Dynamic Low-Rank Sparse Adaptation for Large Language Models | Feb 20, 2025 | CPU, GPU | Code Available | 1 |
| Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective | Feb 20, 2025 | CPU, GPU | Unverified | 0 |
| Safe Beyond the Horizon: Efficient Sampling-based MPC with Neural Control Barrier Functions | Feb 20, 2025 | CPU, Model Predictive Control | Unverified | 0 |
| Object-Pose Estimation With Neural Population Codes | Feb 19, 2025 | CPU, Object | Unverified | 0 |
| On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation | Feb 18, 2025 | CPU, Intent Detection | Unverified | 0 |
| A^2ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization | Feb 18, 2025 | CPU, Position | Unverified | 0 |
| HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading | Feb 18, 2025 | Computational Efficiency, CPU | Code Available | 2 |
| Robust 6DoF Pose Tracking Considering Contour and Interior Correspondence Uncertainty for AR Assembly Guidance | Feb 17, 2025 | CPU, Optical Flow Estimation | Unverified | 0 |
| Representation Learning on Out of Distribution in Tabular Data | Feb 14, 2025 | Contrastive Learning, CPU | Unverified | 0 |
| Habitizing Diffusion Planning for Efficient and Effective Decision Making | Feb 10, 2025 | CPU, D4RL | Code Available | 1 |
| Weighted-Sum Energy Efficiency Maximization in User-Centric Uplink Cell-Free Massive MIMO | Feb 10, 2025 | CPU | Unverified | 0 |
| DVFS-Aware DNN Inference on GPUs: Latency Modeling and Performance Analysis | Feb 10, 2025 | CPU, Inference Optimization | Unverified | 0 |