| Short-Term Load Forecasting for AI-Data Center | Mar 10, 2025 | GPULoad Forecasting | —Unverified | 0 |
| AttFC: Attention Fully-Connected Layer for Large-Scale Face Recognition with One GPU | Mar 10, 2025 | Face RecognitionGPU | —Unverified | 0 |
| Fine-Tuning LLMs for Report Summarization: Analysis on Supervised and Unsupervised Data | Mar 10, 2025 | GPU | —Unverified | 0 |
| Are We There Yet? A Measurement Study of Efficiency for LLM Applications on Mobile Devices | Mar 10, 2025 | CPUGPU | —Unverified | 0 |
| Global Context Is All You Need for Parallel Efficient Tractography Parcellation | Mar 10, 2025 | AllData Augmentation | —Unverified | 0 |
| A Mesh Is Worth 512 Numbers: Spectral-domain Diffusion Modeling for High-dimension Shape Generation | Mar 9, 2025 | GPU | —Unverified | 0 |
| Training and Inference Efficiency of Encoder-Decoder Speech Models | Mar 7, 2025 | DecoderGPU | —Unverified | 0 |
| Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning | Mar 7, 2025 | GPUMath | —Unverified | 0 |
| Real-Time Semantic Segmentation of Aerial Images Using an Embedded U-Net: A Comparison of CPU, GPU, and FPGA Workflows | Mar 7, 2025 | CPUGPU | —Unverified | 0 |
| Wanda++: Pruning Large Language Models via Regional Gradients | Mar 6, 2025 | DecoderGPU | CodeCode Available | 0 |
| Eventprop training for efficient neuromorphic applications | Mar 6, 2025 | BenchmarkingGPU | —Unverified | 0 |
| Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach | Mar 6, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining | Mar 6, 2025 | GPUHyperparameter Optimization | —Unverified | 0 |
| JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba | Mar 5, 2025 | GPUMamba | —Unverified | 0 |
| Partial Convolution Meets Visual Attention | Mar 5, 2025 | CPUGPU | —Unverified | 0 |
| Memory and Bandwidth are All You Need for Fully Sharded Data Parallel | Mar 4, 2025 | AllGPU | —Unverified | 0 |
| CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory | Mar 4, 2025 | CPUGPU | —Unverified | 0 |
| OceanSim: A GPU-Accelerated Underwater Robot Perception Simulation Framework | Mar 3, 2025 | GPUSensor Modeling | —Unverified | 0 |
| Category-level Meta-learned NeRF Priors for Efficient Object Mapping | Mar 3, 2025 | GPUMeta-Learning | —Unverified | 0 |
| KurTail : Kurtosis-based LLM Quantization | Mar 3, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Open-source framework for detecting bias and overfitting for large pathology images | Mar 3, 2025 | GPUSelf-Supervised Learning | CodeCode Available | 0 |
| A Reconfigurable Stream-Based FPGA Accelerator for Bayesian Confidence Propagation Neural Networks | Mar 3, 2025 | GPUHigh-Level Synthesis | —Unverified | 0 |
| Cauchy Random Features for Operator Learning in Sobolev Space | Mar 1, 2025 | GPUOperator learning | CodeCode Available | 0 |
| Floorplan-SLAM: A Real-Time, High-Accuracy, and Long-Term Multi-Session Point-Plane SLAM for Efficient Floorplan Reconstruction | Mar 1, 2025 | GPUPose Estimation | —Unverified | 0 |
| Timing-Driven Global Placement by Efficient Critical Path Extraction | Feb 28, 2025 | GPU | —Unverified | 0 |
| Efficient Jailbreaking of Large Models by Freeze Training: Lower Layers Exhibit Greater Sensitivity to Harmful Content | Feb 28, 2025 | GPUSensitivity | —Unverified | 0 |
| AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks | Feb 28, 2025 | CPUGPU | —Unverified | 0 |
| Supporting the development of Machine Learning for fundamental science in a federated Cloud with the AI_INFN platform | Feb 28, 2025 | Distributed ComputingDiversity | —Unverified | 0 |
| S4ConvD: Adaptive Scaling and Frequency Adjustment for Energy-Efficient Sensor Networks in Smart Buildings | Feb 28, 2025 | GPUState Space Models | CodeCode Available | 0 |
| TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval | Feb 28, 2025 | CPUGPU | —Unverified | 0 |
| Striving for Faster and Better: A One-Layer Architecture with Auto Re-parameterization for Low-Light Image Enhancement | Feb 27, 2025 | Computational EfficiencyCPU | CodeCode Available | 0 |
| QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects | Feb 27, 2025 | 3D Pose EstimationAction Recognition | —Unverified | 0 |
| Accurate and Scalable Graph Neural Networks via Message Invariance | Feb 27, 2025 | GPUTransductive Learning | CodeCode Available | 0 |
| SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models | Feb 27, 2025 | GPUKnowledge Distillation | —Unverified | 0 |
| WaveGAS: Waveform Relaxation for Scaling Graph Neural Networks | Feb 27, 2025 | GPUgraph partitioning | —Unverified | 0 |
| AutoHete: An Automatic and Efficient Heterogeneous Training System for LLMs | Feb 27, 2025 | CPUGPU | —Unverified | 0 |
| FPGA-Accelerated SpeckleNN with SNL for Real-time X-ray Single-Particle Imaging | Feb 27, 2025 | GPU | —Unverified | 0 |
| LLMs Have Rhythm: Fingerprinting Large Language Models Using Inter-Token Times and Network Traffic Analysis | Feb 27, 2025 | CPUGPU | —Unverified | 0 |
| Mechanistic PDE Networks for Discovery of Governing Equations | Feb 25, 2025 | GPU | —Unverified | 0 |
| Software implemented fault diagnosis of natural gas pumping unit based on feedforward neural network | Feb 25, 2025 | DescriptiveDiagnostic | —Unverified | 0 |
| Accelerated Training on Low-Power Edge Devices | Feb 25, 2025 | GPU | —Unverified | 0 |
| The Power of Graph Signal Processing for Chip Placement Acceleration | Feb 24, 2025 | GPU | —Unverified | 0 |
| Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance | Feb 24, 2025 | GPU | —Unverified | 0 |
| SparseTransX: Efficient Training of Translation-Based Knowledge Graph Embeddings Using Sparse Matrix Operations | Feb 24, 2025 | CPUGPU | CodeCode Available | 0 |
| Low-distortion and GPU-compatible Tree Embeddings in Hyperbolic Space | Feb 24, 2025 | GPU | —Unverified | 0 |
| A Split-Window Transformer for Multi-Model Sequence Spammer Detection using Multi-Model Variational Autoencoder | Feb 23, 2025 | GPUmodel | —Unverified | 0 |
| Fine-Tuning Qwen 2.5 3B for Realistic Movie Dialogue Generation | Feb 22, 2025 | Dialogue GenerationGPU | —Unverified | 0 |
| A Universal Framework for Compressing Embeddings in CTR Prediction | Feb 21, 2025 | Click-Through Rate PredictionContrastive Learning | CodeCode Available | 0 |
| Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference | Feb 21, 2025 | GPU | —Unverified | 0 |
| Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity | Feb 20, 2025 | GPULanguage Modeling | CodeCode Available | 0 |