| High-Speed Ultra-Energy-Efficient Memristor-Based Massive MIMO SIC Detector Circuit with Hybrid Analog-Digital Computing Architecture | Jun 4, 2025 | GPU | —Unverified | 0 |
| Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem | Jun 3, 2025 | GPUMath | —Unverified | 0 |
| VTGaussian-SLAM: RGBD SLAM for Large Scale Scenes with Splatting View-Tied 3D Gaussians | Jun 3, 2025 | GPUSimultaneous Localization and Mapping | —Unverified | 0 |
| Diffusion Buffer: Online Diffusion-based Speech Enhancement with Sub-Second Latency | Jun 3, 2025 | GPUSpeech Enhancement | —Unverified | 0 |
| COALESCE: Economic and Security Dynamics of Skill-Based Task Outsourcing Among Team of Autonomous LLM Agents | Jun 2, 2025 | GPULarge Language Model | —Unverified | 0 |
| Pushing the Limits of Beam Search Decoding for Transducer-based ASR models | May 30, 2025 | GPU | —Unverified | 0 |
| NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation | May 30, 2025 | Autonomous DrivingGPU | CodeCode Available | 0 |
| Recipes for Pre-training LLMs with MXFP8 | May 30, 2025 | GPU | —Unverified | 0 |
| Fine-tune Before Structured Pruning: Towards Compact and Accurate Self-Supervised Models for Speaker Diarization | May 30, 2025 | GPUKnowledge Distillation | —Unverified | 0 |
| LlamaRL: A Distributed Asynchronous Reinforcement Learning Framework for Efficient Large-scale LLM Trainin | May 29, 2025 | GPUReinforcement Learning (RL) | —Unverified | 0 |
| TSENOR: Highly-Efficient Algorithm for Finding Transposable N:M Sparse Masks | May 29, 2025 | GPUNetwork Pruning | —Unverified | 0 |
| CF-DETR: Coarse-to-Fine Transformer for Real-Time Object Detection | May 29, 2025 | GPUobject-detection | —Unverified | 0 |
| LUMION: Fast Fault Recovery for ML Jobs Using Programmable Optical Fabrics | May 29, 2025 | GPU | —Unverified | 0 |
| LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering | May 29, 2025 | 3DGSGPU | —Unverified | 0 |
| LoLA: Low-Rank Linear Attention With Sparse Caching | May 29, 2025 | 4k8k | —Unverified | 0 |
| NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding | May 28, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SHTOcc: Effective 3D Occupancy Prediction with Sparse Head and Tail Voxels | May 28, 2025 | Autonomous DrivingGPU | CodeCode Available | 0 |
| FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control | May 28, 2025 | GPUHumanoid Control | —Unverified | 0 |
| Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule | May 28, 2025 | CPUGPU | —Unverified | 0 |
| CPINN-ABPI: Physics-Informed Neural Networks for Accurate Power Estimation in MPSoCs | May 28, 2025 | Computational EfficiencyCPU | —Unverified | 0 |
| Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape | May 28, 2025 | GPU | CodeCode Available | 0 |
| STACI: Spatio-Temporal Aleatoric Conformal Inference | May 27, 2025 | Gaussian ProcessesGPU | —Unverified | 0 |
| Dual Natural Gradient Descent for Scalable Training of Physics-Informed Neural Networks | May 27, 2025 | GPU | —Unverified | 0 |
| Fast and Cost-effective Speculative Edge-Cloud Decoding with Early Exits | May 27, 2025 | GPU | —Unverified | 0 |
| InstGenIE: Generative Image Editing Made Efficient with Mask-aware Caching and Scheduling | May 27, 2025 | DenoisingGPU | —Unverified | 0 |
| Accelerating Flow-Matching-Based Text-to-Speech via Empirically Pruned Step Sampling | May 26, 2025 | GPUtext-to-speech | —Unverified | 0 |
| APE: A Data-Centric Benchmark for Efficient LLM Adaptation in Text Summarization | May 26, 2025 | GPUNews Summarization | CodeCode Available | 0 |
| SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale | May 26, 2025 | Decision MakingGPU | —Unverified | 0 |
| FinLoRA: Benchmarking LoRA Methods for Fine-Tuning LLMs on Financial Datasets | May 26, 2025 | BenchmarkingGPU | —Unverified | 0 |
| Is Architectural Complexity Overrated? Competitive and Interpretable Knowledge Graph Completion with RelatE | May 25, 2025 | GPUKnowledge Graph Completion | —Unverified | 0 |
| eACGM: Non-instrumented Performance Tracing and Anomaly Detection towards Machine Learning Systems | May 25, 2025 | Anomaly DetectionFault Diagnosis | —Unverified | 0 |
| TextDiffuser-RL: Efficient and Robust Text Layout Optimization for High-Fidelity Text-to-Image Synthesis | May 25, 2025 | CPUGPU | —Unverified | 0 |
| Triangle Splatting for Real-Time Radiance Field Rendering | May 25, 2025 | GPUNeRF | —Unverified | 0 |
| FastMamba: A High-Speed and Efficient Mamba Accelerator on FPGA with Accurate Quantization | May 25, 2025 | Computational EfficiencyCPU | —Unverified | 0 |
| Advancing Video Self-Supervised Learning via Image Foundation Models | May 25, 2025 | GPURepresentation Learning | CodeCode Available | 0 |
| Climate Implications of Diffusion-based Generative Visual AI Systems and their Mass Adoption | May 24, 2025 | GPU | —Unverified | 0 |
| KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning | May 24, 2025 | GPUparameter-efficient fine-tuning | —Unverified | 0 |
| GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning | May 24, 2025 | GPUOffline RL | —Unverified | 0 |
| A DSP-Free Carrier Phase Recovery System using 16-Offset-QAM Laser Forwarded Links for 400Gb/s and Beyond | May 24, 2025 | GPU | —Unverified | 0 |
| HD-PiSSA: High-Rank Distributed Orthogonal Adaptation | May 24, 2025 | Code GenerationGPU | —Unverified | 0 |
| Dynamic Risk Assessments for Offensive Cybersecurity Agents | May 23, 2025 | GPU | CodeCode Available | 0 |
| Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models | May 23, 2025 | GPULanguage Modeling | CodeCode Available | 0 |
| Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence | May 23, 2025 | GPULarge Language Model | —Unverified | 0 |
| A deep solver for backward stochastic Volterra integral equations | May 23, 2025 | GPU | CodeCode Available | 0 |
| LLM-Based Emulation of the Radio Resource Control Layer: Towards AI-Native RAN Protocols | May 22, 2025 | GPU | —Unverified | 0 |
| FPQVAR: Floating Point Quantization for Visual Autoregressive Model with FPGA Hardware Co-design | May 22, 2025 | GPUImage Generation | CodeCode Available | 0 |
| GMatch: Geometry-Constrained Feature Matching for RGB-D Object Pose Estimation | May 22, 2025 | GPUPose Estimation | —Unverified | 0 |
| Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review | May 22, 2025 | Federated LearningGPU | —Unverified | 0 |
| The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm | May 22, 2025 | GPU | —Unverified | 0 |
| Guidelines for the Quality Assessment of Energy-Aware NAS Benchmarks | May 21, 2025 | BenchmarkingGPU | —Unverified | 0 |