| AI-assisted Agile Propagation Modeling for Real-time Digital Twin Wireless Networks | Oct 29, 2024 | Computational EfficiencyCPU | —Unverified | 0 |
| Cora: Accelerating Stateful Network Applications with SmartNICs | Oct 29, 2024 | CPU | —Unverified | 0 |
| ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference | Oct 28, 2024 | CPU | CodeCode Available | 3 |
| Accelerated Bayesian parameter estimation and model selection for gravitational waves with normalizing flows | Oct 28, 2024 | CPUGPU | —Unverified | 0 |
| Deep Optimizer States: Towards Scalable Training of Transformer Models Using Interleaved Offloading | Oct 26, 2024 | CPUGPU | CodeCode Available | 0 |
| Structured Connectivity for 6G Reflex Arc: Task-Oriented Virtual User and New Uplink-Downlink Tradeoff | Oct 24, 2024 | ARCCPU | —Unverified | 0 |
| Multi-objective Optimization in CPU Design Space Exploration: Attention is All You Need | Oct 24, 2024 | AllCPU | —Unverified | 0 |
| Sensing-Communication-Computing-Control Closed-Loop Optimization for 6G Unmanned Robotic Systems | Oct 24, 2024 | ARCCPU | —Unverified | 0 |
| Rawsamble: Overlapping and Assembling Raw Nanopore Signals using a Hash-based Seeding Mechanism | Oct 23, 2024 | CPU | CodeCode Available | 2 |
| ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference | Oct 23, 2024 | Computational EfficiencyCPU | —Unverified | 0 |
| AI-focused HPC Data Centers Can Provide More Power Grid Flexibility and at Lower Cost | Oct 22, 2024 | CPUGPU | —Unverified | 0 |
| FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs | Oct 22, 2024 | CPUGPU | —Unverified | 0 |
| MagicPIG: LSH Sampling for Efficient LLM Generation | Oct 21, 2024 | CPUGPU | CodeCode Available | 3 |
| InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems | Oct 21, 2024 | Automated Theorem ProvingCPU | CodeCode Available | 4 |
| Accelerate Coastal Ocean Circulation Model with AI Surrogate | Oct 19, 2024 | CPUDisaster Response | —Unverified | 0 |
| syren-new: Precise formulae for the linear and nonlinear matter power spectra with massive neutrinos and dynamical dark energy | Oct 18, 2024 | CPUGPU | CodeCode Available | 1 |
| CoreGuard: Safeguarding Foundational Capabilities of LLMs Against Model Stealing in Edge Deployment | Oct 16, 2024 | CPUGPU | —Unverified | 0 |
| Towards Arbitrary QUBO Optimization: Analysis of Classical and Quantum-Activated Feedforward Neural Networks | Oct 16, 2024 | CPUDecoder | —Unverified | 0 |
| A Transformer Based Generative Chemical Language AI Model for Structural Elucidation of Organic Compounds | Oct 13, 2024 | CPUDecoder | —Unverified | 0 |
| ActNAS : Generating Efficient YOLO Models using Activation NAS | Oct 11, 2024 | CPUGPU | —Unverified | 0 |
| Bukva: Russian Sign Language Alphabet | Oct 11, 2024 | CPUSign Language Recognition | CodeCode Available | 0 |
| Unveiling Molecular Secrets: An LLM-Augmented Linear Model for Explainable and Calibratable Molecular Property Prediction | Oct 11, 2024 | CPUDimensionality Reduction | CodeCode Available | 0 |
| Superpipeline: A Universal Approach for Reducing GPU Memory Usage in Large Models | Oct 11, 2024 | CPUGPU | CodeCode Available | 0 |
| KV Prediction for Improved Time to First Token | Oct 10, 2024 | Code CompletionCPU | —Unverified | 0 |
| Octopus Inspired Optimization Algorithm: Multi-Level Structures and Parallel Computing Strategies | Oct 10, 2024 | Computational EfficiencyCPU | CodeCode Available | 1 |