| Slimmable Encoders for Flexible Split DNNs in Bandwidth and Resource Constrained IoT Systems | Jun 22, 2023 | Edge-computingGPU | —Unverified | 0 |
| ReCycle: Resilient Training of Large DNNs using Pipeline Adaptation | May 22, 2024 | GPU | —Unverified | 0 |
| SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving | Aug 5, 2024 | GPU | —Unverified | 0 |
| SLOs-Serve: Optimized Serving of Multi-SLO LLMs | Apr 5, 2025 | ChatbotGPU | —Unverified | 0 |
| SLSDeep: Skin Lesion Segmentation Based on Dilated Residual and Pyramid Pooling Networks | May 25, 2018 | DecoderGPU | —Unverified | 0 |
| Small Language Models in the Real World: Insights from Industrial Text Classification | May 21, 2025 | ClassificationDecoder | —Unverified | 0 |
| Small-Text: Active Learning for Text Classification in Python | Jul 21, 2021 | Active LearningClassification | —Unverified | 0 |
| SmartQuant: CXL-based AI Model Store in Support of Runtime Configurable Weight Quantization | Jul 17, 2024 | GPUQuantization | —Unverified | 0 |
| SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training | Nov 21, 2022 | cross-modal alignmentGPU | —Unverified | 0 |
| SMDP-Based Dynamic Batching for Efficient Inference on GPU-Based Platforms | Jan 30, 2023 | Edge-computingGPU | —Unverified | 0 |
| SM-NAS: Structural-to-Modular Neural Architecture Search for Object Detection | Nov 22, 2019 | GPUNeural Architecture Search | —Unverified | 0 |
| SmolVLM: Redefining small and efficient multimodal models | Apr 7, 2025 | GPU | —Unverified | 0 |
| Snap ML: A Hierarchical Framework for Machine Learning | Mar 16, 2018 | BIG-bench Machine LearningGPU | —Unverified | 0 |
| SNeRF: Stylized Neural Implicit Representations for 3D Scenes | Jul 5, 2022 | GPUInductive Bias | —Unverified | 0 |
| S-Net: A Scalable Convolutional Neural Network for JPEG Compression Artifact Reduction | Oct 18, 2018 | GPUJPEG Artifact Correction | —Unverified | 0 |
| CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation | May 8, 2023 | GPUModel Compression | —Unverified | 0 |
| SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search | Jan 30, 2023 | GPUPolicy Gradient Methods | —Unverified | 0 |
| Software implemented fault diagnosis of natural gas pumping unit based on feedforward neural network | Feb 25, 2025 | DescriptiveDiagnostic | —Unverified | 0 |
| SOLIS -- The MLOps journey from data acquisition to actionable insights | Dec 22, 2021 | BIG-bench Machine LearningGPU | —Unverified | 0 |
| SOL: Reducing the Maintenance Overhead for Integrating Hardware Support into AI Frameworks | May 19, 2022 | CPUGPU | —Unverified | 0 |
| Solving Large Sequential Games with the Excessive Gap Technique | Oct 7, 2018 | counterfactualForm | —Unverified | 0 |
| Solving machine learning optimization problems using quantum computers | Nov 17, 2019 | BIG-bench Machine LearningCPU | —Unverified | 0 |
| Solving the Uncapacitated Single Allocation p-Hub Median Problem on GPU | Apr 14, 2017 | GPU | —Unverified | 0 |
| Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge | Mar 12, 2025 | CPUGPU | —Unverified | 0 |
| Sort-free Gaussian Splatting via Weighted Sum Rendering | Oct 24, 2024 | 3DGS3D Scene Reconstruction | —Unverified | 0 |