| Software Development Life Cycle Perspective: A Survey of Benchmarks for Code Large Language Models and Agents | May 8, 2025 | Benchmarking | —Unverified | 0 |
| SoK: Systematization and Benchmarking of Deepfake Detectors in a Unified Framework | Jan 9, 2024 | Benchmarking, DeepFake Detection | —Unverified | 0 |
| SOLAR: A Highly Optimized Data Loading Framework for Distributed Training of CNN-based Scientific Surrogates | Nov 1, 2022 | Benchmarking | —Unverified | 0 |
| Solar Multimodal Transformer: Intraday Solar Irradiance Predictor using Public Cameras and Time Series | Feb 28, 2025 | Benchmarking, Solar Irradiance Forecasting | —Unverified | 0 |
| Solver Scheduling via Answer Set Programming | Jan 6, 2014 | Benchmarking, Scheduling | —Unverified | 0 |
| Solving the chemical master equation for monomolecular reaction systems analytically: a Doi-Peliti path integral view | Nov 3, 2019 | Benchmarking | —Unverified | 0 |
| Solving Urban Network Security Games: Learning Platform, Benchmark, and Challenge for AI Research | Jan 29, 2025 | Benchmarking | —Unverified | 0 |
| SOMPT22: A Surveillance Oriented Multi-Pedestrian Tracking Dataset | Aug 4, 2022 | Benchmarking, Multi-Object Tracking | —Unverified | 0 |
| SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents | Jun 9, 2025 | Benchmarking, Synthetic Data Generation | —Unverified | 0 |
| SortBench: Benchmarking LLMs based on their ability to sort lists | Apr 11, 2025 | Benchmarking | —Unverified | 0 |
| SOSBENCH: Benchmarking Safety Alignment on Scientific Knowledge | May 27, 2025 | Benchmarking, Multiple-choice | —Unverified | 0 |
| So you think you can track? | Sep 13, 2023 | Benchmarking, Object | —Unverified | 0 |
| SpaceTx: A Roadmap for Benchmarking Spatial Transcriptomics Exploration of the Brain | Jan 20, 2023 | Benchmarking, Cell Segmentation | —Unverified | 0 |
| Sparse Deep Nonnegative Matrix Factorization | Jul 28, 2017 | Benchmarking, Dimensionality Reduction | —Unverified | 0 |
| Sparse Representation-Based Classification: Orthogonal Least Squares or Orthogonal Matching Pursuit? | Jul 18, 2016 | Benchmarking, Classification | —Unverified | 0 |
| Spatially Binned ROC: A Comprehensive Saliency Metric | Jun 1, 2016 | Benchmarking | —Unverified | 0 |
| Spatially Correlated Patterns in Adversarial Images | Nov 21, 2020 | Benchmarking, Blocking | —Unverified | 0 |
| Spatio-Temporal Latent Graph Structure Learning for Traffic Forecasting | Feb 25, 2022 | Benchmarking, Graph Neural Network | —Unverified | 0 |
| Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues | Apr 21, 2025 | Benchmarking, Speaker Identification | —Unverified | 0 |
| SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration | Dec 14, 2023 | Benchmarking, Point Cloud Registration | —Unverified | 0 |
| Speech Self-Supervised Representations Benchmarking: a Case for Larger Probing Heads | Aug 28, 2023 | Benchmarking, Self-Supervised Learning | —Unverified | 0 |
| SpeechVerse: A Large-scale Generalizable Audio Language Model | May 14, 2024 | Automatic Speech Recognition, Benchmarking | —Unverified | 0 |
| Speed Benchmarking of Genetic Programming Frameworks | May 25, 2021 | Benchmarking | —Unverified | 0 |
| SPINEX-Clustering: Similarity-based Predictions with Explainable Neighbors Exploration for Clustering Problems | Jul 9, 2024 | Benchmarking, Clustering | —Unverified | 0 |
| SPINEX_ Symbolic Regression: Similarity-based Symbolic Regression with Explainable Neighbors Exploration | Nov 5, 2024 | Benchmarking, regression | —Unverified | 0 |