| Simulation of Large Scale Neural Networks for Evaluation Applications | May 20, 2018 | Benchmarking | —Unverified | 0 |
| SinaTools: Open Source Toolkit for Arabic Natural Language Processing | Nov 3, 2024 | Benchmarking, Lemmatization | —Unverified | 0 |
| SINDy vs Hard Nonlinearities and Hidden Dynamics: a Benchmarking Study | Mar 1, 2024 | Benchmarking | —Unverified | 0 |
| Single-Cell Omics Arena: A Benchmark Study for Large Language Models on Cell Type Annotation Using Single-Cell Data | Dec 3, 2024 | Benchmarking | —Unverified | 0 |
| Single Stage Prediction with Embedded Topic Modeling of Online Reviews for Mobile App Management | Feb 19, 2018 | Benchmarking, Management | —Unverified | 0 |
| Site2Vec: a reference frame invariant algorithm for vector embedding of protein-ligand binding sites | Mar 18, 2020 | Benchmarking, Drug Discovery | —Unverified | 0 |
| Six-CD: Benchmarking Concept Removals for Text-to-image Diffusion Models | Jan 1, 2025 | Benchmarking | —Unverified | 0 |
| Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation | Jan 27, 2025 | Benchmarking, C++ code | —Unverified | 0 |
| Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping | Oct 21, 2024 | Benchmarking | —Unverified | 0 |
| Sketch 'n Solve: An Efficient Python Package for Large-Scale Least Squares Using Randomized Numerical Linear Algebra | Sep 22, 2024 | Benchmarking | —Unverified | 0 |
| Sketchtopia: A Dataset and Foundational Agents for Benchmarking Asynchronous Multimodal Communication with Iconic Feedback | Jan 1, 2025 | Benchmarking | —Unverified | 0 |
| Skills and Liquidity Barriers to Youth Employment: Medium-term Evidence from a Cash Benchmarking Experiment in Rwanda | Sep 18, 2022 | Benchmarking | —Unverified | 0 |
| SkyRover: A Modular Simulator for Cross-Domain Pathfinding | Feb 13, 2025 | Benchmarking | —Unverified | 0 |
| SlangDIT: Benchmarking LLMs in Interpretative Slang Translation | May 20, 2025 | Benchmarking, Sentence | —Unverified | 0 |
| SMiCRM: A Benchmark Dataset of Mechanistic Molecular Images | Jul 25, 2024 | Benchmarking | —Unverified | 0 |
| Smiling Women Pitching Down: Auditing Representational and Presentational Gender Biases in Image Generative AI | May 17, 2023 | Benchmarking | —Unverified | 0 |
| SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge | May 17, 2024 | Benchmarking, Social Media Popularity Prediction | —Unverified | 0 |
| SMPLy Benchmarking 3D Human Pose Estimation in the Wild | Dec 4, 2020 | 3D Human Pose Estimation, Benchmarking | —Unverified | 0 |
| SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos | Apr 14, 2022 | Benchmarking, Multiple Object Tracking | —Unverified | 0 |
| SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents | Jul 2, 2021 | Benchmarking, Deep Reinforcement Learning | —Unverified | 0 |
| Social Bias Probing: Fairness Benchmarking for Language Models | Nov 15, 2023 | Benchmarking, Fairness | —Unverified | 0 |
| Sockpuppet Detection in Wikipedia: A Corpus of Real-World Deceptive Writing for Linking Identities | Oct 24, 2013 | Benchmarking | —Unverified | 0 |
| Socratic-PRMBench: Benchmarking Process Reward Models with Systematic Reasoning Patterns | May 29, 2025 | Benchmarking | —Unverified | 0 |
| So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection | May 24, 2025 | Benchmarking, Image Forgery Detection | —Unverified | 0 |
| Soft-Hard Attention U-Net Model and Benchmark Dataset for Multiscale Image Shadow Removal | Aug 7, 2024 | Benchmarking, Hard Attention | —Unverified | 0 |
| Software Development Life Cycle Perspective: A Survey of Benchmarks for Code Large Language Models and Agents | May 8, 2025 | Benchmarking | —Unverified | 0 |
| SoK: Systematization and Benchmarking of Deepfake Detectors in a Unified Framework | Jan 9, 2024 | Benchmarking, DeepFake Detection | —Unverified | 0 |
| SOLAR: A Highly Optimized Data Loading Framework for Distributed Training of CNN-based Scientific Surrogates | Nov 1, 2022 | Benchmarking | —Unverified | 0 |
| Solar Multimodal Transformer: Intraday Solar Irradiance Predictor using Public Cameras and Time Series | Feb 28, 2025 | Benchmarking, Solar Irradiance Forecasting | —Unverified | 0 |
| Solver Scheduling via Answer Set Programming | Jan 6, 2014 | Benchmarking, Scheduling | —Unverified | 0 |
| Solving the chemical master equation for monomolecular reaction systems analytically: a Doi-Peliti path integral view | Nov 3, 2019 | Benchmarking | —Unverified | 0 |
| Solving Urban Network Security Games: Learning Platform, Benchmark, and Challenge for AI Research | Jan 29, 2025 | Benchmarking | —Unverified | 0 |
| SOMPT22: A Surveillance Oriented Multi-Pedestrian Tracking Dataset | Aug 4, 2022 | Benchmarking, Multi-Object Tracking | —Unverified | 0 |
| SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents | Jun 9, 2025 | Benchmarking, Synthetic Data Generation | —Unverified | 0 |
| SortBench: Benchmarking LLMs based on their ability to sort lists | Apr 11, 2025 | Benchmarking | —Unverified | 0 |
| SOSBENCH: Benchmarking Safety Alignment on Scientific Knowledge | May 27, 2025 | Benchmarking, Multiple-choice | —Unverified | 0 |
| So you think you can track? | Sep 13, 2023 | Benchmarking, Object | —Unverified | 0 |
| SpaceTx: A Roadmap for Benchmarking Spatial Transcriptomics Exploration of the Brain | Jan 20, 2023 | Benchmarking, Cell Segmentation | —Unverified | 0 |
| Sparse Deep Nonnegative Matrix Factorization | Jul 28, 2017 | Benchmarking, Dimensionality Reduction | —Unverified | 0 |
| Sparse Representation-Based Classification: Orthogonal Least Squares or Orthogonal Matching Pursuit? | Jul 18, 2016 | Benchmarking, Classification | —Unverified | 0 |
| Spatially Binned ROC: A Comprehensive Saliency Metric | Jun 1, 2016 | Benchmarking | —Unverified | 0 |
| Spatially Correlated Patterns in Adversarial Images | Nov 21, 2020 | Benchmarking, Blocking | —Unverified | 0 |
| Spatio-Temporal Latent Graph Structure Learning for Traffic Forecasting | Feb 25, 2022 | Benchmarking, Graph Neural Network | —Unverified | 0 |
| Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues | Apr 21, 2025 | Benchmarking, Speaker Identification | —Unverified | 0 |
| SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration | Dec 14, 2023 | Benchmarking, Point Cloud Registration | —Unverified | 0 |
| Speech Self-Supervised Representations Benchmarking: a Case for Larger Probing Heads | Aug 28, 2023 | Benchmarking, Self-Supervised Learning | —Unverified | 0 |
| SpeechVerse: A Large-scale Generalizable Audio Language Model | May 14, 2024 | Automatic Speech Recognition, Benchmarking | —Unverified | 0 |
| Speed Benchmarking of Genetic Programming Frameworks | May 25, 2021 | Benchmarking | —Unverified | 0 |
| SPINEX-Clustering: Similarity-based Predictions with Explainable Neighbors Exploration for Clustering Problems | Jul 9, 2024 | Benchmarking, Clustering | —Unverified | 0 |
| SPINEX_ Symbolic Regression: Similarity-based Symbolic Regression with Explainable Neighbors Exploration | Nov 5, 2024 | Benchmarking, regression | —Unverified | 0 |