| The Unconstrained Ear Recognition Challenge | Aug 23, 2017 | BenchmarkingPerson Recognition | —Unverified | 0 |
| The Unconstrained Ear Recognition Challenge 2019 - ArXiv Version With Appendix | Mar 11, 2019 | BenchmarkingPerson Recognition | —Unverified | 0 |
| THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models | Apr 17, 2025 | BenchmarkingMath | —Unverified | 0 |
| TIIF-Bench: How Does Your T2I Model Follow Your Instructions? | Jun 2, 2025 | BenchmarkingInstruction Following | —Unverified | 0 |
| Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection | Sep 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time | Sep 20, 2024 | BenchmarkingWorld Knowledge | —Unverified | 0 |
| Time Sensitive Knowledge Editing through Efficient Finetuning | Jun 6, 2024 | Benchmarkingknowledge editing | —Unverified | 0 |
| TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs | Mar 13, 2025 | BenchmarkingQuestion Answering | —Unverified | 0 |
| Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines | Feb 21, 2023 | Benchmarkingwhole slide images | —Unverified | 0 |
| Timing Excess Returns A cross-universe approach to alpha | Feb 11, 2020 | BenchmarkingTime Series | —Unverified | 0 |
| TinyML Platforms Benchmarking | Nov 30, 2021 | Benchmarking | —Unverified | 0 |
| Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset | Nov 2, 2022 | BenchmarkingEvent Extraction | —Unverified | 0 |
| TituLLMs: A Family of Bangla LLMs with Comprehensive Benchmarking | Feb 16, 2025 | Benchmarking | —Unverified | 0 |
| tmVar 3.0: an improved variant concept recognition and normalization tool | Apr 7, 2022 | Benchmarking | —Unverified | 0 |
| Token Sequence Compression for Efficient Multimodal Computing | Apr 24, 2025 | Benchmarking | —Unverified | 0 |
| Top-k Regularization for Supervised Feature Selection | Jun 4, 2021 | Benchmarkingfeature selection | —Unverified | 0 |
| Top Score on the Wrong Exam: On Benchmarking in Machine Learning for Vulnerability Detection | Aug 23, 2024 | BenchmarkingBinary Classification | —Unverified | 0 |
| Totally Corrective Boosting with Cardinality Penalization | Apr 7, 2015 | BenchmarkingCombinatorial Optimization | —Unverified | 0 |
| TOTOPO: Classifying univariate and multivariate time series with Topological Data Analysis | Oct 10, 2020 | BenchmarkingTime Series | —Unverified | 0 |
| Toward an ImageNet Library of Functions for Global Optimization Benchmarking | Jun 27, 2022 | Benchmarkingglobal-optimization | —Unverified | 0 |
| Toward end-to-end interpretable convolutional neural networks for waveform signals | May 3, 2024 | BenchmarkingEmotion Recognition | —Unverified | 0 |
| Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage | Dec 20, 2024 | AttributeBenchmarking | —Unverified | 0 |
| Towards a Benchmark for Scientific Understanding in Humans and Machines | Apr 20, 2023 | BenchmarkingInformation Retrieval | —Unverified | 0 |
| Towards a Human-Centred Cognitive Model of Visuospatial Complexity in Everyday Driving | May 29, 2020 | Benchmarking | —Unverified | 0 |
| Towards a Multidimensional Evaluation Framework for Empathetic Conversational Systems | Jul 26, 2024 | Benchmarking | —Unverified | 0 |
| Towards an AI Accountability Policy | Jul 25, 2023 | BenchmarkingFairness | —Unverified | 0 |
| Towards an Automated SOAP Note: Classifying Utterances from Medical Conversations | Jul 17, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards a Taxonomy of Graph Learning Datasets | Oct 27, 2021 | BenchmarkingGraph Learning | —Unverified | 0 |
| Towards a Theory-Guided Benchmarking Suite for Discrete Black-Box Optimization Heuristics: Profiling (1+λ) EA Variants on OneMax and LeadingOnes | Aug 17, 2018 | BenchmarkingEvolutionary Algorithms | —Unverified | 0 |
| Towards a Unified Framework for Determining Conformational Ensembles of Disordered Proteins | Apr 4, 2025 | Benchmarking | —Unverified | 0 |
| Towards Benchmarking and Assessing the Safety and Robustness of Autonomous Driving on Safety-critical Scenarios | Mar 31, 2025 | Adversarial AttackAutonomous Driving | —Unverified | 0 |
| Towards Benchmarking and Evaluating Deepfake Detection | Mar 4, 2022 | BenchmarkingDeepFake Detection | —Unverified | 0 |
| Towards Benchmarking Explainable Artificial Intelligence Methods | Aug 25, 2022 | BenchmarkingExplainable artificial intelligence | —Unverified | 0 |
| Towards Benchmarking Scene Background Initialization | Jun 12, 2015 | Benchmarking | —Unverified | 0 |
| Towards Benchmarking the Utility of Explanations for Model Debugging | May 10, 2021 | Benchmarking | —Unverified | 0 |
| Towards Class-agnostic Tracking Using Feature Decorrelation in Point Clouds | Feb 28, 2022 | BenchmarkingObject Tracking | —Unverified | 0 |
| Towards Effective Disambiguation for Machine Translation with Large Language Models | Sep 20, 2023 | BenchmarkingIn-Context Learning | —Unverified | 0 |
| Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques | Jun 6, 2025 | BenchmarkingModel Selection | —Unverified | 0 |
| Towards Explainability and Fairness in Swiss Judgement Prediction: Benchmarking on a Multilingual Dataset | Feb 26, 2024 | BenchmarkingCross-Lingual Transfer | —Unverified | 0 |
| Towards Explainable Network Intrusion Detection using Large Language Models | Aug 8, 2024 | BenchmarkingIntrusion Detection | —Unverified | 0 |
| Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking | Feb 16, 2023 | Benchmarkingcounterfactual | —Unverified | 0 |
| Towards Graph Foundation Models: A Study on the Generalization of Positional and Structural Encodings | Dec 10, 2024 | BenchmarkingGraph Learning | —Unverified | 0 |
| Towards Ideal Temporal Graph Neural Networks: Evaluations and Conclusions after 10,000 GPU Hours | Dec 28, 2024 | BenchmarkingGPU | —Unverified | 0 |
| Towards Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models | Mar 10, 2025 | AllBenchmarking | —Unverified | 0 |
| Towards Large-Scale Small Object Detection: Survey and Benchmarks | Jul 28, 2022 | BenchmarkingObject | —Unverified | 0 |
| Towards Long-Term predictions of Turbulence using Neural Operators | Jul 25, 2023 | Benchmarking | —Unverified | 0 |
| Towards More Robust NLP System Evaluation: Handling Missing Scores in Benchmarks | May 17, 2023 | Benchmarking | —Unverified | 0 |
| Towards Personalized Federated Learning | Mar 1, 2021 | BenchmarkingFederated Learning | —Unverified | 0 |
| Towards Private Learning on Decentralized Graphs with Local Differential Privacy | Jan 23, 2022 | BenchmarkingGraph Learning | —Unverified | 0 |
| Towards Productionizing Subjective Search Systems | Mar 31, 2020 | BenchmarkingLanguage Modelling | —Unverified | 0 |