| MST: Adaptive Multi-Scale Tokens Guided Interactive Segmentation | Jan 9, 2024 | Benchmarking, Interactive Segmentation | Code Available | 0 |
| ferret: a Framework for Benchmarking Explainers on Transformers | Aug 2, 2022 | Benchmarking, Explainable Artificial Intelligence (XAI) | Code Available | 0 |
| Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on Turkish | Sep 13, 2023 | Benchmarking, Translation | Code Available | 0 |
| FEET: A Framework for Evaluating Embedding Techniques | Nov 2, 2024 | Benchmarking, Representation Learning | Code Available | 0 |
| Benchmarking Probabilistic Deep Learning Methods for License Plate Recognition | Feb 2, 2023 | Benchmarking, Deep Learning | Code Available | 0 |
| Unraveling the Capabilities of Language Models in News Summarization | Jan 30, 2025 | Benchmarking, Few-Shot Learning | Code Available | 0 |
| mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale | Jun 26, 2025 | Anomaly Detection, Benchmarking | Code Available | 0 |
| FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks | Apr 18, 2021 | Benchmarking, Federated Learning | Code Available | 0 |
| MUBen: Benchmarking the Uncertainty of Molecular Representation Models | Jun 14, 2023 | Benchmarking, Drug Discovery | Code Available | 0 |
| The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection | Sep 17, 2024 | Benchmarking, Event Detection | Code Available | 0 |
| WAC: A Corpus of Wikipedia Conversations for Online Abuse Detection | Mar 13, 2020 | Abuse Detection, Benchmarking | Code Available | 0 |
| FedSecurity: Benchmarking Attacks and Defenses in Federated Learning and Federated LLMs | Jun 8, 2023 | Benchmarking, Federated Learning | Code Available | 0 |
| Fedivertex: a Graph Dataset based on Decentralized Social Networks for Trustworthy Machine Learning | May 27, 2025 | Benchmarking | Code Available | 0 |
| Feature interpretability in BCIs: exploring the role of network lateralization | Jul 16, 2024 | Benchmarking, EEG | Code Available | 0 |
| AutoBench-V: Can Large Vision-Language Models Benchmark Themselves? | Oct 28, 2024 | Benchmarking, Question Answering | Code Available | 0 |
| Benchmarking pre-trained text embedding models in aligning built asset information | Nov 18, 2024 | Asset Management, Benchmarking | Code Available | 0 |
| Benchmarking Pre-trained Language Models for Multilingual NER: TraSpaS at the BSNLP2021 Shared Task | Apr 1, 2021 | Benchmarking, Language Modeling | Code Available | 0 |
| Feature embedding in click-through rate prediction | Sep 20, 2022 | Benchmarking, Click-Through Rate Prediction | Code Available | 0 |
| Acoustic Identification of Ae. aegypti Mosquitoes using Smartphone Apps and Residual Convolutional Neural Networks | Jun 16, 2023 | Benchmarking | Code Available | 0 |
| FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs' Responsiveness to Human Feedback | Oct 12, 2024 | Benchmarking | Code Available | 0 |
| Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis | Feb 18, 2025 | Benchmarking, Mamba | Code Available | 0 |
| Multi-EuP: The Multilingual European Parliament Dataset for Analysis of Bias in Information Retrieval | Nov 3, 2023 | Benchmarking, Fairness | Code Available | 0 |
| AuthNet: A Deep Learning based Authentication Mechanism using Temporal Facial Feature Movements | Dec 4, 2020 | Benchmarking, Lip password classification | Code Available | 0 |
| Yesterday's News: Benchmarking Multi-Dimensional Out-of-Distribution Generalisation of Misinformation Detection Models | Oct 12, 2024 | Benchmarking, Misinformation | Code Available | 0 |
| FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting | Aug 27, 2024 | Benchmarking, Decoder | Code Available | 0 |