| TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models | Apr 29, 2025 | BenchmarkingDataset Generation | CodeCode Available | 0 |
| BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search | Dec 1, 2022 | BenchmarkingGPU | CodeCode Available | 0 |
| Knowing-how & Knowing-that: A New Task for Machine Comprehension of User Manuals | Jun 7, 2023 | BenchmarkingMachine Reading Comprehension | CodeCode Available | 0 |
| TFW2V: An Enhanced Document Similarity Method for the Morphologically Rich Finnish Language | Dec 23, 2021 | BenchmarkingClustering | CodeCode Available | 0 |
| Can Tree Based Approaches Surpass Deep Learning in Anomaly Detection? A Benchmarking Study | Feb 11, 2024 | Anomaly DetectionBenchmarking | CodeCode Available | 0 |
| LANTERN: A Machine Learning Framework for Lipid Nanoparticle Transfection Efficiency Prediction | Jul 3, 2025 | Benchmarking | CodeCode Available | 0 |
| Laparoscopic Image Desmoking Using the U-Net with New Loss Function and Integrated Differentiable Wiener Filter | May 27, 2025 | Benchmarking | CodeCode Available | 0 |
| LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing | Feb 14, 2025 | BenchmarkingRAG | CodeCode Available | 0 |
| Can LLMs Grasp Implicit Cultural Values? Benchmarking LLMs' Metacognitive Cultural Intelligence with CQ-Bench | Apr 1, 2025 | Benchmarking | CodeCode Available | 0 |
| Recurrent Quantum Neural Networks | Jun 25, 2020 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 0 |
| KhabarChin: Automatic Detection of Important News in the Persian Language | Dec 6, 2023 | ArticlesBenchmarking | CodeCode Available | 0 |
| Can geometric combinatorics improve RNA branching predictions? | Mar 26, 2025 | Benchmarking | CodeCode Available | 0 |
| BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search | Aug 9, 2021 | BenchmarkingGPU | CodeCode Available | 0 |
| Can a single neuron learn predictive uncertainty? | Jun 7, 2021 | BenchmarkingConformal Prediction | CodeCode Available | 0 |
| Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering | May 21, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| Large Language Models for Outpatient Referral: Problem Definition, Benchmarking and Challenges | Mar 11, 2025 | Benchmarking | CodeCode Available | 0 |
| Reference Matters: Benchmarking Factual Error Correction for Dialogue Summarization with Fine-grained Evaluation Framework | Jun 8, 2023 | Benchmarking | CodeCode Available | 0 |
| KArSL: Arabic Sign Language Database | Jan 1, 2021 | BenchmarkingSign Language Recognition | CodeCode Available | 0 |
| Can AI Validate Science? Benchmarking LLMs for Accurate Scientific Claim Evidence Reasoning | Jun 9, 2025 | BenchmarkingDiagnostic | CodeCode Available | 0 |
| JavaBench: A Benchmark of Object-Oriented Code Generation for Evaluating Large Language Models | Jun 10, 2024 | BenchmarkingCode Generation | CodeCode Available | 0 |
| TGB-Seq Benchmark: Challenging Temporal GNNs with Complex Sequential Dynamics | Feb 5, 2025 | BenchmarkingLink Prediction | CodeCode Available | 0 |
| Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning | May 7, 2024 | BenchmarkingContrastive Learning | CodeCode Available | 0 |
| KamNet: An Integrated Spatiotemporal Deep Neural Network for Rare Event Search in KamLAND-Zen | Mar 3, 2022 | Benchmarking | CodeCode Available | 0 |
| Joint Multi-Scale Tone Mapping and Denoising for HDR Image Enhancement | Mar 16, 2023 | BenchmarkingDemosaicking | CodeCode Available | 0 |
| Ref-Long: Benchmarking the Long-context Referencing Capability of Long-context Language Models | Jul 13, 2025 | AttributeBenchmarking | CodeCode Available | 0 |