| Benchmarking Monocular 3D Dog Pose Estimation Using In-The-Wild Motion Capture Data | Jun 20, 2024 | Animal Pose EstimationBenchmarking | —Unverified | 0 | 0 |
| TOTOPO: Classifying univariate and multivariate time series with Topological Data Analysis | Oct 10, 2020 | BenchmarkingTime Series | —Unverified | 0 | 0 |
| LMFormer: Lane based Motion Prediction Transformer | Apr 14, 2025 | Autonomous DrivingBenchmarking | —Unverified | 0 | 0 |
| Benchmarking Modern Named Entity Recognition Techniques for Free-text Health Record De-identification | Mar 25, 2021 | BenchmarkingDecoder | —Unverified | 0 | 0 |
| LMME3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs | Apr 29, 2025 | BenchmarkingFace Generation | —Unverified | 0 | 0 |
| LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models | Jul 17, 2024 | BenchmarkingLanguage Modelling | —Unverified | 0 | 0 |
| Load-independent Metrics for Benchmarking Force Controllers | May 13, 2025 | Benchmarking | —Unverified | 0 | 0 |
| Benchmarking Mobile Device Control Agents across Diverse Configurations | Apr 25, 2024 | BenchmarkingImitation Learning | —Unverified | 0 | 0 |
| Local Data Quantity-Aware Weighted Averaging for Federated Learning with Dishonest Clients | Apr 17, 2025 | BenchmarkingFederated Learning | —Unverified | 0 | 0 |
| XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis | Jun 26, 2024 | Autonomous DrivingBenchmarking | —Unverified | 0 | 0 |
| Ensuring Reliability of Curated EHR-Derived Data: The Validation of Accuracy for LLM/ML-Extracted Information and Data (VALID) Framework | Jun 9, 2025 | BenchmarkingFairness | —Unverified | 0 | 0 |
| Benchmarking Middle-Trained Language Models for Neural Search | Jun 5, 2023 | BenchmarkingLanguage Modeling | —Unverified | 0 | 0 |
| Logically at Factify 2: A Multi-Modal Fact Checking System Based on Evidence Retrieval techniques and Transformer Encoder Architecture | Jan 9, 2023 | AvgBenchmarking | —Unverified | 0 | 0 |
| Logically at Factify 2022: Multimodal Fact Verification | Dec 16, 2021 | BenchmarkingFact Checking | —Unverified | 0 | 0 |
| Toward an ImageNet Library of Functions for Global Optimization Benchmarking | Jun 27, 2022 | Benchmarkingglobal-optimization | —Unverified | 0 | 0 |
| Benchmarking Meta-heuristic Optimization | Jul 27, 2020 | BenchmarkingEvolutionary Algorithms | —Unverified | 0 | 0 |
| Brittle Minds, Fixable Activations: Understanding Belief Representations in Language Models | Jun 25, 2024 | Benchmarking | —Unverified | 0 | 0 |
| Toward end-to-end interpretable convolutional neural networks for waveform signals | May 3, 2024 | BenchmarkingEmotion Recognition | —Unverified | 0 | 0 |
| Benchmarking MedMNIST dataset on real quantum hardware | Feb 18, 2025 | Benchmarkingimage-classification | —Unverified | 0 | 0 |
| Benchmarking Machine Translated Sentiment Analysis for Arabic Tweets | Jun 1, 2015 | BenchmarkingMachine Translation | —Unverified | 0 | 0 |
| Benchmarking Continuous Time Models for Predicting Multiple Sclerosis Progression | Feb 15, 2023 | Benchmarking | —Unverified | 0 | 0 |
| Benchmarking Machine Learning Robustness in Covid-19 Spike Sequence Classification | Sep 29, 2021 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 | 0 |
| Benchmarking Machine Learning Models to Predict Corporate Bankruptcy | Dec 22, 2022 | Benchmarking | —Unverified | 0 | 0 |
| LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation | Jan 9, 2025 | 2k8k | —Unverified | 0 | 0 |
| Long Range Arena : A Benchmark for Efficient Transformers | Jan 1, 2021 | 16kBenchmarking | —Unverified | 0 | 0 |