| LMME3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs | Apr 29, 2025 | BenchmarkingFace Generation | —Unverified | 0 |
| LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models | Jul 17, 2024 | BenchmarkingLanguage Modelling | —Unverified | 0 |
| Load-independent Metrics for Benchmarking Force Controllers | May 13, 2025 | Benchmarking | —Unverified | 0 |
| Local Data Quantity-Aware Weighted Averaging for Federated Learning with Dishonest Clients | Apr 17, 2025 | BenchmarkingFederated Learning | —Unverified | 0 |
| Logically at Factify 2: A Multi-Modal Fact Checking System Based on Evidence Retrieval techniques and Transformer Encoder Architecture | Jan 9, 2023 | AvgBenchmarking | —Unverified | 0 |
| Logically at Factify 2022: Multimodal Fact Verification | Dec 16, 2021 | BenchmarkingFact Checking | —Unverified | 0 |
| Benchmarking Continuous Time Models for Predicting Multiple Sclerosis Progression | Feb 15, 2023 | Benchmarking | —Unverified | 0 |
| LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation | Jan 9, 2025 | 2k8k | —Unverified | 0 |
| Long Range Arena : A Benchmark for Efficient Transformers | Jan 1, 2021 | 16kBenchmarking | —Unverified | 0 |
| Look, Read and Feel: Benchmarking Ads Understanding with Multimodal Multitask Learning | Dec 21, 2019 | BenchmarkingPrediction | —Unverified | 0 |