| User-in-the-loop Evaluation of Multimodal LLMs for Activity Assistance | Aug 4, 2024 | Action AnticipationBenchmarking | —Unverified | 0 |
| SPINEX-TimeSeries: Similarity-based Predictions with Explainable Neighbors Exploration for Time Series and Forecasting Problems | Aug 4, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| Visual-Inertial SLAM for Unstructured Outdoor Environments: Benchmarking the Benefits and Computational Costs of Loop Closing | Aug 3, 2024 | Autonomous NavigationBenchmarking | CodeCode Available | 0 |
| Integrating Large Language Models and Knowledge Graphs for Extraction and Validation of Textual Test Data | Aug 3, 2024 | BenchmarkingKnowledge Graphs | CodeCode Available | 0 |
| Deep Reinforcement Learning for Dynamic Order Picking in Warehouse Operations | Aug 3, 2024 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| IBB Traffic Graph Data: Benchmarking and Road Traffic Prediction Model | Aug 2, 2024 | BenchmarkingFeature Engineering | —Unverified | 0 |
| Guardians of Image Quality: Benchmarking Defenses Against Adversarial Attacks on Image Quality Metrics | Aug 2, 2024 | Adversarial AttackAdversarial Purification | CodeCode Available | 1 |
| Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions | Aug 2, 2024 | Benchmarkingmultimodal interaction | CodeCode Available | 0 |
| RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework | Aug 2, 2024 | BenchmarkingDataset Generation | CodeCode Available | 3 |
| PINNs for Medical Image Analysis: A Survey | Aug 2, 2024 | AnatomyBenchmarking | —Unverified | 0 |