| Value-Spectrum: Quantifying Preferences of Vision-Language Models via Value Decomposition in Social Media Contexts | Nov 18, 2024 | BenchmarkingMultimodal Large Language Model | CodeCode Available | 0 |
| Benchmarking pre-trained text embedding models in aligning built asset information | Nov 18, 2024 | Asset ManagementBenchmarking | CodeCode Available | 0 |
| Countering Backdoor Attacks in Image Recognition: A Survey and Evaluation of Mitigation Strategies | Nov 17, 2024 | Benchmarking | —Unverified | 0 |
| FastDraft: How to Train Your Draft | Nov 17, 2024 | BenchmarkingCode Completion | —Unverified | 0 |
| Reinforcing Competitive Multi-Agents for Playing So Long Sucker | Nov 17, 2024 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Different Horses for Different Courses: Comparing Bias Mitigation Algorithms in ML | Nov 17, 2024 | BenchmarkingFairness | —Unverified | 0 |
| Towards a Comprehensive Benchmark for Pathological Lymph Node Metastasis in Breast Cancer Sections | Nov 16, 2024 | BenchmarkingDiagnostic | CodeCode Available | 0 |
| Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level | Nov 15, 2024 | Benchmarkingcounterfactual | —Unverified | 0 |
| The ParClusterers Benchmark Suite (PCBS): A Fine-Grained Analysis of Scalable Graph Clustering | Nov 15, 2024 | BenchmarkingClustering | —Unverified | 0 |
| Automated Coding of Communications in Collaborative Problem-solving Tasks Using ChatGPT | Nov 15, 2024 | Benchmarking | —Unverified | 0 |