| Learn to Solve Vehicle Routing Problems ASAP: A Neural Optimization Approach for Time-Constrained Vehicle Routing Problems with Finite Vehicle Fleet | Nov 7, 2024 | BenchmarkingCombinatorial Optimization | —Unverified | 0 |
| ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding | Nov 7, 2024 | BenchmarkingMultiple-choice | —Unverified | 0 |
| Enhancing Reverse Engineering: Investigating and Benchmarking Large Language Models for Vulnerability Analysis in Decompiled Binaries | Nov 7, 2024 | Benchmarking | —Unverified | 0 |
| HandCraft: Anatomically Correct Restoration of Malformed Hands in Diffusion Generated Images | Nov 7, 2024 | AnatomyBenchmarking | —Unverified | 0 |
| Benchmarking Large Language Models with Integer Sequence Generation Tasks | Nov 7, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale | Nov 7, 2024 | Active LearningBenchmarking | —Unverified | 0 |
| Generating Synthetic Electronic Health Record (EHR) Data: A Review with Benchmarking | Nov 6, 2024 | Benchmarking | —Unverified | 0 |
| Beemo: Benchmark of Expert-edited Machine-generated Outputs | Nov 6, 2024 | Benchmarking | CodeCode Available | 0 |
| TDDBench: A Benchmark for Training data detection | Nov 5, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level | Nov 5, 2024 | Bayesian OptimisationBenchmarking | —Unverified | 0 |