| Sequential Large Language Model-Based Hyper-parameter Optimization | Oct 27, 2024 | Bayesian OptimizationBenchmarking | CodeCode Available | 0 |
| Multi-input Multi-output Loewner Framework for Vibration-based Damage Detection on a Trainer Jet | Oct 26, 2024 | BenchmarkingCantilever Beam | —Unverified | 0 |
| OGBench: Benchmarking Offline Goal-Conditioned RL | Oct 26, 2024 | Benchmarkingreinforcement-learning | CodeCode Available | 3 |
| SFTrack: A Robust Scale and Motion Adaptive Algorithm for Tracking Small and Fast Moving Objects | Oct 26, 2024 | BenchmarkingMulti-Object Tracking | —Unverified | 0 |
| AutoMIR: Effective Zero-Shot Medical Information Retrieval without Relevance Labels | Oct 26, 2024 | BenchmarkingInformation Retrieval | CodeCode Available | 0 |
| MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding | Oct 25, 2024 | Benchmarkingdocument understanding | —Unverified | 0 |
| OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery | Oct 25, 2024 | Benchmarkingimage-classification | —Unverified | 0 |
| A Survey of Small Language Models | Oct 25, 2024 | BenchmarkingModel Compression | —Unverified | 0 |
| An Auditing Test To Detect Behavioral Shift in Language Models | Oct 25, 2024 | BenchmarkingChange Detection | CodeCode Available | 0 |
| FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs | Oct 25, 2024 | BenchmarkingFairness | —Unverified | 0 |