| Leveraging LLMs to Create a Haptic Devices' Recommendation System | Jan 22, 2025 | Benchmarking | Unverified | 0 |
| Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning | Jan 22, 2025 | Benchmarking | Code Available | 0 |
| RAG-Reward: Optimizing RAG with Reward Modeling and RLHF | Jan 22, 2025 | Benchmarking, Hallucination | Unverified | 0 |
| Benchmarking Generative AI for Scoring Medical Student Interviews in Objective Structured Clinical Examinations (OSCEs) | Jan 21, 2025 | Benchmarking | Unverified | 0 |
| Benchmarking Randomized Optimization Algorithms on Binary, Permutation, and Combinatorial Problem Landscapes | Jan 21, 2025 | Benchmarking | Unverified | 0 |
| Optimally-Weighted Maximum Mean Discrepancy Framework for Continual Learning | Jan 21, 2025 | Benchmarking, Continual Learning | Unverified | 0 |
| Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems | Jan 21, 2025 | Autonomous Vehicles, Benchmarking | Code Available | 0 |
| Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing | Jan 20, 2025 | Benchmarking, Evolutionary Algorithms | Unverified | 0 |
| Algorithm Selection with Probing Trajectories: Benchmarking the Choice of Classifier Model | Jan 20, 2025 | Benchmarking | Unverified | 0 |
| Benchmarking Large Language Models via Random Variables | Jan 20, 2025 | Benchmarking, Mathematical Reasoning | Unverified | 0 |
| An Interpretable Measure for Quantifying Predictive Dependence between Continuous Random Variables -- Extended Version | Jan 18, 2025 | Benchmarking | Unverified | 0 |
| FORLAPS: An Innovative Data-Driven Reinforcement Learning Approach for Prescriptive Process Monitoring | Jan 17, 2025 | Benchmarking, Data Augmentation | Unverified | 0 |
| ColorGrid: A Multi-Agent Non-Stationary Environment for Goal Inference and Assistance | Jan 17, 2025 | Benchmarking, Multi-agent Reinforcement Learning | Code Available | 0 |
| Village-Net Clustering: A Rapid approach to Non-linear Unsupervised Clustering of High-Dimensional Data | Jan 16, 2025 | Benchmarking, Clustering | Unverified | 0 |
| PixelBrax: Learning Continuous Control from Pixels End-to-End on the GPU | Jan 16, 2025 | Benchmarking, continuous-control | Code Available | 0 |
| Similarity-Quantized Relative Difference Learning for Improved Molecular Activity Prediction | Jan 15, 2025 | Activity Prediction, Benchmarking | Unverified | 0 |
| Cancer-Net PCa-Seg: Benchmarking Deep Learning Models for Prostate Cancer Segmentation Using Synthetic Correlated Diffusion Imaging | Jan 15, 2025 | Benchmarking, Computational Efficiency | Unverified | 0 |
| MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents | Jan 15, 2025 | Benchmarking, Optical Character Recognition (OCR) | Unverified | 0 |
| Evaluating SAT and SMT Solvers on Large-Scale Sudoku Puzzles | Jan 15, 2025 | Benchmarking | Code Available | 0 |
| Off-policy Evaluation for Payments at Adyen | Jan 15, 2025 | Benchmarking, Decision Making | Unverified | 0 |
| Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval | Jan 15, 2025 | Benchmarking, Contrastive Learning | Unverified | 0 |
| Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning | Jan 14, 2025 | Benchmarking, Management | Unverified | 0 |
| Keras Sig: Efficient Path Signature Computation on GPU in Keras 3 | Jan 14, 2025 | Benchmarking, C++ code | Unverified | 0 |
| Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition | Jan 14, 2025 | Activity Recognition, Benchmarking | Unverified | 0 |
| Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving | Jan 14, 2025 | Autonomous Driving, Benchmarking | Unverified | 0 |