| Large Language Models as Automated Aligners for benchmarking Vision-Language Models | Nov 24, 2023 | BenchmarkingWorld Knowledge | —Unverified | 0 |
| An Empirical Investigation into Benchmarking Model Multiplicity for Trustworthy Machine Learning: A Case Study on Image Classification | Nov 24, 2023 | Benchmarkingimage-classification | —Unverified | 0 |
| Dialogue Quality and Emotion Annotations for Customer Support Conversations | Nov 23, 2023 | BenchmarkingDiversity | CodeCode Available | 0 |
| Learning Dynamic Selection and Pricing of Out-of-Home Deliveries | Nov 23, 2023 | BenchmarkingDecision Making | CodeCode Available | 0 |
| Automated 3D Tumor Segmentation using Temporal Cubic PatchGAN (TCuP-GAN) | Nov 23, 2023 | BenchmarkingBrain Tumor Segmentation | —Unverified | 0 |
| Creating and Leveraging a Synthetic Dataset of Cloud Optical Thickness Measures for Cloud Detection in MSI | Nov 23, 2023 | BenchmarkingCloud Detection | CodeCode Available | 0 |
| A projected nonlinear state-space model for forecasting time series signals | Nov 22, 2023 | BenchmarkingComputational Efficiency | CodeCode Available | 0 |
| Benchmarking Toxic Molecule Classification using Graph Neural Networks and Few Shot Learning | Nov 22, 2023 | BenchmarkingDrug Discovery | —Unverified | 0 |
| Benchmarking bias: Expanding clinical AI model card to incorporate bias reporting of social and non-social factors | Nov 21, 2023 | Benchmarking | —Unverified | 0 |
| Deep State-Space Model for Predicting Cryptocurrency Price | Nov 21, 2023 | BenchmarkingUncertainty Quantification | —Unverified | 0 |
| Segment Together: A Versatile Paradigm for Semi-Supervised Medical Image Segmentation | Nov 20, 2023 | BenchmarkingImage Segmentation | —Unverified | 0 |
| Demonstrating Almost Linear Time Complexity of Bus Admittance Matrix-Based Distribution Network Power Flow: An Empirical Approach | Nov 20, 2023 | Benchmarking | —Unverified | 0 |
| Holistic Inverse Rendering of Complex Facade via Aerial 3D Scanning | Nov 20, 2023 | BenchmarkingInverse Rendering | —Unverified | 0 |
| LABCAT: Locally adaptive Bayesian optimization using principal-component-aligned trust regions | Nov 19, 2023 | Bayesian OptimizationBenchmarking | CodeCode Available | 0 |
| Benchmarking Feature Extractors for Reinforcement Learning-Based Semiconductor Defect Localization | Nov 18, 2023 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Benchmarking Machine Learning Models for Quantum Error Correction | Nov 18, 2023 | Benchmarking | —Unverified | 0 |
| Predicting the Probability of Collision of a Satellite with Space Debris: A Bayesian Machine Learning Approach | Nov 17, 2023 | BenchmarkingCollision Avoidance | —Unverified | 0 |
| Social Bias Probing: Fairness Benchmarking for Language Models | Nov 15, 2023 | BenchmarkingFairness | —Unverified | 0 |
| Domain Aligned CLIP for Few-shot Classification | Nov 15, 2023 | BenchmarkingClassification | —Unverified | 0 |
| Do Localization Methods Actually Localize Memorized Data in LLMs? A Tale of Two Benchmarks | Nov 15, 2023 | BenchmarkingNetwork Pruning | CodeCode Available | 0 |
| Model Agnostic Explainable Selective Regression via Uncertainty Estimation | Nov 15, 2023 | Benchmarkingmodel | —Unverified | 0 |
| Benchmarking Individual Tree Mapping with Sub-meter Imagery | Nov 14, 2023 | BenchmarkingSegmentation | —Unverified | 0 |
| On Using Distribution-Based Compositionality Assessment to Evaluate Compositional Generalisation in Machine Translation | Nov 14, 2023 | BenchmarkingMachine Translation | CodeCode Available | 0 |
| The Disagreement Problem in Faithfulness Metrics | Nov 13, 2023 | BenchmarkingExplainable artificial intelligence | —Unverified | 0 |
| Uncertainty estimation of machine learning spatial precipitation predictions from satellite data | Nov 13, 2023 | BenchmarkingFeature Importance | —Unverified | 0 |