| Answering Ambiguous Questions via Iterative Prompting | Jul 8, 2023 | Diversity, Open-Domain Question Answering | Code Available | 1 |
| KonIQ-10k: An ecologically valid database for deep learning of blind image quality assessment | Oct 14, 2019 | Diversity, Image Quality Assessment | Code Available | 1 |
| Comparing Sequential Forecasters | Sep 30, 2021 | valid | Code Available | 1 |
| Comparing Experimental and Nonexperimental Methods: What Lessons Have We Learned Four Decades After LaLonde (1986)? | Jun 2, 2024 | valid | Code Available | 1 |
| Large Language Models Are Neurosymbolic Reasoners | Jan 17, 2024 | Common Sense Reasoning, Math | Code Available | 1 |
| An Efficient Adversarial Attack for Tree Ensembles | Oct 22, 2020 | Adversarial Attack, valid | Code Available | 1 |
| Large Language Models for Automated Open-domain Scientific Hypotheses Discovery | Sep 6, 2023 | valid | Code Available | 1 |
| Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal Prediction, Language Modeling | Code Available | 1 |
| An efficient graph generative model for navigating ultra-large combinatorial synthesis libraries | Oct 19, 2022 | Decoder, Drug Discovery | Code Available | 1 |
| Counterfactual Data Augmentation using Locally Factored Dynamics | Jul 6, 2020 | counterfactual, Data Augmentation | Code Available | 1 |