| Externally Valid Selection of Experimental Sites via the k-Median Problem | Aug 17, 2024 | valid | Unverified | 0 |
| A Confidence Interval for the $\ell_2$ Expected Calibration Error | Aug 16, 2024 | valid | Code Available | 0 |
| A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models | Aug 16, 2024 | Logical Reasoning, valid | Unverified | 0 |
| An Unsupervised Learning Framework Combined with Heuristics for the Maximum Minimal Cut Problem | Aug 16, 2024 | Combinatorial Optimization, valid | Code Available | 0 |
| Evaluating the Validity of Word-level Adversarial Attacks with Large Language Models | Aug 15, 2024 | Adversarial Attack, Language Modeling | Code Available | 0 |
| QirK: Question Answering via Intermediate Representation on Knowledge Graphs | Aug 14, 2024 | Knowledge Graphs, Question Answering | Unverified | 0 |
| Defining and Measuring Disentanglement for non-Independent Factors of Variation | Aug 13, 2024 | Disentanglement, Representation Learning | Unverified | 0 |
| Design Proteins Using Large Language Models: Enhancements and Comparative Analyses | Aug 12, 2024 | valid | Code Available | 0 |
| Approximating Discrimination Within Models When Faced With Several Non-Binary Sensitive Attributes | Aug 12, 2024 | Attribute, Fairness | Code Available | 0 |
| People over trust AI-generated medical responses and view them to be as valid as doctors, despite low accuracy | Aug 11, 2024 | Large Language Model, valid | Unverified | 0 |