| Assessing the Reliability of Large Language Model Knowledge | Oct 15, 2023 | Hallucination, Knowledge Probing | Code Available | 0 |
| Configuration Validation with Large Language Models | Oct 15, 2023 | Code Generation, Few-Shot Learning | Unverified | 0 |
| Inference with Mondrian Random Forests | Oct 15, 2023 | regression, valid | Unverified | 0 |
| Chatbot-supported Thesis Writing: An Autoethnographic Report | Oct 14, 2023 | Chatbot, Language Modelling | Unverified | 0 |
| Model-Agnostic Covariate-Assisted Inference on Partially Identified Causal Effects | Oct 12, 2023 | Causal Inference, valid | Code Available | 0 |
| Kernel-Elastic Autoencoder for Molecular Design | Oct 12, 2023 | Diversity, valid | Unverified | 0 |
| DiscoMatch: Fast Discrete Optimisation for Geometrically Consistent 3D Shape Matching | Oct 12, 2023 | valid | Code Available | 1 |
| ADMEOOD: Out-of-Distribution Benchmark for Drug Property Prediction | Oct 11, 2023 | Domain Generalization, Prediction | Unverified | 0 |
| Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding | Oct 10, 2023 | Math, valid | Code Available | 1 |
| HYVE: Hybrid Vertex Encoder for Neural Distance Fields | Oct 10, 2023 | 3D geometry, Decoder | Unverified | 0 |