| MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design | Dec 20, 2024 | valid | —Unverified | 0 |
| Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying | Dec 19, 2024 | MathMathematical Reasoning | CodeCode Available | 0 |
| Label Errors in the Tobacco3482 Dataset | Dec 17, 2024 | Document Classificationvalid | CodeCode Available | 0 |
| Beyond Accuracy: On the Effects of Fine-tuning Towards Vision-Language Model's Prediction Rationality | Dec 17, 2024 | Predictionvalid | CodeCode Available | 0 |
| Constructing Confidence Intervals for Average Treatment Effects from Multiple Datasets | Dec 16, 2024 | Uncertainty Quantificationvalid | CodeCode Available | 0 |
| Common Ground, Diverse Roots: The Difficulty of Classifying Common Examples in Spanish Varieties | Dec 16, 2024 | FairnessHate Speech Detection | —Unverified | 0 |
| On the Role of Surrogates in Conformal Inference of Individual Causal Effects | Dec 16, 2024 | Causal InferenceConformal Prediction | CodeCode Available | 0 |
| Learning Structural Causal Models from Ordering: Identifiable Flow Models | Dec 13, 2024 | Causal Inferencecounterfactual | —Unverified | 0 |
| Direct Encoding of Declare Constraints in ASP | Dec 13, 2024 | valid | CodeCode Available | 0 |
| Temporal Numeric Planning with Patterns | Dec 12, 2024 | valid | —Unverified | 0 |