| PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion | Dec 23, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Ask-Before-Detection: Identifying and Mitigating Conformity Bias in LLM-Powered Error Detector for Math Word Problem Solutions | Dec 22, 2024 | GSM8K, Math | Unverified | 0 |
| MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design | Dec 20, 2024 | valid | Unverified | 0 |
| Tests for model misspecification in simulation-based inference: from local distortions to global model checks | Dec 19, 2024 | Anomaly Detection, model | Code Available | 2 |
| Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying | Dec 19, 2024 | Math, Mathematical Reasoning | Code Available | 0 |
| Beyond Accuracy: On the Effects of Fine-tuning Towards Vision-Language Model's Prediction Rationality | Dec 17, 2024 | Prediction, valid | Code Available | 0 |
| Label Errors in the Tobacco3482 Dataset | Dec 17, 2024 | Document Classification, valid | Code Available | 0 |
| RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection | Dec 17, 2024 | 3D Object Detection, Decoder | Code Available | 1 |
| On the Role of Surrogates in Conformal Inference of Individual Causal Effects | Dec 16, 2024 | Causal Inference, Conformal Prediction | Code Available | 0 |
| Constructing Confidence Intervals for Average Treatment Effects from Multiple Datasets | Dec 16, 2024 | Uncertainty Quantification, valid | Code Available | 0 |