| Ask-before-Plan: Proactive Language Agents for Real-World Planning | Jun 18, 2024 | Decision Making, valid | Code Available | 1 |
| A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences | Jun 17, 2024 | In-Context Learning, valid | Code Available | 0 |
| Spillover Detection for Donor Selection in Synthetic Control Models | Jun 17, 2024 | Causal Inference, valid | Unverified | 0 |
| CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents | Jun 17, 2024 | Code Generation, Code Search | Code Available | 0 |
| Active, anytime-valid risk controlling prediction sets | Jun 15, 2024 | Prediction, valid | Code Available | 0 |
| Unlocking Large Language Model's Planning Capabilities with Maximum Diversity Fine-tuning | Jun 15, 2024 | Diversity, valid | Unverified | 0 |
| Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions | Jun 14, 2024 | Few-Shot Semantic Segmentation, Semantic Segmentation | Code Available | 1 |
| Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal Prediction, Language Modeling | Code Available | 1 |
| TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners | Jun 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Randomization Inference: Theory and Applications | Jun 13, 2024 | valid | Unverified | 0 |