| Intent Detection in the Age of LLMs | Oct 2, 2024 | Data Augmentation, In-Context Learning | Unverified | 0 |
| "Oh LLM, I'm Asking Thee, Please Give Me a Decision Tree": Zero-Shot Decision Tree Induction and Embedding with Large Language Models | Sep 27, 2024 | Interpretable Machine Learning, World Knowledge | Unverified | 0 |
| "Why" Has the Least Side Effect on Model Editing | Sep 27, 2024 | Experimental Design, knowledge editing | Unverified | 0 |
| Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion | Sep 26, 2024 | Image Generation, In-Context Learning | Code Available | 0 |
| 60 Data Points are Sufficient to Fine-Tune LLMs for Question-Answering | Sep 24, 2024 | Question Answering, World Knowledge | Unverified | 0 |
| Style Outweighs Substance: Failure Modes of LLM Judges in Alignment Benchmarking | Sep 23, 2024 | Benchmarking, Diversity | Code Available | 0 |
| Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models | Sep 22, 2024 | World Knowledge | Unverified | 0 |
| The X Types -- Mapping the Semantics of the Twitter Sphere | Sep 22, 2024 | Type prediction, World Knowledge | Unverified | 0 |
| Relevance-driven Decision Making for Safer and More Efficient Human Robot Collaboration | Sep 21, 2024 | Collision Avoidance, Decision Making | Unverified | 0 |
| Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time | Sep 20, 2024 | Benchmarking, World Knowledge | Unverified | 0 |