| All Entities are Not Created Equal: Examining the Long Tail for Fine-Grained Entity Typing | Oct 22, 2024 | AllEntity Typing | —Unverified | 0 |
| Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic | Oct 21, 2024 | Formal LogicWorld Knowledge | —Unverified | 0 |
| Roadmap towards Superhuman Speech Understanding using Large Language Models | Oct 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Comprehending Knowledge Graphs with Large Language Models for Recommender Systems | Oct 16, 2024 | Knowledge-Aware RecommendationKnowledge Graphs | —Unverified | 0 |
| Understanding the Role of LLMs in Multimodal Evaluation Benchmarks | Oct 16, 2024 | BenchmarkingLarge Language Model | CodeCode Available | 0 |
| KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities | Oct 15, 2024 | Image GenerationRetrieval | —Unverified | 0 |
| DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities | Oct 10, 2024 | Document RankingEntity Embeddings | CodeCode Available | 0 |
| TVBench: Redesigning Video-Language Evaluation | Oct 10, 2024 | Multiple-choiceOpen-Ended Question Answering | —Unverified | 0 |
| Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance? | Oct 9, 2024 | In-Context LearningLogical Reasoning | CodeCode Available | 0 |
| SEAL: SEmantic-Augmented Imitation Learning via Language Model | Oct 3, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| Intent Detection in the Age of LLMs | Oct 2, 2024 | Data AugmentationIn-Context Learning | —Unverified | 0 |
| "Oh LLM, I'm Asking Thee, Please Give Me a Decision Tree": Zero-Shot Decision Tree Induction and Embedding with Large Language Models | Sep 27, 2024 | Interpretable Machine LearningWorld Knowledge | —Unverified | 0 |
| "Why" Has the Least Side Effect on Model Editing | Sep 27, 2024 | Experimental Designknowledge editing | —Unverified | 0 |
| Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion | Sep 26, 2024 | Image GenerationIn-Context Learning | CodeCode Available | 0 |
| 60 Data Points are Sufficient to Fine-Tune LLMs for Question-Answering | Sep 24, 2024 | Question AnsweringWorld Knowledge | —Unverified | 0 |
| Style Outweighs Substance: Failure Modes of LLM Judges in Alignment Benchmarking | Sep 23, 2024 | BenchmarkingDiversity | CodeCode Available | 0 |
| Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models | Sep 22, 2024 | World Knowledge | —Unverified | 0 |
| The X Types -- Mapping the Semantics of the Twitter Sphere | Sep 22, 2024 | Type predictionWorld Knowledge | —Unverified | 0 |
| Relevance-driven Decision Making for Safer and More Efficient Human Robot Collaboration | Sep 21, 2024 | Collision AvoidanceDecision Making | —Unverified | 0 |
| Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time | Sep 20, 2024 | BenchmarkingWorld Knowledge | —Unverified | 0 |
| Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark | Sep 13, 2024 | Sequential Decision MakingWorld Knowledge | —Unverified | 0 |
| Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles | Sep 10, 2024 | Autonomous VehiclesLanguage Modeling | —Unverified | 0 |
| How Does Code Pretraining Affect Language Model Task Performance? | Sep 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Physical Rule-Guided Convolutional Neural Network | Sep 3, 2024 | World Knowledge | —Unverified | 0 |
| CV-Probes: Studying the interplay of lexical and world knowledge in visually grounded verb understanding | Sep 2, 2024 | World Knowledge | —Unverified | 0 |