| All Entities are Not Created Equal: Examining the Long Tail for Fine-Grained Entity Typing | Oct 22, 2024 | AllEntity Typing | —Unverified | 0 |
| Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic | Oct 21, 2024 | Formal LogicWorld Knowledge | —Unverified | 0 |
| Roadmap towards Superhuman Speech Understanding using Large Language Models | Oct 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Understanding the Role of LLMs in Multimodal Evaluation Benchmarks | Oct 16, 2024 | BenchmarkingLarge Language Model | CodeCode Available | 0 |
| Comprehending Knowledge Graphs with Large Language Models for Recommender Systems | Oct 16, 2024 | Knowledge-Aware RecommendationKnowledge Graphs | —Unverified | 0 |
| KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities | Oct 15, 2024 | Image GenerationRetrieval | —Unverified | 0 |
| DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities | Oct 10, 2024 | Document RankingEntity Embeddings | CodeCode Available | 0 |
| TVBench: Redesigning Video-Language Evaluation | Oct 10, 2024 | Multiple-choiceOpen-Ended Question Answering | —Unverified | 0 |
| Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance? | Oct 9, 2024 | In-Context LearningLogical Reasoning | CodeCode Available | 0 |
| SEAL: SEmantic-Augmented Imitation Learning via Language Model | Oct 3, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| Intent Detection in the Age of LLMs | Oct 2, 2024 | Data AugmentationIn-Context Learning | —Unverified | 0 |
| "Oh LLM, I'm Asking Thee, Please Give Me a Decision Tree": Zero-Shot Decision Tree Induction and Embedding with Large Language Models | Sep 27, 2024 | Interpretable Machine LearningWorld Knowledge | —Unverified | 0 |
| "Why" Has the Least Side Effect on Model Editing | Sep 27, 2024 | Experimental Designknowledge editing | —Unverified | 0 |
| Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion | Sep 26, 2024 | Image GenerationIn-Context Learning | CodeCode Available | 0 |
| 60 Data Points are Sufficient to Fine-Tune LLMs for Question-Answering | Sep 24, 2024 | Question AnsweringWorld Knowledge | —Unverified | 0 |
| Style Outweighs Substance: Failure Modes of LLM Judges in Alignment Benchmarking | Sep 23, 2024 | BenchmarkingDiversity | CodeCode Available | 0 |
| The X Types -- Mapping the Semantics of the Twitter Sphere | Sep 22, 2024 | Type predictionWorld Knowledge | —Unverified | 0 |
| Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models | Sep 22, 2024 | World Knowledge | —Unverified | 0 |
| Relevance-driven Decision Making for Safer and More Efficient Human Robot Collaboration | Sep 21, 2024 | Collision AvoidanceDecision Making | —Unverified | 0 |
| Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time | Sep 20, 2024 | BenchmarkingWorld Knowledge | —Unverified | 0 |
| Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark | Sep 13, 2024 | Sequential Decision MakingWorld Knowledge | —Unverified | 0 |
| Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles | Sep 10, 2024 | Autonomous VehiclesLanguage Modeling | —Unverified | 0 |
| How Does Code Pretraining Affect Language Model Task Performance? | Sep 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Physical Rule-Guided Convolutional Neural Network | Sep 3, 2024 | World Knowledge | —Unverified | 0 |
| CV-Probes: Studying the interplay of lexical and world knowledge in visually grounded verb understanding | Sep 2, 2024 | World Knowledge | —Unverified | 0 |
| Novel-WD: Exploring acquisition of Novel World Knowledge in LLMs Using Prefix-Tuning | Aug 30, 2024 | Causal Language ModelingContinual Learning | —Unverified | 0 |
| Zero-Shot Visual Reasoning by Vision-Language Models: Benchmarking and Analysis | Aug 27, 2024 | BenchmarkingLarge Language Model | —Unverified | 0 |
| Exploring the Potential of Large Language Models for Heterophilic Graphs | Aug 26, 2024 | Node ClassificationWorld Knowledge | —Unverified | 0 |
| To Code, or Not To Code? Exploring Impact of Code in Pre-training | Aug 20, 2024 | Code GenerationWorld Knowledge | —Unverified | 0 |
| Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models | Aug 20, 2024 | Music RecommendationRecommendation Systems | —Unverified | 0 |
| CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendation | Aug 20, 2024 | Collaborative FilteringGeneral Knowledge | —Unverified | 0 |
| CoDi: Conversational Distillation for Grounded Question Answering | Aug 20, 2024 | Question AnsweringWorld Knowledge | —Unverified | 0 |
| On the Necessity of World Knowledge for Mitigating Missing Labels in Extreme Classification | Aug 18, 2024 | ImputationMissing Labels | CodeCode Available | 0 |
| A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models | Aug 16, 2024 | Logical Reasoningvalid | —Unverified | 0 |
| Prompt Tuning as User Inherent Profile Inference Machine | Aug 13, 2024 | QuantizationRecommendation Systems | —Unverified | 0 |
| MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty | Aug 13, 2024 | Mathematical ReasoningQuestion Answering | CodeCode Available | 0 |
| LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description | Aug 9, 2024 | DiversityInstruction Following | CodeCode Available | 0 |
| Better Alignment with Instruction Back-and-Forth Translation | Aug 8, 2024 | DiversityTranslation | —Unverified | 0 |
| Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation | Aug 7, 2024 | Logical ReasoningRecommendation Systems | —Unverified | 0 |
| CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge | Jul 30, 2024 | In-Context LearningKnowledge Graphs | —Unverified | 0 |
| Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models | Jul 28, 2024 | World Knowledge | —Unverified | 0 |
| DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models | Jul 24, 2024 | Retrieval-augmented GenerationWorld Knowledge | CodeCode Available | 0 |
| Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models | Jul 22, 2024 | DisentanglementQuestion Answering | CodeCode Available | 0 |
| Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data | Jul 20, 2024 | Language ModellingMachine Translation | —Unverified | 0 |
| LoFTI: Localization and Factuality Transfer to Indian Locales | Jul 16, 2024 | World Knowledge | CodeCode Available | 0 |
| VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | Jul 9, 2024 | Autonomous DrivingImage to 3D | —Unverified | 0 |
| BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization | Jun 30, 2024 | Continual LearningGeneral Knowledge | —Unverified | 0 |
| Mental Modeling of Reinforcement Learning Agents by Language Models | Jun 26, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| LABOR-LLM: Language-Based Occupational Representations with Large Language Models | Jun 25, 2024 | In-Context LearningJob Prediction | —Unverified | 0 |
| Mitigating Hallucination in Fictional Character Role-Play | Jun 25, 2024 | HallucinationWorld Knowledge | CodeCode Available | 0 |