| PRobELM: Plausibility Ranking Evaluation for Language Models | Apr 4, 2024 | Question Answering, TruthfulQA | Unverified | 0 |
| Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization | Apr 2, 2024 | Memorization, Open-Domain Question Answering | Unverified | 0 |
| LLMTreeRec: Unleashing the Power of Large Language Models for Cold-Start Recommendations | Mar 31, 2024 | Recommendation Systems, Re-Ranking | Code Available | 0 |
| EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs | Mar 30, 2024 | Graph Neural Network, Knowledge Graphs | Code Available | 0 |
| Enhancing Content-based Recommendation via Large Language Model | Mar 30, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| LLMSense: Harnessing LLMs for High-level Reasoning Over Spatiotemporal Sensor Traces | Mar 28, 2024 | Data Summarization, World Knowledge | Unverified | 0 |
| Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent | Mar 28, 2024 | World Knowledge | Code Available | 0 |
| Large Language Models Need Consultants for Reasoning: Becoming an Expert in a Complex Human System Through Behavior Simulation | Mar 27, 2024 | Common Sense Reasoning, World Knowledge | Code Available | 0 |
| Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations | Mar 27, 2024 | Attribute, Diagnostic | Code Available | 0 |
| Large Language Models Enhanced Collaborative Filtering | Mar 26, 2024 | Collaborative Filtering, In-Context Learning | Unverified | 0 |
| Stance Reasoner: Zero-Shot Stance Detection on Social Media with Explicit Reasoning | Mar 22, 2024 | Few-Shot Stance Detection, In-Context Learning | Code Available | 0 |
| Log Probabilities Are a Reliable Estimate of Semantic Plausibility in Base and Instruction-Tuned Language Models | Mar 21, 2024 | Sentence, World Knowledge | Code Available | 0 |
| Informed Spectral Normalized Gaussian Processes for Trajectory Prediction | Mar 18, 2024 | Autonomous Driving, Continual Learning | Unverified | 0 |
| EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents | Mar 18, 2024 | Reinforcement Learning (RL), World Knowledge | Unverified | 0 |
| RetinaQA: A Robust Knowledge Base Question Answering Model for both Answerable and Unanswerable Questions | Mar 16, 2024 | Knowledge Base Question Answering, Question Answering | Code Available | 0 |
| Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | Mar 12, 2024 | Arithmetic Reasoning, Code Generation | Unverified | 0 |
| Language Guided Exploration for RL Agents in Text Environments | Mar 5, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| FKA-Owl: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs | Mar 4, 2024 | Fake News Detection, Image Manipulation | Unverified | 0 |
| Cognition is All You Need -- The Next Layer of AI Above Large Language Models | Mar 4, 2024 | All, World Knowledge | Unverified | 0 |
| Word Order and World Knowledge | Mar 1, 2024 | World Knowledge | Code Available | 0 |
| LLMs for Targeted Sentiment in News Headlines: Exploring the Descriptive-Prescriptive Dilemma | Mar 1, 2024 | Descriptive, In-Context Learning | Unverified | 0 |
| EyeGPT: Ophthalmic Assistant with Large Language Models | Feb 29, 2024 | Retrieval-augmented Generation, World Knowledge | Unverified | 0 |
| AKEW: Assessing Knowledge Editing in the Wild | Feb 29, 2024 | Articles, counterfactual | Code Available | 0 |
| ICE-SEARCH: A Language Model-Driven Feature Selection Approach | Feb 28, 2024 | Diabetes Prediction, Disease Prediction | Unverified | 0 |
| PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering | Feb 26, 2024 | Question Answering, Retrieval | Unverified | 0 |
| Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models | Feb 26, 2024 | Attribute, Fine-Grained Visual Categorization | Unverified | 0 |
| Exploring Failure Cases in Multimodal Reasoning About Physical Dynamics | Feb 24, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition | Feb 22, 2024 | Data Augmentation, few-shot-ner | Unverified | 0 |
| Can Language Models Act as Knowledge Bases at Scale? | Feb 22, 2024 | Natural Language Queries, World Knowledge | Unverified | 0 |
| FLAME: Self-Supervised Low-Resource Taxonomy Expansion using Large Language Models | Feb 21, 2024 | Recommendation Systems, Taxonomy Expansion | Code Available | 0 |
| Lying Blindly: Bypassing ChatGPT's Safeguards to Generate Hard-to-Detect Disinformation Claims | Feb 13, 2024 | World Knowledge | Unverified | 0 |
| GLaM: Fine-Tuning Large Language Models for Domain Knowledge Graph Alignment via Neighborhood Partitioning and Generative Subgraph Encoding | Feb 9, 2024 | Hallucination, Knowledge Graphs | Unverified | 0 |
| Vision-Language Models Provide Promptable Representations for Reinforcement Learning | Feb 5, 2024 | Common Sense Reasoning, Instruction Following | Unverified | 0 |
| MQuinE: a cure for "Z-paradox" in knowledge graph embedding models | Feb 5, 2024 | Graph Embedding, Information Retrieval | Unverified | 0 |
| Open-Universe Indoor Scene Generation using LLM Program Synthesis and Uncurated Object Databases | Feb 5, 2024 | Layout Generation, Object | Unverified | 0 |
| A Note On Lookahead In Real Life And Computing | Feb 2, 2024 | Future prediction, World Knowledge | Unverified | 0 |
| LoRec: Large Language Model for Robust Sequential Recommendation against Poisoning Attacks | Jan 31, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Jan 31, 2024 | Benchmarking, Change Detection | Code Available | 0 |
| GazeGPT: Augmenting Human Capabilities using Gaze-contingent Contextual AI for Smart Eyewear | Jan 30, 2024 | World Knowledge | Unverified | 0 |
| Efficient Tool Use with Chain-of-Abstraction Reasoning | Jan 30, 2024 | Math, Mathematical Reasoning | Unverified | 0 |
| Towards Generating Informative Textual Description for Neurons in Language Models | Jan 30, 2024 | World Knowledge | Unverified | 0 |
| Democratizing Fine-grained Visual Recognition with Large Language Models | Jan 24, 2024 | Fine-Grained Visual Recognition, World Knowledge | Unverified | 0 |
| WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge | Jan 12, 2024 | Multimodal Sentiment Analysis, Sentiment Analysis | Unverified | 0 |
| Enhancing Multilingual Information Retrieval in Mixed Human Resources Environments: A RAG Model Implementation for Multicultural Enterprise | Jan 3, 2024 | Information Retrieval, RAG | Unverified | 0 |
| Open-Vocabulary 3D Semantic Segmentation with Foundation Models | Jan 1, 2024 | 3D Semantic Segmentation, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| Anchoring Path for Inductive Relation Prediction in Knowledge Graphs | Dec 21, 2023 | Inductive Relation Prediction, Knowledge Graphs | Code Available | 0 |
| Typhoon: Thai Large Language Models | Dec 21, 2023 | Question Answering, World Knowledge | Unverified | 0 |
| Implicit Affordance Acquisition via Causal Action-Effect Modeling in the Video Domain | Dec 18, 2023 | World Knowledge | Code Available | 0 |
| Dynamic Retrieval-Augmented Generation | Dec 14, 2023 | abstractive question answering, Code Generation | Unverified | 0 |
| Learning adaptive planning representations with natural language guidance | Dec 13, 2023 | Decision Making, Minecraft | Unverified | 0 |