| MeaCap: Memory-Augmented Zero-shot Image Captioning | Mar 6, 2024 | Caption GenerationImage Captioning | CodeCode Available | 2 |
| WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition | Feb 22, 2024 | Image-level Supervised Instance Segmentationobject-detection | CodeCode Available | 2 |
| GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering | Feb 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Can AI Assistants Know What They Don't Know? | Jan 24, 2024 | MathOpen-Domain Question Answering | CodeCode Available | 2 |
| V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs | Dec 21, 2023 | Visual Question AnsweringWorld Knowledge | CodeCode Available | 2 |
| LoRAMoE: Alleviate World Knowledge Forgetting in Large Language Models via MoE-Style Plugin | Dec 15, 2023 | Language ModellingMixture-of-Experts | CodeCode Available | 2 |
| CapsFusion: Rethinking Image-Text Data at Scale | Oct 31, 2023 | World Knowledge | CodeCode Available | 2 |
| FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation | Oct 5, 2023 | HallucinationWorld Knowledge | CodeCode Available | 2 |
| Grasp-Anything: Large-scale Grasp Dataset from Foundation Models | Sep 18, 2023 | DiversityRobotic Grasping | CodeCode Available | 2 |
| Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations | Aug 23, 2023 | BenchmarkingDecoder | CodeCode Available | 2 |
| ExpeL: LLM Agents Are Experiential Learners | Aug 20, 2023 | Decision MakingTransfer Learning | CodeCode Available | 2 |
| RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit | Jun 8, 2023 | Answer GenerationFact Checking | CodeCode Available | 2 |
| ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human | Apr 16, 2023 | World Knowledge | CodeCode Available | 2 |
| PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change | Jun 21, 2022 | Common Sense ReasoningDiversity | CodeCode Available | 2 |
| GreaseLM: Graph REASoning Enhanced Language Models for Question Answering | Jan 21, 2022 | Knowledge GraphsMedical Question Answering | CodeCode Available | 2 |
| Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents | Jan 18, 2022 | Robot Task PlanningWorld Knowledge | CodeCode Available | 2 |
| Measuring Massive Multitask Language Understanding | Sep 7, 2020 | Elementary MathematicsMulti-task Language Understanding | CodeCode Available | 2 |
| Aligning AI With Shared Human Values | Aug 5, 2020 | Ethicsreinforcement-learning | CodeCode Available | 2 |
| A Survey on Knowledge Graphs: Representation, Acquisition and Applications | Feb 2, 2020 | Graph EmbeddingGraph Representation Learning | CodeCode Available | 2 |
| GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains | May 24, 2025 | geo-localizationVisual Reasoning | CodeCode Available | 1 |
| O^2-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering | May 22, 2025 | Answer GenerationOpen-Ended Question Answering | CodeCode Available | 1 |
| WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion | Apr 18, 2025 | Contrastive LearningDenoising | CodeCode Available | 1 |
| F-ViTA: Foundation Model Guided Visible to Thermal Translation | Apr 3, 2025 | Scene UnderstandingStyle Transfer | CodeCode Available | 1 |
| Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors | Mar 26, 2025 | Depth EstimationWorld Knowledge | CodeCode Available | 1 |
| Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training | Mar 19, 2025 | Image DehazingWorld Knowledge | CodeCode Available | 1 |
| FusDreamer: Label-efficient Remote Sensing World Model for Multimodal Data Classification | Mar 18, 2025 | Combinatorial OptimizationContrastive Learning | CodeCode Available | 1 |
| Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement | Jan 21, 2025 | Synthetic Data GenerationWorld Knowledge | CodeCode Available | 1 |
| VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models | Jan 9, 2025 | BenchmarkingMathematical Problem-Solving | CodeCode Available | 1 |
| An Automatic Graph Construction Framework based on Large Language Models for Recommendation | Dec 24, 2024 | graph constructionQuantization | CodeCode Available | 1 |
| Knowledge Editing through Chain-of-Thought | Dec 23, 2024 | knowledge editingWorld Knowledge | CodeCode Available | 1 |
| Bridging the User-side Knowledge Gap in Knowledge-aware Recommendations with Large Language Models | Dec 18, 2024 | Contrastive LearningKnowledge Graphs | CodeCode Available | 1 |
| Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs | Dec 10, 2024 | Knowledge GraphsRAG | CodeCode Available | 1 |
| I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token | Dec 9, 2024 | World Knowledge | CodeCode Available | 1 |
| Retrieval-Augmented Machine Translation with Unstructured Knowledge | Dec 5, 2024 | Knowledge GraphsMachine Translation | CodeCode Available | 1 |
| LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content | Oct 14, 2024 | Visual Question Answering (VQA)World Knowledge | CodeCode Available | 1 |
| LLM Embeddings Improve Test-time Adaptation to Tabular Y|X-Shifts | Oct 9, 2024 | Test-time AdaptationWorld Knowledge | CodeCode Available | 1 |
| CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models | Sep 27, 2024 | Reinforcement Learning (RL)World Knowledge | CodeCode Available | 1 |
| Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement | Sep 17, 2024 | Active LearningDiversity | CodeCode Available | 1 |
| Can OOD Object Detectors Learn from Foundation Models? | Sep 8, 2024 | Objectobject-detection | CodeCode Available | 1 |
| AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic Framework | Aug 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| BLADE: Benchmarking Language Model Agents for Data-Driven Science | Aug 19, 2024 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Jul 10, 2024 | counterfactualFact Checking | CodeCode Available | 1 |
| Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs | May 29, 2024 | Image RetrievalQuestion Answering | CodeCode Available | 1 |
| Large Scale Knowledge Washing | May 26, 2024 | DecoderMemorization | CodeCode Available | 1 |
| Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models | May 24, 2024 | knowledge editingWorld Knowledge | CodeCode Available | 1 |
| Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models | May 15, 2024 | AI AgentWorld Knowledge | CodeCode Available | 1 |
| PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning | May 10, 2024 | DecoderGeneralization Bounds | CodeCode Available | 1 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | May 9, 2024 | Data VisualizationLanguage Modeling | CodeCode Available | 1 |
| LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | May 7, 2024 | Collaborative FilteringLanguage Modeling | CodeCode Available | 1 |
| A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models | Apr 22, 2024 | BenchmarkingWorld Knowledge | CodeCode Available | 1 |