| CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios | Mar 7, 2024 | Audio-visual Question Answering, Audio-Visual Question Answering (AVQA) | Code Available | 2 |
| InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents | Mar 5, 2024 | Benchmarking, Language Modeling | Code Available | 2 |
| Android in the Zoo: Chain-of-Action-Thought for GUI Agents | Mar 5, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Wukong: Towards a Scaling Law for Large-Scale Recommendation | Mar 4, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training | Feb 28, 2024 | In-Context Learning, Language Modeling | Code Available | 2 |
| CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification | Feb 27, 2024 | Classification, Diagnostic | Code Available | 2 |
| TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space | Feb 27, 2024 | Contrastive Learning, Hallucination | Code Available | 2 |
| PALO: A Polyglot Large Multimodal Model for 5B People | Feb 22, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Subobject-level Image Tokenization | Feb 22, 2024 | Attribute, Language Modeling | Code Available | 2 |
| Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning | Feb 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| CoLLaVO: Crayon Large Language and Vision mOdel | Feb 17, 2024 | Large Language Model, model | Code Available | 2 |
| RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model | Feb 16, 2024 | Autonomous Driving, Decision Making | Code Available | 2 |
| ChemReasoner: Heuristic Search over a Large Language Model's Knowledge Space using Quantum-Chemical Feedback | Feb 15, 2024 | Computational chemistry, Graph Neural Network | Code Available | 2 |
| Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | Feb 13, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | Feb 11, 2024 | Graph Question Answering, Instruction Following | Code Available | 2 |
| UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction | Feb 10, 2024 | graph construction, Knowledge Graph Completion | Code Available | 2 |
| Can Large Language Model Agents Simulate Human Trust Behavior? | Feb 7, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph Completion | Feb 4, 2024 | In-Context Learning, Knowledge Graph Completion | Code Available | 2 |
| Jailbreaking Attack against Multimodal Large Language Model | Feb 4, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering | Feb 4, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation | Jan 30, 2024 | Hallucination, Knowledge Distillation | Code Available | 2 |
| EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain | Jan 30, 2024 | Image Comprehension, Instruction Following | Code Available | 2 |
| Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models | Jan 30, 2024 | Data Compression, Language Modelling | Code Available | 2 |
| L-AutoDA: Leveraging Large Language Models for Automated Decision-based Adversarial Attacks | Jan 27, 2024 | Adversarial Attack, Computational Efficiency | Code Available | 2 |
| Towards Goal-oriented Prompt Engineering for Large Language Models: A Survey | Jan 25, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning | Jan 19, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Spatial-Temporal Large Language Model for Traffic Prediction | Jan 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap | Jan 18, 2024 | Code Generation, Evolutionary Algorithms | Code Available | 2 |
| SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model | Jan 18, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| TechGPT-2.0: A large language model project to solve the task of knowledge graph construction | Jan 9, 2024 | graph construction, Language Modeling | Code Available | 2 |
| SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems | Jan 8, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Malla: Demystifying Real-world Large Language Model Integrated Malicious Services | Jan 6, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge | Jan 1, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference | Dec 23, 2023 | GPU, High-Level Synthesis | Code Available | 2 |
| Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy | Dec 20, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Machine Mindset: An MBTI Exploration of Large Language Models | Dec 20, 2023 | Large Language Model, Personality Alignment | Code Available | 2 |
| Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach | Dec 19, 2023 | Language Modelling, Large Language Model | Code Available | 2 |
| Holodeck: Language Guided Generation of 3D Embodied AI Environments | Dec 14, 2023 | Common Sense Reasoning, Language Modelling | Code Available | 2 |
| mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs | Dec 5, 2023 | GPU, Large Language Model | Code Available | 2 |
| Customization Assistant for Text-to-image Generation | Dec 5, 2023 | Descriptive, Image Generation | Code Available | 2 |
| TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding | Dec 4, 2023 | Dense Captioning, Highlight Detection | Code Available | 2 |
| CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation | Nov 30, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| War and Peace (WarAgent): Large Language Model-based Multi-Agent Simulation of World Wars | Nov 28, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| YUAN 2.0: A Large Language Model with Localized Filtering-based Attention | Nov 27, 2023 | Code Generation, Language Modeling | Code Available | 2 |
| LLMGA: Multimodal Large Language Model based Generation Assistant | Nov 27, 2023 | Image Generation, Language Modeling | Code Available | 2 |
| Algorithm Evolution Using Large Language Model | Nov 26, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design | Nov 23, 2023 | Decision Making, Language Modelling | Code Available | 2 |
| A Survey of Graph Meets Large Language Model: Progress and Future Directions | Nov 21, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings | Nov 9, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Mixture of Tokens: Continuous MoE through Cross-Example Aggregation | Oct 24, 2023 | Language Modelling, Large Language Model | Code Available | 2 |