| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Mar 20, 2025 | 2D Object Detection, Distributed Computing | Code Available | 1 | 5 |
| "Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation | Dec 18, 2023 | Hallucination, Language Modelling | Code Available | 1 | 5 |
| Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering | Jan 1, 2025 | Large Language Model, Multimodal Large Language Model | Code Available | 1 | 5 |
| NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering | May 26, 2025 | Chunking, Large Language Model | Code Available | 1 | 5 |
| ChemMLLM: Chemical Multimodal Large Language Model | May 22, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| ChemLLM: A Chemical Large Language Model | Feb 10, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior | May 21, 2025 | Large Language Model, Management | Code Available | 1 | 5 |
| SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models | Jul 20, 2023 | Benchmarking, Language Modeling | Code Available | 1 | 5 |
| Dissecting Human and LLM Preferences | Feb 17, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model | May 1, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| A semantic embedding space based on large language models for modelling human beliefs | Aug 13, 2024 | Decision Making, Language Modeling | Code Available | 1 | 5 |
| nicolay-r at SemEval-2024 Task 3: Using Flan-T5 for Reasoning Emotion Cause in Conversations with Chain-of-Thought on Emotion States | Apr 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model | Nov 16, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model | Mar 31, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |
| Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion | May 26, 2025 | Denoising, Image Generation | Code Available | 1 | 5 |
| Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Sep 18, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |
| ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Nov 6, 2023 | Decision Making, Language Modeling | Code Available | 1 | 5 |
| Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach | May 30, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections | Nov 17, 2023 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions | Aug 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception | Mar 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space | Feb 26, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| NL4Opt Competition: Formulating Optimization Problems Based on Their Natural Language Descriptions | Mar 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| On AI-Inspired UI-Design | Jun 19, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Democratizing Reasoning Ability: Tailored Learning from Large Language Model | Oct 20, 2023 | Instruction Following, Language Modeling | Code Available | 1 | 5 |
| MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models | Aug 30, 2024 | Image Captioning, Language Modeling | Code Available | 1 | 5 |
| DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments | May 31, 2025 | Large Language Model | Code Available | 1 | 5 |
| IDEA-Bench: How Far are Generative Models from Professional Designing? | Dec 16, 2024 | Large Language Model, Multimodal Large Language Model | Code Available | 1 | 5 |
| DesCo: Learning Object Recognition with Rich Language Descriptions | Jun 24, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Multi-label Sequential Sentence Classification via Large Language Model | Nov 23, 2024 | Contrastive Learning, Extractive Summarization | Code Available | 1 | 5 |
| Multimodal AI predicts clinical outcomes of drug combinations from preclinical data | Mar 4, 2025 | Large Language Model | Code Available | 1 | 5 |
| Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models | Nov 1, 2024 | Decision Making, Informativeness | Code Available | 1 | 5 |
| Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V | Oct 29, 2023 | Diagnostic, Language Modeling | Code Available | 1 | 5 |
| Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval | Mar 29, 2025 | All, Language Modeling | Code Available | 1 | 5 |
| DeepInception: Hypnotize Large Language Model to Be Jailbreaker | Nov 6, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search | Feb 20, 2025 | AutoML, Code Generation | Code Available | 1 | 5 |
| Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection | Dec 15, 2022 | Deep Learning, Graph Learning | Code Available | 1 | 5 |
| Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives | Apr 17, 2024 | Contrastive Learning, Image Retrieval | Code Available | 1 | 5 |
| MSE-Adapter: A Lightweight Plugin Endowing LLMs with the Capability to Perform Multimodal Sentiment Analysis and Emotion Recognition | Feb 18, 2025 | Emotion Recognition, Large Language Model | Code Available | 1 | 5 |
| Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder | Jun 28, 2025 | Image Segmentation, Large Language Model | Code Available | 1 | 5 |
| MuLan: Multimodal-LLM Agent for Progressive and Interactive Multi-Object Diffusion | Feb 20, 2024 | Attribute, Language Modeling | Code Available | 1 | 5 |
| M^2Chat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation | Nov 29, 2023 | Image Generation, Language Modelling | Code Available | 1 | 5 |
| MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning | Aug 21, 2024 | image-classification, Image Classification | Code Available | 1 | 5 |
| In-context Autoencoder for Context Compression in a Large Language Model | Jul 13, 2023 | GPU, Language Modeling | Code Available | 1 | 5 |
| Multi-Modal Classifiers for Open-Vocabulary Object Detection | Jun 8, 2023 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Dataset Distillation via Vision-Language Category Prototype | Jun 30, 2025 | Dataset Distillation, Descriptive | Code Available | 1 | 5 |
| Motif: Intrinsic Motivation from Artificial Intelligence Feedback | Sep 29, 2023 | Decision Making, Language Modeling | Code Available | 1 | 5 |
| Inference with Reference: Lossless Acceleration of Large Language Models | Apr 10, 2023 | Decoder, Language Modeling | Code Available | 1 | 5 |
| DebUnc: Improving Large Language Model Agent Communication With Uncertainty Metrics | Jul 8, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Jul 16, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |