| Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting | Oct 1, 2024 | Continual LearningLanguage Modeling | CodeCode Available | 1 |
| Exploring Empty Spaces: Human-in-the-Loop Data Augmentation | Oct 1, 2024 | Data AugmentationDiversity | CodeCode Available | 1 |
| OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection | Sep 30, 2024 | DiversityKeypoint Detection | CodeCode Available | 1 |
| VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs | Sep 30, 2024 | EgoSchemaLanguage Modelling | CodeCode Available | 1 |
| DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving | Sep 26, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 1 |
| Counterfactual Token Generation in Large Language Models | Sep 25, 2024 | Bias Detectioncounterfactual | CodeCode Available | 1 |
| DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling | Sep 25, 2024 | Data AugmentationDiversity | CodeCode Available | 1 |
| LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation | Sep 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ShizishanGPT: An Agricultural Large Language Model Integrating Tools and Resources | Sep 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning | Sep 19, 2024 | Change DetectionDecoder | CodeCode Available | 1 |
| Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Sep 18, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| LOLA -- An Open-Source Massively Multilingual Large Language Model | Sep 17, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Enhancing RL Safety with Counterfactual LLM Reasoning | Sep 16, 2024 | counterfactualLanguage Modeling | CodeCode Available | 1 |
| Symbolic Regression with a Learned Concept Library | Sep 14, 2024 | Evolutionary AlgorithmsLanguage Modeling | CodeCode Available | 1 |
| WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | Sep 12, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Model | Sep 11, 2024 | Data-to-Text GenerationGraph-to-Sequence | CodeCode Available | 1 |
| AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge | Sep 11, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| TextToucher: Fine-Grained Text-to-Touch Generation | Sep 9, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| Sparse Rewards Can Self-Train Dialogue Agents | Sep 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FuzzCoder: Byte-level Fuzzing Test via Large Language Model | Sep 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces | Sep 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information | Sep 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models | Aug 30, 2024 | Image CaptioningLanguage Modeling | CodeCode Available | 1 |
| Legilimens: Practical and Unified Content Moderation for Large Language Model Services | Aug 28, 2024 | Data AugmentationLanguage Modeling | CodeCode Available | 1 |
| XG-NID: Dual-Modality Network Intrusion Detection using a Heterogeneous Graph Neural Network and Large Language Model | Aug 27, 2024 | Graph Neural NetworkIntrusion Detection | CodeCode Available | 1 |
| AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic Framework | Aug 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos | Aug 26, 2024 | FormLanguage Modelling | CodeCode Available | 1 |
| IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities | Aug 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models | Aug 23, 2024 | Contrastive LearningLanguage Modelling | CodeCode Available | 1 |
| MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning | Aug 21, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical Researcher | Aug 21, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| ProteinGPT: Multimodal LLM for Protein Property Prediction and Structure Understanding | Aug 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval | Aug 20, 2024 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 |
| HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models | Aug 20, 2024 | GPULanguage Modelling | CodeCode Available | 1 |
| FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant | Aug 19, 2024 | DescriptiveFace Swapping | CodeCode Available | 1 |
| Harnessing Multimodal Large Language Models for Multimodal Sequential Recommendation | Aug 19, 2024 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models | Aug 19, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit | Aug 19, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 |
| HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model | Aug 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A semantic embedding space based on large language models for modelling human beliefs | Aug 13, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Prompto: An open source library for asynchronous querying of LLM endpoints | Aug 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data | Aug 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PhishLang: A Real-Time, Fully Client-Side Phishing Detection Framework Using MobileBERT | Aug 11, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| ViC: Virtual Compiler Is All You Need For Assembly Code Search | Aug 10, 2024 | AllCode Search | CodeCode Available | 1 |
| Unsupervised Episode Detection for Large-Scale News Events | Aug 9, 2024 | ArticlesEvent Detection | CodeCode Available | 1 |
| Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling | Aug 7, 2024 | Image GenerationLanguage Modelling | CodeCode Available | 1 |
| WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models | Aug 7, 2024 | AI and SafetyBenchmarking | CodeCode Available | 1 |
| ULLME: A Unified Framework for Large Language Model Embeddings with Generation-Augmented Learning | Aug 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Citekit: A Modular Toolkit for Large Language Model Citation Generation | Aug 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model | Aug 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |