| Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions | Aug 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents | Aug 2, 2024 | Code Generation, Large Language Model | Code Available | 1 |
| AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation | Aug 1, 2024 | Diversity, Language Modeling | Code Available | 1 |
| Fuzz-Testing Meets LLM-Based Agents: An Automated and Efficient Framework for Jailbreaking Text-To-Image Generation Models | Aug 1, 2024 | Image Generation, In-Context Learning | Code Available | 1 |
| LADDER: Language Driven Slice Discovery and Error Rectification | Jul 31, 2024 | Attribute, Clustering | Code Available | 1 |
| CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning | Jul 30, 2024 | Contrastive Learning, Diagnostic | Code Available | 1 |
| Wonderful Team: Zero-Shot Physical Task Planning with Visual LLMs | Jul 26, 2024 | Action Generation, Large Language Model | Code Available | 1 |
| Cost-effective Instruction Learning for Pathology Vision and Language Analysis | Jul 25, 2024 | Few-Shot Learning, Language Modelling | Code Available | 1 |
| INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model | Jul 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design | Jul 22, 2024 | Image Generation, Language Modelling | Code Available | 1 |
| LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models | Jul 22, 2024 | Data Augmentation, Language Modeling | Code Available | 1 |
| ViLLa: Video Reasoning Segmentation with Large Language Model | Jul 18, 2024 | Image Segmentation, Language Modeling | Code Available | 1 |
| EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing | Jul 18, 2024 | Instruction Following, Language Modeling | Code Available | 1 |
| InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains | Jul 16, 2024 | Decision Making, Language Modeling | Code Available | 1 |
| On Large Language Model Continual Unlearning | Jul 14, 2024 | Disentanglement, Language Modeling | Code Available | 1 |
| Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Jul 12, 2024 | Collaborative Inference, Language Modelling | Code Available | 1 |
| Incorporating Large Language Models into Production Systems for Enhanced Task Automation and Flexibility | Jul 11, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding | Jul 11, 2024 | EEG, Language Modeling | Code Available | 1 |
| Open-world Multi-label Text Classification with Extremely Weak Supervision | Jul 8, 2024 | Keyword Extraction, Language Modelling | Code Available | 1 |
| DebUnc: Improving Large Language Model Agent Communication With Uncertainty Metrics | Jul 8, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Large language models are good medical coders, if provided with tools | Jul 6, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking | Jul 5, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs | Jul 5, 2024 | General Knowledge, Instruction Following | Code Available | 1 |
| WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System | Jul 4, 2024 | Event Detection, Language Modeling | Code Available | 1 |
| Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models | Jul 3, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model | Jul 1, 2024 | Knowledge Tracing, Language Modeling | Code Available | 1 |
| Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | Jul 1, 2024 | AUDIO-VISUAL QUESTION ANSWERING (MUSIC-AVQA-v2.0), Fact Checking | Code Available | 1 |
| MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment | Jun 28, 2024 | Answer Generation, Image Captioning | Code Available | 1 |
| A Refer-and-Ground Multimodal Large Language Model for Biomedicine | Jun 26, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale | Jun 25, 2024 | ARC, Language Modeling | Code Available | 1 |
| Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Jun 25, 2024 | GPU, Language Modeling | Code Available | 1 |
| The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators | Jun 25, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| CogMG: Collaborative Augmentation Between Large Language Model and Knowledge Graph | Jun 25, 2024 | Knowledge Graph Completion, Knowledge Graphs | Code Available | 1 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| C-LLM: Learn to Check Chinese Spelling Errors Character by Character | Jun 24, 2024 | Chinese Spell Checking, Language Modeling | Code Available | 1 |
| DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution | Jun 24, 2024 | Image Restoration, Image Super-Resolution | Code Available | 1 |
| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | Jun 24, 2024 | Code Generation, HumanEval | Code Available | 1 |
| Safely Learning with Private Data: A Federated Learning Framework for Large Language Model | Jun 21, 2024 | Federated Learning, Language Modeling | Code Available | 1 |
| InternLM-Law: An Open Source Chinese Legal Large Language Model | Jun 21, 2024 | Diversity, Language Modeling | Code Available | 1 |
| SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors | Jun 20, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone Sensors | Jun 20, 2024 | 16k, Instruction Following | Code Available | 1 |
| CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks | Jun 20, 2024 | General Knowledge, Human Dynamics | Code Available | 1 |
| LiveMind: Low-latency Large Language Models with Simultaneous Inference | Jun 20, 2024 | Collaborative Inference, Language Modeling | Code Available | 1 |
| Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs | Jun 20, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation | Jun 19, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| On AI-Inspired UI-Design | Jun 19, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding | Jun 18, 2024 | Attribute, Instruction Following | Code Available | 1 |
| MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL | Jun 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| MolecularGPT: Open Large Language Model (LLM) for Few-Shot Molecular Property Prediction | Jun 18, 2024 | Drug Discovery, Graph Neural Network | Code Available | 1 |