| Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Jun 3, 2024 | GPU, Language Modeling | Code Available | 2 |
| The Geometry of Categorical and Hierarchical Concepts in Large Language Models | Jun 3, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Query2CAD: Generating CAD models using natural language queries | May 31, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| LLaMEA: A Large Language Model Evolutionary Algorithm for Automatically Generating Metaheuristics | May 30, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models | May 29, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| Aligning to Thousands of Preferences via System Message Generalization | May 28, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model | May 27, 2024 | Decoder, Language Modeling | Code Available | 2 |
| A Survey of Multimodal Large Language Model from A Data-centric Perspective | May 26, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| LM4LV: A Frozen Large Language Model for Low-level Vision Tasks | May 24, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Large language models can be zero-shot anomaly detectors for time series? | May 23, 2024 | Anomaly Detection, Language Modeling | Code Available | 2 |
| Vikhr: Constructing a State-of-the-art Bilingual Open-Source Instruction-Following Large Language Model for Russian | May 22, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous Vehicles | May 22, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 |
| Libra: Building Decoupled Vision System on Large Language Models | May 16, 2024 | Image to text, Language Modeling | Code Available | 2 |
| PLeak: Prompt Leaking Attacks against Large Language Model Applications | May 10, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models | May 8, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID | May 8, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Explainable Fake News Detection With Large Language Model via Defense Among Competing Wisdom | May 6, 2024 | Fake News Detection, Language Modeling | Code Available | 2 |
| A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model | May 3, 2024 | Decision Making, Few-Shot Learning | Code Available | 2 |
| WorldGPT: Empowering LLM as Multimodal World Model | Apr 28, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Paint by Inpaint: Learning to Add Image Objects by Removing Them First | Apr 28, 2024 | Image Inpainting, Language Modeling | Code Available | 2 |
| PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games | Apr 26, 2024 | Decision Making, Language Modeling | Code Available | 2 |
| An empirical study of LLaMA3 quantization: from LLMs to MLLMs | Apr 22, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| MoVA: Adapting Mixture of Vision Experts to Multimodal Context | Apr 19, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency | Apr 19, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| in2IN: Leveraging individual Information to Generate Human INteractions | Apr 15, 2024 | Diversity, Language Modelling | Code Available | 2 |
| LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning | Apr 12, 2024 | Image Segmentation, Language Modeling | Code Available | 2 |
| From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples | Apr 11, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| LaVy: Vietnamese Multimodal Large Language Model | Apr 11, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| UMBRAE: Unified Multimodal Brain Decoding | Apr 10, 2024 | Brain Decoding, Language Modeling | Code Available | 2 |
| ARAGOG: Advanced RAG Output Grading | Apr 1, 2024 | Document Embedding, Language Modeling | Code Available | 2 |
| Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward | Apr 1, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want | Mar 29, 2024 | Instruction Following, Language Modelling | Code Available | 2 |
| Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving | Mar 28, 2024 | Autonomous Driving, Language Modeling | Code Available | 2 |
| Change-Agent: Towards Interactive Comprehensive Remote Sensing Change Interpretation and Analysis | Mar 28, 2024 | Change Detection, Language Modelling | Code Available | 2 |
| DreamLIP: Language-Image Pre-training with Long Captions | Mar 25, 2024 | Contrastive Learning, Image-text Retrieval | Code Available | 2 |
| RepairAgent: An Autonomous, LLM-Based Agent for Program Repair | Mar 25, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models | Mar 22, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| LLM3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning | Mar 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| SelfIE: Self-Interpretation of Large Language Model Embeddings | Mar 16, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| VideoAgent: Long-form Video Understanding with Large Language Model as Agent | Mar 15, 2024 | EgoSchema, Form | Code Available | 2 |
| What Was Your Prompt? A Remote Keylogging Attack on AI Assistants | Mar 14, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents | Mar 13, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model | Mar 13, 2024 | General Knowledge, Instruction Following | Code Available | 2 |
| LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments | Mar 13, 2024 | Decision Making, Language Modeling | Code Available | 2 |
| Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training Framework | Mar 12, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Characterization of Large Language Model Development in the Datacenter | Mar 12, 2024 | GPU, Language Modeling | Code Available | 2 |
| KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction | Mar 12, 2024 | Code Generation, Language Modelling | Code Available | 2 |
| Beyond Text: Frozen Large Language Models in Visual Signal Comprehension | Mar 12, 2024 | Deblurring, Decoder | Code Available | 2 |
| Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews | Mar 11, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System | Mar 11, 2024 | GPU, Language Modeling | Code Available | 2 |