| Behavior Trees Enable Structured Programming of Language Model Agents | Apr 11, 2024 | Language Modeling | Code Available | 2 |
| From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples | Apr 11, 2024 | Language Modeling | Code Available | 2 |
| UMBRAE: Unified Multimodal Brain Decoding | Apr 10, 2024 | Brain Decoding, Language Modeling | Code Available | 2 |
| Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation | Apr 9, 2024 | Knowledge Distillation, Language Modeling | Code Available | 2 |
| MotionChain: Conversational Motion Controllers via Multimodal Prompts | Apr 2, 2024 | Language Modeling | Code Available | 2 |
| ARAGOG: Advanced RAG Output Grading | Apr 1, 2024 | Document Embedding, Language Modeling | Code Available | 2 |
| Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward | Apr 1, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis | Mar 29, 2024 | Hallucination, Image Captioning | Code Available | 2 |
| Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving | Mar 28, 2024 | Autonomous Driving, Language Modeling | Code Available | 2 |
| Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction | Mar 27, 2024 | Image Captioning, Language Modeling | Code Available | 2 |
| An Image Grid Can Be Worth a Video: Zero-shot Video Question Answering Using a VLM | Mar 27, 2024 | Language Modeling | Code Available | 2 |
| Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance | Mar 25, 2024 | Language Modeling | Code Available | 2 |
| LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models | Mar 20, 2024 | Language Modeling | Code Available | 2 |
| Advancing Time Series Classification with Multimodal Language Modeling | Mar 19, 2024 | Classification, Language Modeling | Code Available | 2 |
| LLM3: Large Language Model-based Task and Motion Planning with Motion Failure Reasoning | Mar 18, 2024 | Language Modeling | Code Available | 2 |
| SelfIE: Self-Interpretation of Large Language Model Embeddings | Mar 16, 2024 | Language Modeling | Code Available | 2 |
| Generative Region-Language Pretraining for Open-Ended Object Detection | Mar 15, 2024 | Language Modeling | Code Available | 2 |
| VideoAgent: Long-form Video Understanding with Large Language Model as Agent | Mar 15, 2024 | EgoSchema, Form | Code Available | 2 |
| What Was Your Prompt? A Remote Keylogging Attack on AI Assistants | Mar 14, 2024 | Language Modeling | Code Available | 2 |
| Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale | Mar 13, 2024 | Constituency Grammar Induction, Language Modeling | Code Available | 2 |
| CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model | Mar 13, 2024 | General Knowledge, Instruction Following | Code Available | 2 |
| LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments | Mar 13, 2024 | Decision Making, Language Modeling | Code Available | 2 |
| SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents | Mar 13, 2024 | Language Modeling | Code Available | 2 |
| VLKEB: A Large Vision-Language Model Knowledge Editing Benchmark | Mar 12, 2024 | knowledge editing, Language Modeling | Code Available | 2 |
| Characterization of Large Language Model Development in the Datacenter | Mar 12, 2024 | GPU, Language Modeling | Code Available | 2 |
| Beyond Text: Frozen Large Language Models in Visual Signal Comprehension | Mar 12, 2024 | Deblurring, Decoder | Code Available | 2 |
| Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System | Mar 11, 2024 | GPU, Language Modeling | Code Available | 2 |
| CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios | Mar 7, 2024 | Audio-Visual Question Answering (AVQA) | Code Available | 2 |
| Backtracing: Retrieving the Cause of the Query | Mar 6, 2024 | Information Retrieval, Language Modeling | Code Available | 2 |
| ESM All-Atom: Multi-scale Protein Language Model for Unified Molecular Modeling | Mar 5, 2024 | All, Language Modeling | Code Available | 2 |
| InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents | Mar 5, 2024 | Benchmarking, Language Modeling | Code Available | 2 |
| Android in the Zoo: Chain-of-Action-Thought for GUI Agents | Mar 5, 2024 | Language Modeling | Code Available | 2 |
| ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training | Feb 28, 2024 | In-Context Learning, Language Modeling | Code Available | 2 |
| Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation | Feb 28, 2024 | Code Generation, In-Context Learning | Code Available | 2 |
| Large Language Models (LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey | Feb 27, 2024 | Language Modeling | Code Available | 2 |
| RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations | Feb 27, 2024 | Attribute, Language Modeling | Code Available | 2 |
| Retrieval is Accurate Generation | Feb 27, 2024 | Language Modeling | Code Available | 2 |
| Long-Context Language Modeling with Parallel Context Encoding | Feb 26, 2024 | In-Context Learning, Instruction Following | Code Available | 2 |
| HiGPT: Heterogeneous Graph Language Model | Feb 25, 2024 | Graph Learning, Language Modeling | Code Available | 2 |
| GraphWiz: An Instruction-Following Language Model for Graph Problems | Feb 25, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG) | Feb 23, 2024 | Language Modeling | Code Available | 2 |
| PALO: A Polyglot Large Multimodal Model for 5B People | Feb 22, 2024 | Language Modeling | Code Available | 2 |
| Subobject-level Image Tokenization | Feb 22, 2024 | Attribute, Language Modeling | Code Available | 2 |
| Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning | Feb 21, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| A Touch, Vision, and Language Dataset for Multimodal Alignment | Feb 20, 2024 | Language Modeling | Code Available | 2 |
| Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning | Feb 18, 2024 | Language Modeling | Code Available | 2 |
| RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model | Feb 16, 2024 | Autonomous Driving, Decision Making | Code Available | 2 |
| Linear Transformers with Learnable Kernel Functions are Better In-Context Models | Feb 16, 2024 | In-Context Learning, Language Modeling | Code Available | 2 |
| ChemReasoner: Heuristic Search over a Large Language Model's Knowledge Space using Quantum-Chemical Feedback | Feb 15, 2024 | Computational chemistry, Graph Neural Network | Code Available | 2 |
| Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents | Feb 14, 2024 | Language Modeling | Code Available | 2 |