LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning Apr 12, 2024 Image Segmentation Language Modeling
Behavior Trees Enable Structured Programming of Language Model Agents Apr 11, 2024 Language Modeling Language Modelling
HGRN2: Gated Linear RNNs with State Expansion Apr 11, 2024 Image Classification Language Modeling
LaVy: Vietnamese Multimodal Large Language Model Apr 11, 2024 Language Modeling Language Modelling
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples Apr 11, 2024 Language Modeling Language Modelling
UMBRAE: Unified Multimodal Brain Decoding Apr 10, 2024 Brain Decoding Language Modeling
Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation Apr 9, 2024 Knowledge Distillation Language Modeling
Test-Time Zero-Shot Temporal Action Localization Apr 8, 2024 Action Localization Language Modelling
MotionChain: Conversational Motion Controllers via Multimodal Prompts Apr 2, 2024 Language Modeling Language Modelling
Stream of Search (SoS): Learning to Search in Language Apr 1, 2024 Language Modelling
Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward Apr 1, 2024 Instruction Following Language Modeling
ARAGOG: Advanced RAG Output Grading Apr 1, 2024 Document Embedding Language Modeling
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want Mar 29, 2024 Instruction Following Language Modelling
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis Mar 29, 2024 Hallucination Image Captioning
Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving Mar 28, 2024 Autonomous Driving Language Modeling
Change-Agent: Towards Interactive Comprehensive Remote Sensing Change Interpretation and Analysis Mar 28, 2024 Change Detection Language Modelling
Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction Mar 27, 2024 Image Captioning Language Modeling
An Image Grid Can Be Worth a Video: Zero-shot Video Question Answering Using a VLM Mar 27, 2024 Language Modeling Language Modelling
Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms Mar 26, 2024 Language Modelling
MIND Your Language: A Multilingual Dataset for Cross-lingual News Recommendation Mar 26, 2024 Cross-Lingual Transfer Language Modelling
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance Mar 25, 2024 Language Modeling Language Modelling
DreamLIP: Language-Image Pre-training with Long Captions Mar 25, 2024 Contrastive Learning Image-text Retrieval
RepairAgent: An Autonomous, LLM-Based Agent for Program Repair Mar 25, 2024 Language Modelling Large Language Model
Understanding Long Videos with Multimodal Language Models Mar 25, 2024 Action Recognition Fine-grained Action Recognition
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models Mar 22, 2024 Language Modelling Large Language Model
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Mar 20, 2024 Language Modeling Language Modelling
Cross-Domain Pre-training with Language Models for Transferable Time Series Representations Mar 19, 2024 Language Modelling Time Series
Advancing Time Series Classification with Multimodal Language Modeling Mar 19, 2024 Classification Language Modeling
LLM3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning Mar 18, 2024 Language Modeling Language Modelling
SelfIE: Self-Interpretation of Large Language Model Embeddings Mar 16, 2024 Language Modeling Language Modelling
Generative Region-Language Pretraining for Open-Ended Object Detection Mar 15, 2024 Language Modeling Language Modelling
VideoAgent: Long-form Video Understanding with Large Language Model as Agent Mar 15, 2024 EgoSchema Form
What Was Your Prompt? A Remote Keylogging Attack on AI Assistants Mar 14, 2024 Language Modeling Language Modelling
LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments Mar 13, 2024 Decision Making Language Modeling
Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale Mar 13, 2024 Constituency Grammar Induction Language Modeling
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents Mar 13, 2024 Language Modeling Language Modelling
Language models scale reliably with over-training and on downstream tasks Mar 13, 2024 Language Modelling
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model Mar 13, 2024 General Knowledge Instruction Following
Characterization of Large Language Model Development in the Datacenter Mar 12, 2024 GPU Language Modeling
Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training Framework Mar 12, 2024 Language Modelling Large Language Model
VLKEB: A Large Vision-Language Model Knowledge Editing Benchmark Mar 12, 2024 knowledge editing Language Modeling
KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction Mar 12, 2024 Code Generation Language Modelling
Beyond Text: Frozen Large Language Models in Visual Signal Comprehension Mar 12, 2024 Deblurring Decoder
Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews Mar 11, 2024 Language Modelling Large Language Model
Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System Mar 11, 2024 GPU Language Modeling
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios Mar 7, 2024 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
Online Adaptation of Language Models with a Memory of Amortized Contexts Mar 7, 2024 Language Modelling Meta-Learning
Backtracing: Retrieving the Cause of the Query Mar 6, 2024 Information Retrieval Language Modeling
MeaCap: Memory-Augmented Zero-shot Image Captioning Mar 6, 2024 Caption Generation Image Captioning
ESM All-Atom: Multi-scale Protein Language Model for Unified Molecular Modeling Mar 5, 2024 All Language Modeling