Autoregressive Action Sequence Learning for Robotic Manipulation Oct 4, 2024 Chunking Language Modeling
Code Code Available 2AntiFold: Improved antibody structure-based design using inverse folding May 6, 2024 Language Modeling Language Modelling
Code Code Available 2ChatterBox: Multi-round Multimodal Referring and Grounding Jan 24, 2024 Language Modeling Language Modelling
Code Code Available 2ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data Dec 16, 2024 Language Modeling Language Modelling
Code Code Available 2GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest Jul 7, 2023 Attribute Common Sense Reasoning
Code Code Available 2GeoChat: Grounded Large Vision-Language Model for Remote Sensing Nov 24, 2023 Instruction Following Language Modeling
Code Code Available 2GenSim: A General Social Simulation Platform with Large Language Model based Agents Oct 6, 2024 Language Modeling Language Modelling
Code Code Available 2LingoQA: Visual Question Answering for Autonomous Driving Dec 21, 2023 Autonomous Driving Decision Making
Code Code Available 2LinVT: Empower Your Image-level Large Language Model to Understand Videos Dec 6, 2024 Language Modeling Language Modelling
Code Code Available 2AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM Mar 6, 2025 Anomaly Detection Language Modeling
Code Code Available 2GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding Nov 16, 2024 Instruction Following Language Modeling
Code Code Available 2Generative Region-Language Pretraining for Open-Ended Object Detection Mar 15, 2024 Language Modeling Language Modelling
Code Code Available 2Advancing Time Series Classification with Multimodal Language Modeling Mar 19, 2024 Classification Language Modeling
Code Code Available 2GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model Jun 3, 2024 geo-localization Language Modeling
Code Code Available 2Generative Modeling for Mathematical Discovery Mar 14, 2025 Language Modeling Language Modelling
Code Code Available 2Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer Jun 3, 2024 Audio Generation In-Context Learning
Code Code Available 2Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support Feb 25, 2025 Decision Making Diagnostic
Code Code Available 2Generating Benchmarks for Factuality Evaluation of Language Models Jul 13, 2023 Language Modeling Language Modelling
Code Code Available 2Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale Mar 13, 2024 Constituency Grammar Induction Language Modeling
Code Code Available 2GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models May 30, 2025 Classification Disaster Response
Code Code Available 2LLark: A Multimodal Instruction-Following Language Model for Music Oct 11, 2023 Instruction Following Language Modeling
Code Code Available 2LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement Mar 1, 2025 Language Modeling Language Modelling
Code Code Available 2Generalized Interpolating Discrete Diffusion Mar 6, 2025 Language Modeling Language Modelling
Code Code Available 2Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation May 22, 2024 Informativeness Language Modeling
Code Code Available 2Automated Bioinformatics Analysis via AutoBA Sep 6, 2023 AI Agent Language Modeling
Code Code Available 2AutoGRAMS: Autonomous Graphical Agent Modeling Software Jul 14, 2024 Language Modeling Language Modelling
Code Code Available 2Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts Feb 12, 2024 Continual Pretraining GSM8K
Code Code Available 2General-purpose, long-context autoregressive modeling with Perceiver AR Feb 15, 2022 Density Estimation Language Modelling
Code Code Available 2CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text Labels Nov 25, 2022 image-classification Image Classification
Code Code Available 2LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning Jun 20, 2024 Autonomous Navigation Heuristic Search
Code Code Available 2GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities Jun 17, 2024 Audio Question Answering Instruction Following
Code Code Available 2Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification Oct 26, 2020 Few-Shot Text Classification General Classification
Code Code Available 2AutoFlow: Automated Workflow Generation for Large Language Model Agents Jul 1, 2024 AI Agent Language Modeling
Code Code Available 2Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model Mar 20, 2025 Language Modeling Language Modelling
Code Code Available 2Generate rather than Retrieve: Large Language Models are Strong Context Generators Sep 21, 2022 Language Modeling Language Modelling
Code Code Available 2GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering Feb 4, 2024 Language Modeling Language Modelling
Code Code Available 2Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs Dec 2, 2024 All Language Modeling
Code Code Available 2LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings Aug 25, 2024 Language Modelling Link Prediction
Code Code Available 2GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction May 30, 2023 Image Generation Instruction Following
Code Code Available 2From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks Jun 4, 2024 Image Captioning Language Modelling
Code Code Available 2Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling Jan 20, 2025 Imitation Learning Language Modeling
Code Code Available 2Forgetting Transformer: Softmax Attention with a Forget Gate Mar 3, 2025 Language Modeling Language Modelling
Code Code Available 2A Training-free LLM-based Approach to General Chinese Character Error Correction Feb 21, 2025 Language Modeling Language Modelling
Code Code Available 2A Touch, Vision, and Language Dataset for Multimodal Alignment Feb 20, 2024 Language Modeling Language Modelling
Code Code Available 2Formal Mathematics Statement Curriculum Learning Feb 3, 2022 Automated Theorem Proving Language Modeling
Code Code Available 2From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples Apr 11, 2024 Language Modeling Language Modelling
Code Code Available 2A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference Oct 18, 2024 Language Modeling Language Modelling
Code Code Available 2A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models Jul 24, 2023 Image Generation Image-text matching
Code Code Available 2LoQT: Low-Rank Adapters for Quantized Pretraining May 26, 2024 GPU Language Modeling
Code Code Available 2FLAME: Financial Large-Language Model Assessment and Metrics Evaluation Jan 3, 2025 Language Modeling Language Modelling
Code Code Available 2