SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models Dec 16, 2024 Instruction Following
Code Code Available 1LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts Dec 16, 2024 General Knowledge Instruction Following
Code Code Available 2ChipAlign: Instruction Alignment in Large Language Models for Chip Design via Geodesic Interpolation Dec 15, 2024 Instruction Following
— Unverified 0Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval Dec 15, 2024 Image Retrieval Instruction Following
— Unverified 0Empowering LLMs to Understand and Generate Complex Vector Graphics Dec 15, 2024 Instruction Following Vector Graphics
— Unverified 0VLR-Bench: Multilingual Benchmark Dataset for Vision-Language Retrieval Augmented Generation Dec 13, 2024 Instruction Following Question Answering
— Unverified 0EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM Dec 12, 2024 Image Comprehension Image Generation
— Unverified 0SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs Dec 11, 2024 ARC GSM8K
— Unverified 0LLaVA-Zip: Adaptive Visual Token Compression with Intrinsic Image Information Dec 11, 2024 Data Augmentation Instruction Following
— Unverified 0LLMs for Generalizable Language-Conditioned Policy Learning under Minimal Data Requirements Dec 9, 2024 Decision Making Instruction Following
— Unverified 0PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models Dec 9, 2024 Benchmarking Instruction Following
Code Code Available 0Sloth: scaling laws for LLM skills to predict multi-benchmark performance across families Dec 9, 2024 Emotional Intelligence Instruction Following
Code Code Available 0KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models Dec 8, 2024 Instruction Following Natural Language Understanding
Code Code Available 1GROOT-2: Weakly Supervised Multi-Modal Instruction Following Agents Dec 7, 2024 Instruction Following
— Unverified 0Compositional Image Retrieval via Instruction-Aware Contrastive Learning Dec 7, 2024 Contrastive Learning Image Retrieval
Code Code Available 0RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts Dec 7, 2024 Change Detection Image Comprehension
Code Code Available 1EXAONE 3.5: Series of Large Language Models for Real-world Use Cases Dec 6, 2024 Instruction Following
— Unverified 0LLM-Align: Utilizing Large Language Models for Entity Alignment in Knowledge Graphs Dec 6, 2024 Entity Alignment Entity Embeddings
— Unverified 0If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs Dec 5, 2024 Code Generation Instruction Following
— Unverified 0VidHalluc: Evaluating Temporal Hallucinations in Multimodal Large Language Models for Video Understanding Dec 4, 2024 Hallucination Instruction Following
— Unverified 0From Words to Workflows: Automating Business Processes Dec 4, 2024 Decision Making Instruction Following
— Unverified 0PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation Dec 4, 2024 Instruction Following
Code Code Available 1Agri-LLaVA: Knowledge-Infused Large Multimodal Assistant on Agricultural Pests and Diseases Dec 3, 2024 Instruction Following
Code Code Available 1Optimizing Latent Goal by Learning from Trajectory Preference Dec 3, 2024 Continual Learning Instruction Following
— Unverified 0T-REG: Preference Optimization with Token-Level Reward Regularization Dec 3, 2024 Instruction Following
Code Code Available 0AlignFormer: Modality Matching Can Achieve Better Zero-shot Instruction-Following Speech-LLM Dec 2, 2024 Instruction Following Question Answering
— Unverified 0MiningGPT -- A Domain-Specific Large Language Model for the Mining Industry Dec 2, 2024 Instruction Following Language Modeling
— Unverified 0Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation Dec 2, 2024 Data Integration Instruction Following
— Unverified 0VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation Dec 1, 2024 Instruction Following Video Understanding
— Unverified 0InsightEdit: Towards Better Instruction Following for Image Editing Nov 26, 2024 Instruction Following
— Unverified 0ShowUI: One Vision-Language-Action Model for GUI Visual Agent Nov 26, 2024 Instruction Following Natural Language Visual Grounding
Code Code Available 5Parameter Efficient Instruction Tuning: An Empirical Study Nov 25, 2024 Instruction Following Memorization
Code Code Available 4Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors Nov 24, 2024 Depth Estimation Instruction Following
— Unverified 0From MTEB to MTOB: Retrieval-Augmented Classification for Descriptive Grammars Nov 23, 2024 Descriptive In-Context Learning
Code Code Available 0Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy Nov 23, 2024 Instruction Following MME
— Unverified 0Separable Mixture of Low-Rank Adaptation for Continual Visual Instruction Tuning Nov 21, 2024 Continual Learning Instruction Following
— Unverified 0GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding Nov 16, 2024 Instruction Following Language Modeling
Code Code Available 2MpoxVLM: A Vision-Language Model for Diagnosing Skin Lesions from Mpox Virus Infection Nov 16, 2024 Diagnostic Instruction Following
Code Code Available 0MLAN: Language-Based Instruction Tuning Improves Zero-Shot Generalization of Multimodal Large Language Models Nov 15, 2024 Instruction Following Zero-shot Generalization
Code Code Available 0Adaptive Decoding via Latent Preference Optimization Nov 14, 2024 GSM8K Instruction Following
— Unverified 0LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation Nov 14, 2024 Earth Observation Instruction Following
Code Code Available 2Zero-shot Object-Centric Instruction Following: Integrating Foundation Models with Traditional Navigation Nov 12, 2024 Instruction Following Object
— Unverified 0LIFBench: Evaluating the Instruction Following Performance and Stability of Large Language Models in Long-Context Scenarios Nov 11, 2024 Instruction Following
Code Code Available 0SetLexSem Challenge: Using Set Operations to Evaluate the Lexical and Semantic Robustness of Language Models Nov 11, 2024 Instruction Following
Code Code Available 1Stronger Models are NOT Stronger Teachers for Instruction Tuning Nov 11, 2024 Instruction Following
— Unverified 0MrSteve: Instruction-Following Agents in Minecraft with What-Where-When Memory Nov 11, 2024 Instruction Following Minecraft
— Unverified 0IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization Nov 9, 2024 Instruction Following
Code Code Available 0Fox-1 Technical Report Nov 8, 2024 2k 8k
— Unverified 0Bayesian Calibration of Win Rate Estimation with LLM Evaluators Nov 7, 2024 Bayesian Inference Instruction Following
Code Code Available 0Multi-Reward as Condition for Instruction-based Image Editing Nov 6, 2024 Descriptive Instruction Following
— Unverified 0