Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine Dec 12, 2024 Language Modeling Language Modelling
Predicting Human Brain States with Transformer Dec 11, 2024 Language Modelling Music Generation
Granite Guardian Dec 10, 2024 Hallucination Language Modeling
C^2LEVA: Toward Comprehensive and Contamination-Free Language Model Evaluation Dec 6, 2024 Language Model Evaluation Language Modeling
LinVT: Empower Your Image-level Large Language Model to Understand Videos Dec 6, 2024 Language Modeling Language Modelling
FLAIR: VLM with Fine-grained Language-informed Image Representations Dec 4, 2024 Language Modeling Language Modelling
X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models Dec 2, 2024 Image Generation In-Context Learning
Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs Dec 2, 2024 All Language Modeling
KV Shifting Attention Enhances Language Modeling Nov 29, 2024 In-Context Learning Language Modeling
MotionLLaMA: A Unified Framework for Motion Synthesis and Comprehension Nov 26, 2024 Language Modeling Language Modelling
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection Nov 26, 2024 3D Object Detection Autonomous Driving
Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks Nov 23, 2024 Language Modeling Language Modelling
Large Language Model with Region-guided Referring and Grounding for CT Report Generation Nov 23, 2024 Computed Tomography (CT) Diagnostic
RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts Nov 22, 2024 AI Agent Language Modeling
ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data Nov 22, 2024 Language Modeling Language Modelling
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI Nov 21, 2024 Decision Making Language Modeling
MC-LLaVA: Multi-Concept Personalized Vision-Language Model Nov 18, 2024 Language Modeling Language Modelling
BianCang: A Traditional Chinese Medicine Large Language Model Nov 17, 2024 Diagnostic Language Modeling
GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding Nov 16, 2024 Instruction Following Language Modeling
SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning Nov 15, 2024 Image Quality Assessment Language Modeling
LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation Nov 14, 2024 Earth Observation Instruction Following
TIPO: Text to Image with Text Presampling for Prompt Optimization Nov 12, 2024 Image Generation Language Modeling
Tucano: Advancing Neural Text Generation for Portuguese Nov 12, 2024 Language Modeling Language Modelling
The Super Weight in Large Language Models Nov 11, 2024 Language Modeling Language Modelling
Concept Bottleneck Language Models For protein design Nov 9, 2024 Decision Making Drug Discovery
LLM-PySC2: Starcraft II learning environment for Large Language Models Nov 8, 2024 Decision Making Language Modelling
End-to-End Navigation with Vision Language Models: Transforming Spatial Reasoning into Question-Answering Nov 8, 2024 Language Modeling Language Modelling
PhoneLM: an Efficient and Capable Small Language Model Family through Principled Pre-training Nov 7, 2024 Language Modeling Language Modelling
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization Nov 5, 2024 Hallucination Language Modeling
RAGViz: Diagnose and Visualize Retrieval-Augmented Generation Nov 4, 2024 Answer Generation GPU
Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs Oct 31, 2024 Knowledge Graphs Language Modeling
GPT or BERT: why not both? Oct 31, 2024 Causal Language Modeling Language Modeling
What is Wrong with Perplexity for Long-context Language Modeling? Oct 31, 2024 Document Summarization In-Context Learning
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance Oct 29, 2024 Language Modeling Language Modelling
Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench Oct 29, 2024 Language Modeling Language Modelling
Retrieval-Enhanced Mutation Mastery: Augmenting Zero-Shot Prediction of Protein Language Model Oct 28, 2024 Language Modeling Language Modelling
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models Oct 23, 2024 Instruction Following Language Modelling
PAPILLON: Privacy Preservation from Internet-based and Local Language Model Ensembles Oct 22, 2024 Language Modeling Language Modelling
MiniPLM: Knowledge Distillation for Pre-Training Language Models Oct 22, 2024 Diversity Knowledge Distillation
Improve Vision Language Model Chain-of-thought Reasoning Oct 21, 2024 Language Modeling Language Modelling
RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style Oct 21, 2024 Benchmarking Language Modeling
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation Oct 19, 2024 AI Agent Benchmarking
A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference Oct 18, 2024 Language Modeling Language Modelling
Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning Oct 18, 2024 Language Modeling Language Modelling
On the Role of Attention Heads in Large Language Model Safety Oct 17, 2024 Attribute Language Modeling
Process Reward Model with Q-Value Rankings Oct 15, 2024 Decision Making Language Modeling
MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation Oct 15, 2024 Hallucination Language Modeling
WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic Segmentation Oct 15, 2024 Autonomous Driving Language Modeling
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization Oct 11, 2024 GSM8K Language Modeling
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs Oct 10, 2024 Active Learning Language Modeling