GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities Jun 17, 2024 Audio Question Answering Instruction Following
Code Code Available 2Generative Modeling for Mathematical Discovery Mar 14, 2025 Language Modeling Language Modelling
Code Code Available 2MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling Mar 17, 2025 GPU Language Modeling
Code Code Available 2From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks Jun 4, 2024 Image Captioning Language Modelling
Code Code Available 2A Touch, Vision, and Language Dataset for Multimodal Alignment Feb 20, 2024 Language Modeling Language Modelling
Code Code Available 2Med3DVLM: An Efficient Vision-Language Model for 3D Medical Image Analysis Mar 25, 2025 Contrastive Learning Image-text Retrieval
Code Code Available 2CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers Apr 28, 2022 Image Generation Language Modeling
Code Code Available 2From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples Apr 11, 2024 Language Modeling Language Modelling
Code Code Available 2A Training-free LLM-based Approach to General Chinese Character Error Correction Feb 21, 2025 Language Modeling Language Modelling
Code Code Available 2Mega: Moving Average Equipped Gated Attention Sep 21, 2022 Image Classification Inductive Bias
Code Code Available 2Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization Mar 5, 2025 Language Modeling Language Modelling
Code Code Available 2Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism Sep 17, 2019 GPU LAMBADA
Code Code Available 2Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities Mar 6, 2025 Language Modeling Language Modelling
Code Code Available 2Forgetting Transformer: Softmax Attention with a Forget Gate Mar 3, 2025 Language Modeling Language Modelling
Code Code Available 2A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference Oct 18, 2024 Language Modeling Language Modelling
Code Code Available 2Memory Mosaics May 10, 2024 Disentanglement In-Context Learning
Code Code Available 2Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Jan 20, 2025 Language Modeling Language Modelling
Code Code Available 2Metadata Conditioning Accelerates Language Model Pre-training Jan 3, 2025 Language Modeling Language Modelling
Code Code Available 2MetaOpenFOAM 2.0: Large Language Model Driven Chain of Thought for Automating CFD Simulation and Post-Processing Feb 1, 2025 Language Modeling Language Modelling
Code Code Available 2Enhancing Diagnostic Accuracy in Rare and Common Fundus Diseases with a Knowledge-Rich Vision-Language Model Jun 13, 2024 Diagnostic Image Retrieval
Code Code Available 2Formal Mathematics Statement Curriculum Learning Feb 3, 2022 Automated Theorem Proving Language Modeling
Code Code Available 2A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models Jul 24, 2023 Image Generation Image-text matching
Code Code Available 2G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning May 19, 2025 Language Modeling Language Modelling
Code Code Available 2Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer Jun 3, 2024 Audio Generation In-Context Learning
Code Code Available 2Composed Image Retrieval for Remote Sensing May 24, 2024 Composed Image Retrieval (CoIR) Descriptive
Code Code Available 2Asynchronous Large Language Model Enhanced Planner for Autonomous Driving Jun 20, 2024 Autonomous Driving Language Modeling
Code Code Available 2AgentSims: An Open-Source Sandbox for Large Language Model Evaluation Aug 8, 2023 Language Model Evaluation Language Modeling
Code Code Available 2FLAME: Financial Large-Language Model Assessment and Metrics Evaluation Jan 3, 2025 Language Modeling Language Modelling
Code Code Available 2An Egocentric Vision-Language Model based Portable Real-time Smart Assistant Mar 6, 2025 Language Modeling Language Modelling
Code Code Available 2FLAIR: VLM with Fine-grained Language-informed Image Representations Dec 4, 2024 Language Modeling Language Modelling
Code Code Available 2FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets Jul 20, 2023 Instruction Following Language Model Evaluation
Code Code Available 2A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model May 3, 2024 Decision Making Few-Shot Learning
Code Code Available 2ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints Aug 3, 2023 Image Generation Language Modelling
Code Code Available 2MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models Jun 23, 2023 Benchmarking Language Modeling
Code Code Available 2Concept Bottleneck Language Models For protein design Nov 9, 2024 Decision Making Drug Discovery
Code Code Available 2MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments Mar 4, 2025 2D Panoptic Segmentation Graph Generation
Code Code Available 2MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding Sep 23, 2024 Language Modeling Language Modelling
Code Code Available 2Modifying Large Language Model Post-Training for Diverse Creative Writing Mar 21, 2025 Diversity Language Modeling
Code Code Available 2MoEUT: Mixture-of-Experts Universal Transformers May 25, 2024 Language Modeling Language Modelling
Code Code Available 2A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval Mar 7, 2025 Information Retrieval Language Modeling
Code Code Available 2A Survey of Multimodal Large Language Model from A Data-centric Perspective May 26, 2024 Language Modeling Language Modelling
Code Code Available 2FIRST: Faster Improved Listwise Reranking with Single Token Decoding Jun 21, 2024 Information Retrieval Language Modeling
Code Code Available 2Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models Oct 23, 2024 Instruction Following Language Modelling
Code Code Available 2Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning Oct 18, 2024 Language Modeling Language Modelling
Code Code Available 2DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings Apr 21, 2022 Contrastive Learning Language Modeling
Code Code Available 2DiffArtist: Towards Structure and Appearance Controllable Image Stylization Jul 22, 2024 Disentanglement Image Stylization
Code Code Available 2A Survey of Graph Meets Large Language Model: Progress and Future Directions Nov 21, 2023 Language Modeling Language Modelling
Code Code Available 2FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design Nov 23, 2023 Decision Making Language Modelling
Code Code Available 2Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models May 8, 2024 Language Modeling Language Modelling
Code Code Available 2Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale Mar 13, 2024 Constituency Grammar Induction Language Modeling
Code Code Available 2