BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer Jul 1, 2023 Language Modeling Language Modelling
Code Code Available 2Keyformer: KV Cache Reduction through Key Tokens Selection for Efficient Generative Inference Mar 14, 2024 Text Generation
Code Code Available 2Improving Factuality and Reasoning in Language Models through Multiagent Debate May 23, 2023 Few-Shot Learning Language Modeling
Code Code Available 2In-Context Editing: Learning Knowledge from Self-Induced Distributions Jun 17, 2024 Image Editing In-Context Learning
Code Code Available 2Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph Jun 21, 2024 Benchmarking Text Generation
Code Code Available 2In-Context Retrieval-Augmented Language Models Jan 31, 2023 Language Modeling Language Modelling
Code Code Available 2MiniLLM: Knowledge Distillation of Large Language Models Jun 14, 2023 Instruction Following Knowledge Distillation
Code Code Available 2AutoPatent: A Multi-Agent Framework for Automatic Patent Generation Dec 13, 2024 Text Generation
Code Code Available 2HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding Mar 1, 2024 Hallucination Object
Code Code Available 2Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference May 28, 2024 GPU Text Generation
Code Code Available 2Grounding Language Models to Images for Multimodal Inputs and Outputs Jan 31, 2023 Image Retrieval In-Context Learning
Code Code Available 2Harmonizing Visual Text Comprehension and Generation Jul 23, 2024 multimodal generation Reading Comprehension
Code Code Available 2GlyphControl: Glyph Conditional Control for Visual Text Generation May 29, 2023 Optical Character Recognition (OCR) Text Generation
Code Code Available 2BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models Jun 19, 2023 Instruction Following Text Generation
Code Code Available 2InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management Jun 28, 2024 Management Text Generation
Code Code Available 2Inseq: An Interpretability Toolkit for Sequence Generation Models Feb 27, 2023 Decoder Feature Importance
Code Code Available 2Authorship Obfuscation in Multilingual Machine-Generated Text Detection Jan 15, 2024 Adversarial Robustness Benchmarking
Code Code Available 2GPT-NER: Named Entity Recognition via Large Language Models Apr 20, 2023 Hallucination named-entity-recognition
Code Code Available 2Language-Driven Representation Learning for Robotics Feb 24, 2023 Contrastive Learning Imitation Learning
Code Code Available 2Language Models Can See: Plugging Visual Controls in Text Generation May 5, 2022 Image Captioning Image-text matching
Code Code Available 2Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale Mar 13, 2024 Constituency Grammar Induction Language Modeling
Code Code Available 2Large Language Model with Region-guided Referring and Grounding for CT Report Generation Nov 23, 2024 Computed Tomography (CT) Diagnostic
Code Code Available 2GPTScore: Evaluate as You Desire Feb 8, 2023 Text Generation
Code Code Available 2Fine-Grained Human Feedback Gives Better Rewards for Language Model Training Jun 2, 2023 Language Modeling Language Modelling
Code Code Available 2A Touch, Vision, and Language Dataset for Multimodal Alignment Feb 20, 2024 Language Modeling Language Modelling
Code Code Available 2Few-Shot Text Generation with Pattern-Exploiting Training Dec 22, 2020 Headline Generation text-classification
Code Code Available 2A Survey on Data Contamination for Large Language Models Feb 20, 2025 Survey Text Generation
Code Code Available 2Balancing LoRA Performance and Efficiency with Simple Shard Sharing Sep 19, 2024 Computational Efficiency GSM8K
Code Code Available 2A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models Jul 24, 2023 Image Generation Image-text matching
Code Code Available 2LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models Oct 12, 2023 Natural Language Understanding Quantization
Code Code Available 2LongForm: Effective Instruction Tuning with Reverse Instructions Apr 17, 2023 Long Form Question Answering News Generation
Code Code Available 2FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation May 23, 2023 Form Language Modelling
Code Code Available 2Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap Jan 18, 2024 Code Generation Evolutionary Algorithms
Code Code Available 2MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis Jul 10, 2024 GPU Image Generation
Code Code Available 2Building Cooperative Embodied Agents Modularly with Large Language Models Jul 5, 2023 Text Generation
Code Code Available 2mbrs: A Library for Minimum Bayes Risk Decoding Aug 8, 2024 Text Generation
Code Code Available 2Expressive Text-to-Image Generation with Rich Text Apr 13, 2023 Image Generation Text Generation
Code Code Available 2MemLong: Memory-Augmented Retrieval for Long Text Modeling Aug 30, 2024 4k Decoder
Code Code Available 2MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens Oct 3, 2023 Image Generation multimodal generation
Code Code Available 2MonoFormer: One Transformer for Both Diffusion and Autoregression Sep 24, 2024 Image Generation Text Generation
Code Code Available 2From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models Apr 1, 2024 Graph Generation Image to text
Code Code Available 2HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models Sep 24, 2024 Long-Context Understanding Text Generation
Code Code Available 2Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms Jun 5, 2024 Low-Rank Matrix Completion Machine Translation
Code Code Available 2Get my drift? Catching LLM Task Drift with Activation Deltas Jun 2, 2024 Text Generation
Code Code Available 2Ecco: An Open Source Library for the Explainability of Transformer Language Models Aug 1, 2021 Text Generation
Code Code Available 2AnyText2: Visual Text Generation and Editing With Customizable Attributes Nov 22, 2024 Image Generation Text Generation
Code Code Available 2DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences Jun 5, 2024 Autonomous Driving Autonomous Vehicles
Code Code Available 2ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis Aug 16, 2024 Contrastive Learning Diagnostic
Code Code Available 2eVAE: Evolutionary Variational Autoencoder Jan 1, 2023 Disentanglement Image Generation
Code Code Available 2DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models Nov 28, 2022 Denoising Language Modeling
Code Code Available 2