Locally Typical Sampling Feb 1, 2022 Abstractive Text Summarization Story Generation
Code Code Available 4What Makes Good In-Context Examples for GPT-3? Jan 17, 2021 Few-Shot Learning Natural Language Understanding
Code Code Available 4Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks May 22, 2020 Fact Verification Question Answering
Code Code Available 4The Diffusion Duality Jun 12, 2025 Text Generation
Code Code Available 3Long-Context Autoregressive Video Modeling with Next-Frame Prediction Mar 25, 2025 Text Generation Video Generation
Code Code Available 3M+: Extending MemoryLLM with Scalable Long-Term Memory Feb 1, 2025 16k GPU
Code Code Available 3LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer Dec 18, 2024 Attribute Text Generation
Code Code Available 3A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness Nov 4, 2024 Question Answering Text Generation
Code Code Available 3Scaling up Masked Diffusion Models on Text Oct 24, 2024 GSM8K Language Modeling
Code Code Available 3ASFT: Aligned Supervised Fine-Tuning through Absolute Likelihood Sep 14, 2024 Instruction Following Text Generation
Code Code Available 3BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models Aug 23, 2024 Data Poisoning text-classification
Code Code Available 3Controllable Text Generation for Large Language Models: A Survey Aug 22, 2024 Attribute Prompt Engineering
Code Code Available 3SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models Aug 19, 2024 image-classification Image Classification
Code Code Available 3MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities Aug 1, 2024 Math MM-Vet
Code Code Available 3FusionBench: A Comprehensive Benchmark of Deep Model Fusion Jun 5, 2024 image-classification Image Classification
Code Code Available 3FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models Jun 4, 2024 Text Generation Transfer Learning
Code Code Available 3Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset May 17, 2024 16k Benchmarking
Code Code Available 3Evaluating Text-to-Visual Generation with Image-to-Text Generation Apr 1, 2024 Image to text Question Answering
Code Code Available 3Simple linear attention language models balance the recall-throughput tradeoff Feb 28, 2024 Language Modelling Mamba
Code Code Available 3ChatMusician: Understanding and Generating Music Intrinsically with LLM Feb 25, 2024 MMLU Text Generation
Code Code Available 3Efficient Large Language Models: A Survey Dec 6, 2023 Natural Language Understanding Survey
Code Code Available 3Emu: Generative Pretraining in Multimodality Jul 11, 2023 Image Captioning Image Generation
Code Code Available 3CGCE: A Chinese Generative Chat Evaluation Benchmark for General and Financial Domains May 23, 2023 Text Generation
Code Code Available 3LLM-Pruner: On the Structural Pruning of Large Language Models May 19, 2023 Text Generation zero-shot-classification
Code Code Available 3NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference Checklist May 15, 2023 Controllable Language Modelling Dialogue Generation
Code Code Available 3TextBox 2.0: A Text Generation Library with Pre-trained Language Models Dec 26, 2022 Abstractive Text Summarization Data-to-Text Generation
Code Code Available 3Bird-Eye Transformers for Text Generation Models Oct 8, 2022 Attribute Inductive Bias
Code Code Available 3Co-Writing Screenplays and Theatre Scripts with Language Models: An Evaluation by Industry Professionals Sep 29, 2022 Text Generation
Code Code Available 3Diffusion-LM Improves Controllable Text Generation May 27, 2022 Language Modeling Language Modelling
Code Code Available 3Prefix-Tuning: Optimizing Continuous Prompts for Generation Jan 1, 2021 Language Modeling Language Modelling
Code Code Available 3The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs Jul 15, 2025 Code Generation Safety Alignment
Code Code Available 2Seq vs Seq: An Open Suite of Paired Encoders and Decoders Jul 15, 2025 Decoder Large Language Model
Code Code Available 2HyperSteer: Activation Steering at Scale with Hypernetworks Jun 3, 2025 Dictionary Learning Text Generation
Code Code Available 2Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model May 29, 2025 Decoder Image Generation
Code Code Available 2Retrieval Augmented Generation Evaluation in the Era of Large Language Models: A Comprehensive Survey Apr 21, 2025 Computational Efficiency Information Retrieval
Code Code Available 2TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Mar 30, 2025 2k Image Generation
Code Code Available 2Unified Multimodal Discrete Diffusion Mar 26, 2025 Image Captioning Image Generation
Code Code Available 2Reasoning to Learn from Latent Thoughts Mar 24, 2025 Math Text Generation
Code Code Available 2OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models Mar 11, 2025 GPU Mamba
Code Code Available 2WritingBench: A Comprehensive Benchmark for Generative Writing Mar 7, 2025 Text Generation
Code Code Available 2Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Feb 24, 2025 image-classification Image Classification
Code Code Available 2A Survey on Data Contamination for Large Language Models Feb 20, 2025 Survey Text Generation
Code Code Available 2Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems Feb 16, 2025 Open-Domain Question Answering Question Answering
Code Code Available 2A Judge-free LLM Open-ended Generation Benchmark Based on the Distributional Hypothesis Feb 13, 2025 Text Generation
Code Code Available 2Saving 77% of the Parameters in Large Language Models Technical Report Feb 9, 2025 GPU Text Generation
Code Code Available 2CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance Feb 4, 2025 Code Generation Text Generation
Code Code Available 2Where am I? Cross-View Geo-localization with Natural Language Descriptions Dec 22, 2024 geo-localization Image Retrieval
Code Code Available 2LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts Dec 16, 2024 General Knowledge Instruction Following
Code Code Available 2AutoPatent: A Multi-Agent Framework for Automatic Patent Generation Dec 13, 2024 Text Generation
Code Code Available 2OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows Dec 2, 2024 Audio Synthesis Image Generation
Code Code Available 2