Knowledge Fusion of Large Language Models Jan 19, 2024 Code Generation Common Sense Reasoning
Code Code Available 45 Leveraging Speculative Sampling and KV-Cache Optimizations Together for Generative AI using OpenVINO Nov 8, 2023 Quantization Text Generation
Code Code Available 45 DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation Oct 14, 2022 Natural Language Understanding Text Generation
Code Code Available 45 BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models Aug 23, 2024 Data Poisoning text-classification
Code Code Available 35 FusionBench: A Comprehensive Benchmark of Deep Model Fusion Jun 5, 2024 image-classification Image Classification
Code Code Available 35 Emu: Generative Pretraining in Multimodality Jul 11, 2023 Image Captioning Image Generation
Code Code Available 35 FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models Jun 4, 2024 Text Generation Transfer Learning
Code Code Available 35 Evaluating Text-to-Visual Generation with Image-to-Text Generation Apr 1, 2024 Image to text Question Answering
Code Code Available 35 Efficient Large Language Models: A Survey Dec 6, 2023 Natural Language Understanding Survey
Code Code Available 35 The Diffusion Duality Jun 12, 2025 Text Generation
Code Code Available 35 SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models Aug 19, 2024 image-classification Image Classification
Code Code Available 35 TextBox 2.0: A Text Generation Library with Pre-trained Language Models Dec 26, 2022 Abstractive Text Summarization Data-to-Text Generation
Code Code Available 35 A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness Nov 4, 2024 Question Answering Text Generation
Code Code Available 35 Simple linear attention language models balance the recall-throughput tradeoff Feb 28, 2024 Language Modelling Mamba
Code Code Available 35 ASFT: Aligned Supervised Fine-Tuning through Absolute Likelihood Sep 14, 2024 Instruction Following Text Generation
Code Code Available 35 MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities Aug 1, 2024 Math MM-Vet
Code Code Available 35 Diffusion-LM Improves Controllable Text Generation May 27, 2022 Language Modeling Language Modelling
Code Code Available 35 Scaling up Masked Diffusion Models on Text Oct 24, 2024 GSM8K Language Modeling
Code Code Available 35 Co-Writing Screenplays and Theatre Scripts with Language Models: An Evaluation by Industry Professionals Sep 29, 2022 Text Generation
Code Code Available 35 NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference Checklist May 15, 2023 Controllable Language Modelling Dialogue Generation
Code Code Available 35 M+: Extending MemoryLLM with Scalable Long-Term Memory Feb 1, 2025 16k GPU
Code Code Available 35 ChatMusician: Understanding and Generating Music Intrinsically with LLM Feb 25, 2024 MMLU Text Generation
Code Code Available 35 Prefix-Tuning: Optimizing Continuous Prompts for Generation Jan 1, 2021 Language Modeling Language Modelling
Code Code Available 35 CGCE: A Chinese Generative Chat Evaluation Benchmark for General and Financial Domains May 23, 2023 Text Generation
Code Code Available 35 LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer Dec 18, 2024 Attribute Text Generation
Code Code Available 35 LLM-Pruner: On the Structural Pruning of Large Language Models May 19, 2023 Text Generation zero-shot-classification
Code Code Available 35 Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset May 17, 2024 16k Benchmarking
Code Code Available 35 Bird-Eye Transformers for Text Generation Models Oct 8, 2022 Attribute Inductive Bias
Code Code Available 35 Controllable Text Generation for Large Language Models: A Survey Aug 22, 2024 Attribute Prompt Engineering
Code Code Available 35 Long-Context Autoregressive Video Modeling with Next-Frame Prediction Mar 25, 2025 Text Generation Video Generation
Code Code Available 35 HyperSteer: Activation Steering at Scale with Hypernetworks Jun 3, 2025 Dictionary Learning Text Generation
Code Code Available 25 HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models Sep 24, 2024 Long-Context Understanding Text Generation
Code Code Available 25 Harmonizing Visual Text Comprehension and Generation Jul 23, 2024 multimodal generation Reading Comprehension
Code Code Available 25 A Judge-free LLM Open-ended Generation Benchmark Based on the Distributional Hypothesis Feb 13, 2025 Text Generation
Code Code Available 25 Improving Factuality and Reasoning in Language Models through Multiagent Debate May 23, 2023 Few-Shot Learning Language Modeling
Code Code Available 25 HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding Mar 1, 2024 Hallucination Object
Code Code Available 25 GPTScore: Evaluate as You Desire Feb 8, 2023 Text Generation
Code Code Available 25 GPT-NER: Named Entity Recognition via Large Language Models Apr 20, 2023 Hallucination named-entity-recognition
Code Code Available 25 Grounding Language Models to Images for Multimodal Inputs and Outputs Jan 31, 2023 Image Retrieval In-Context Learning
Code Code Available 25 Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference May 28, 2024 GPU Text Generation
Code Code Available 25 In-Context Editing: Learning Knowledge from Self-Induced Distributions Jun 17, 2024 Image Editing In-Context Learning
Code Code Available 25 From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models Apr 1, 2024 Graph Generation Image to text
Code Code Available 25 Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale Mar 13, 2024 Constituency Grammar Induction Language Modeling
Code Code Available 25 AutoPatent: A Multi-Agent Framework for Automatic Patent Generation Dec 13, 2024 Text Generation
Code Code Available 25 Few-Shot Text Generation with Pattern-Exploiting Training Dec 22, 2020 Headline Generation text-classification
Code Code Available 25 Fine-Grained Human Feedback Gives Better Rewards for Language Model Training Jun 2, 2023 Language Modeling Language Modelling
Code Code Available 25 Expressive Text-to-Image Generation with Rich Text Apr 13, 2023 Image Generation Text Generation
Code Code Available 25 Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap Jan 18, 2024 Code Generation Evolutionary Algorithms
Code Code Available 25 FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation May 23, 2023 Form Language Modelling
Code Code Available 25 Evaluating Morphological Compositional Generalization in Large Language Models Oct 16, 2024 Text Generation
Code Code Available 25