BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining | Oct 19, 2022 | Document Classification, Language Modelling | Code Available 4
LISA: Reasoning Segmentation via Large Language Model | Aug 1, 2023 | Language Modeling, Language Modelling | Code Available 4
AlignScore: Evaluating Factual Consistency with a Unified Alignment Function | May 26, 2023 | Fact Verification, Information Retrieval | Code Available 4
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models | Aug 23, 2024 | Data Poisoning, text-classification | Code Available 3
FusionBench: A Comprehensive Benchmark of Deep Model Fusion | Jun 5, 2024 | image-classification, Image Classification | Code Available 3
Emu: Generative Pretraining in Multimodality | Jul 11, 2023 | Image Captioning, Image Generation | Code Available 3
ASFT: Aligned Supervised Fine-Tuning through Absolute Likelihood | Sep 14, 2024 | Instruction Following, Text Generation | Code Available 3
FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models | Jun 4, 2024 | Text Generation, Transfer Learning | Code Available 3
Evaluating Text-to-Visual Generation with Image-to-Text Generation | Apr 1, 2024 | Image to text, Question Answering | Code Available 3
Efficient Large Language Models: A Survey | Dec 6, 2023 | Natural Language Understanding, Survey | Code Available 3
TextBox 2.0: A Text Generation Library with Pre-trained Language Models | Dec 26, 2022 | Abstractive Text Summarization, Data-to-Text Generation | Code Available 3
The Diffusion Duality | Jun 12, 2025 | Text Generation | Code Available 3
Simple linear attention language models balance the recall-throughput tradeoff | Feb 28, 2024 | Language Modelling, Mamba | Code Available 3
A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness | Nov 4, 2024 | Question Answering, Text Generation | Code Available 3
SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models | Aug 19, 2024 | image-classification, Image Classification | Code Available 3
Scaling up Masked Diffusion Models on Text | Oct 24, 2024 | GSM8K, Language Modeling | Code Available 3
Diffusion-LM Improves Controllable Text Generation | May 27, 2022 | Language Modeling, Language Modelling | Code Available 3
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities | Aug 1, 2024 | Math, MM-Vet | Code Available 3
Co-Writing Screenplays and Theatre Scripts with Language Models: An Evaluation by Industry Professionals | Sep 29, 2022 | Text Generation | Code Available 3
NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference Checklist | May 15, 2023 | Controllable Language Modelling, Dialogue Generation | Code Available 3
Prefix-Tuning: Optimizing Continuous Prompts for Generation | Jan 1, 2021 | Language Modeling, Language Modelling | Code Available 3
ChatMusician: Understanding and Generating Music Intrinsically with LLM | Feb 25, 2024 | MMLU, Text Generation | Code Available 3
M+: Extending MemoryLLM with Scalable Long-Term Memory | Feb 1, 2025 | 16k, GPU | Code Available 3
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer | Dec 18, 2024 | Attribute, Text Generation | Code Available 3
LLM-Pruner: On the Structural Pruning of Large Language Models | May 19, 2023 | Text Generation, zero-shot-classification | Code Available 3
Bird-Eye Transformers for Text Generation Models | Oct 8, 2022 | Attribute, Inductive Bias | Code Available 3
CGCE: A Chinese Generative Chat Evaluation Benchmark for General and Financial Domains | May 23, 2023 | Text Generation | Code Available 3
Long-Context Autoregressive Video Modeling with Next-Frame Prediction | Mar 25, 2025 | Text Generation, Video Generation | Code Available 3
Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset | May 17, 2024 | 16k, Benchmarking | Code Available 3
Controllable Text Generation for Large Language Models: A Survey | Aug 22, 2024 | Attribute, Prompt Engineering | Code Available 3
HyperSteer: Activation Steering at Scale with Hypernetworks | Jun 3, 2025 | Dictionary Learning, Text Generation | Code Available 2
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models | Sep 24, 2024 | Long-Context Understanding, Text Generation | Code Available 2
AutoPatent: A Multi-Agent Framework for Automatic Patent Generation | Dec 13, 2024 | Text Generation | Code Available 2
Improving Factuality and Reasoning in Language Models through Multiagent Debate | May 23, 2023 | Few-Shot Learning, Language Modeling | Code Available 2
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding | Mar 1, 2024 | Hallucination, Object | Code Available 2
Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference | May 28, 2024 | GPU, Text Generation | Code Available 2
GPTScore: Evaluate as You Desire | Feb 8, 2023 | Text Generation | Code Available 2
GPT-NER: Named Entity Recognition via Large Language Models | Apr 20, 2023 | Hallucination, named-entity-recognition | Code Available 2
Grounding Language Models to Images for Multimodal Inputs and Outputs | Jan 31, 2023 | Image Retrieval, In-Context Learning | Code Available 2
Harmonizing Visual Text Comprehension and Generation | Jul 23, 2024 | multimodal generation, Reading Comprehension | Code Available 2
In-Context Editing: Learning Knowledge from Self-Induced Distributions | Jun 17, 2024 | Image Editing, In-Context Learning | Code Available 2
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models | Apr 1, 2024 | Graph Generation, Image to text | Code Available 2
Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale | Mar 13, 2024 | Constituency Grammar Induction, Language Modeling | Code Available 2
Authorship Obfuscation in Multilingual Machine-Generated Text Detection | Jan 15, 2024 | Adversarial Robustness, Benchmarking | Code Available 2
Few-Shot Text Generation with Pattern-Exploiting Training | Dec 22, 2020 | Headline Generation, text-classification | Code Available 2
A Touch, Vision, and Language Dataset for Multimodal Alignment | Feb 20, 2024 | Language Modeling, Language Modelling | Code Available 2
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training | Jun 2, 2023 | Language Modeling, Language Modelling | Code Available 2
Expressive Text-to-Image Generation with Rich Text | Apr 13, 2023 | Image Generation, Text Generation | Code Available 2
Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap | Jan 18, 2024 | Code Generation, Evolutionary Algorithms | Code Available 2
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation | May 23, 2023 | Form, Language Modelling | Code Available 2