Qwen2.5 Technical Report Dec 19, 2024 Common Sense Reasoning
Code Code Available 135 Semantic Routing for Enhanced Performance of LLM-Assisted Intent-Based 5G Core Network Management and Orchestration Apr 24, 2024 Management Prompt Engineering
Code Code Available 75 DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference Jan 9, 2024 Benchmarking Text Generation
Code Code Available 75 OmniGen: Unified Image Generation Sep 17, 2024 Edge Detection Image Generation
Code Code Available 75 Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering Jan 16, 2024 Code Generation Prompt Engineering
Code Code Available 75 Flow Matching Guide and Code Dec 9, 2024 Text Generation
Code Code Available 75 Chameleon: Mixed-Modal Early-Fusion Foundation Models May 16, 2024 Image Captioning Image Generation
Code Code Available 75 OmniGen2: Exploration to Advanced Multimodal Generation Jun 23, 2025 Image Generation multimodal generation
Code Code Available 75 Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Mar 12, 2025 Question Answering RAG
Code Code Available 75 Qwen2.5-Omni Technical Report Mar 26, 2025 Automatic Speech Recognition (ASR) GSM8K
Code Code Available 75 DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers Mar 15, 2024 Text Generation Video Generation
Code Code Available 75 Flow-GRPO: Training Flow Matching Models via Online RL May 8, 2025 Denoising Diversity
Code Code Available 75 Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond Apr 26, 2023 Language Modelling Natural Language Understanding
Code Code Available 65 ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages Dec 13, 2022 Code Summarization Language Modeling
Code Code Available 65 Efficient Guided Generation for Large Language Models Jul 19, 2023 Language Modelling Text Generation
Code Code Available 65 h2oGPT: Democratizing Large Language Models Jun 13, 2023 Chatbot Fairness
Code Code Available 65 ImageBind-LLM: Multi-modality Instruction Tuning Sep 7, 2023 Instruction Following Text Generation
Code Code Available 55 How to Design Translation Prompts for ChatGPT: An Empirical Study Apr 5, 2023 Machine Translation Natural Language Understanding
Code Code Available 55 LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning Jun 23, 2025 Reinforcement Learning (RL) Text Generation
Code Code Available 55 Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG Jan 15, 2025 Natural Language Understanding RAG
Code Code Available 55 FlowTok: Flowing Seamlessly Across Text and Image Tokens Mar 13, 2025 Denoising Image to text
Code Code Available 55 Factuality Enhanced Language Models for Open-Ended Text Generation Jun 9, 2022 Misconceptions Sentence
Code Code Available 55 MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model Sep 4, 2024 Language Modeling Language Modelling
Code Code Available 55 WebLINX: Real-World Website Navigation with Multi-Turn Dialogue Feb 8, 2024 Conversational Web Navigation Text Generation
Code Code Available 55 Assessing Language Model Deployment with Risk Cards Mar 31, 2023 Language Modeling Language Modelling
Code Code Available 55 TRUE: Re-evaluating Factual Consistency Evaluation Apr 11, 2022 Question Generation Question-Generation
Code Code Available 45 The All-Seeing Project V2: Towards General Relation Comprehension of the Open World Feb 29, 2024 All Hallucination
Code Code Available 45 FinBen: A Holistic Financial Benchmark for Large Language Models Feb 20, 2024 Question Answering RAG
Code Code Available 45 Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference Mar 8, 2023 Hyperparameter Optimization Language Modeling
Code Code Available 45 AlignScore: Evaluating Factual Consistency with a Unified Alignment Function May 26, 2023 Fact Verification Information Retrieval
Code Code Available 45 Locally Typical Sampling Feb 1, 2022 Abstractive Text Summarization Story Generation
Code Code Available 45 Knowledge Fusion of Large Language Models Jan 19, 2024 Code Generation Common Sense Reasoning
Code Code Available 45 Leveraging Speculative Sampling and KV-Cache Optimizations Together for Generative AI using OpenVINO Nov 8, 2023 Quantization Text Generation
Code Code Available 45 Cube: A Roblox View of 3D Intelligence Mar 19, 2025 Scene Generation Text Generation
Code Code Available 45 ChatHaruhi: Reviving Anime Character in Reality via Large Language Model Aug 18, 2023 Language Modeling Language Modelling
Code Code Available 45 LISA: Reasoning Segmentation via Large Language Model Aug 1, 2023 Language Modeling Language Modelling
Code Code Available 45 Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs Jun 14, 2024 Language Modeling Language Modelling
Code Code Available 45 Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks May 22, 2020 Fact Verification Question Answering
Code Code Available 45 BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining Oct 19, 2022 Document Classification Language Modelling
Code Code Available 45 ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing Sep 17, 2023 Model Selection Prompt Engineering
Code Code Available 45 One-Shot Diffusion Mimicker for Handwritten Text Generation Sep 6, 2024 Handwriting generation Text Generation
Code Code Available 45 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation Feb 4, 2025 Benchmarking Information Retrieval
Code Code Available 45 BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models Jan 30, 2023 Generative Visual Question Answering Image Captioning
Code Code Available 45 Efficient Post-training Quantization with FP8 Formats Sep 26, 2023 image-classification Image Classification
Code Code Available 45 DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation Oct 14, 2022 Natural Language Understanding Text Generation
Code Code Available 45 AnyText: Multilingual Visual Text Generation And Editing Nov 6, 2023 Image Generation Optical Character Recognition (OCR)
Code Code Available 45 ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation Jul 8, 2024 multimodal generation Text Generation
Code Code Available 45 Natural Language Generation Feb 20, 2025 Text Generation
Code Code Available 45 One Embedder, Any Task: Instruction-Finetuned Text Embeddings Dec 19, 2022 Information Retrieval Learning Word Embeddings
Code Code Available 45 SEED-Story: Multimodal Long Story Generation with Large Language Model Jul 11, 2024 Image Generation Language Modeling
Code Code Available 45