RWKV: Reinventing RNNs for the Transformer Era May 22, 2023 Computational Efficiency Natural Language Inference
Code Code Available 6LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale Aug 15, 2022 GPU Language Modelling
Code Code Available 5Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation Feb 28, 2024 Attribute Extractive Question-Answering
Code Code Available 4AlignScore: Evaluating Factual Consistency with a Unified Alignment Function May 26, 2023 Fact Verification Information Retrieval
Code Code Available 4TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models May 18, 2023 Natural Language Inference Synthetic Data Generation
Code Code Available 4Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective Oct 16, 2022 Coreference Resolution Multiple-choice
Code Code Available 4N-Grammer: Augmenting Transformers with latent n-grams Jul 13, 2022 Common Sense Reasoning Coreference Resolution
Code Code Available 4ST-MoE: Designing Stable and Transferable Sparse Expert Models Feb 17, 2022 ARC Common Sense Reasoning
Code Code Available 3Finetuned Language Models Are Zero-Shot Learners Sep 3, 2021 ARC Common Sense Reasoning
Code Code Available 3Language Models are Few-Shot Learners May 28, 2020 answerability prediction Articles
Code Code Available 3ERNIE 2.0: A Continual Pre-training Framework for Language Understanding Jul 29, 2019 Chinese Named Entity Recognition Chinese Reading Comprehension
Code Code Available 3Pre-Training with Whole Word Masking for Chinese BERT Jun 19, 2019 Document Classification General Classification
Code Code Available 3ERNIE: Enhanced Representation through Knowledge Integration Apr 19, 2019 Chinese Named Entity Recognition Chinese Sentence Pair Classification
Code Code Available 3BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Oct 11, 2018 Citation Intent Classification Common Sense Reasoning
Code Code Available 3Scientific QA System with Verifiable Answers Jul 16, 2024 Articles Information Retrieval
Code Code Available 2ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models Jul 5, 2024 Hallucination Long Form Question Answering
Code Code Available 2Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale Mar 13, 2024 Constituency Grammar Induction Language Modeling
Code Code Available 2PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain Oct 22, 2023 Dialogue Generation Dialogue Understanding
Code Code Available 2ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers Sep 28, 2023 GPU Instruction Following
Code Code Available 2BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks May 26, 2023 Image Captioning Medical Visual Question Answering
Code Code Available 2The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning May 23, 2023 Common Sense Reasoning Common Sense Reasoning (Zero-Shot)
Code Code Available 2LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions Apr 27, 2023 Common Sense Reasoning Coreference Resolution
Code Code Available 2Hungry Hungry Hippos: Towards Language Modeling with State Space Models Dec 28, 2022 8k Coreference Resolution
Code Code Available 2Ask Me Anything: A simple strategy for prompting language models Oct 5, 2022 Coreference Resolution Natural Language Inference
Code Code Available 2AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model Aug 2, 2022 Causal Language Modeling Common Sense Reasoning
Code Code Available 2mGPT: Few-Shot Learners Go Multilingual Apr 15, 2022 Cross-Lingual Natural Language Inference Cross-Lingual Paraphrase Identification
Code Code Available 2PaLM: Scaling Language Modeling with Pathways Apr 5, 2022 Auto Debugging Code Generation
Code Code Available 2DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing Nov 18, 2021 Language Modeling Language Modelling
Code Code Available 2Order Constraints in Optimal Transport Oct 14, 2021 Natural Language Inference
Code Code Available 2SimCSE: Simple Contrastive Learning of Sentence Embeddings Apr 18, 2021 Contrastive Learning Data Augmentation
Code Code Available 2I-BERT: Integer-only BERT Quantization Jan 5, 2021 GPU Natural Language Inference
Code Code Available 2DeBERTa: Decoding-enhanced BERT with Disentangled Attention Jun 5, 2020 Common Sense Reasoning Coreference Resolution
Code Code Available 2Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Oct 23, 2019 Answer Generation Common Sense Reasoning
Code Code Available 2ALBERT: A Lite BERT for Self-supervised Learning of Language Representations Sep 26, 2019 Common Sense Reasoning GPU
Code Code Available 2Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach Aug 31, 2019 Articles Benchmarking
Code Code Available 2Learning to Reason via Mixture-of-Thought for Logical Reasoning May 21, 2025 Logical Reasoning Natural Language Inference
Code Code Available 1FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data Jan 28, 2025 Natural Language Inference Synthetic Data Generation
Code Code Available 1Defeasible Visual Entailment: Benchmark, Evaluator, and Reward-Driven Optimization Dec 19, 2024 Contrastive Learning Decision Making
Code Code Available 1GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph Knowledge Sep 26, 2024 Natural Language Inference Sentiment Analysis
Code Code Available 1Enhancing adversarial robustness in Natural Language Inference using explanations Sep 11, 2024 Adversarial Robustness Natural Language Inference
Code Code Available 1Quantifying and Optimizing Global Faithfulness in Persona-driven Role-playing May 13, 2024 Natural Language Inference
Code Code Available 1Don't Say No: Jailbreaking LLM by Suppressing Refusal Apr 25, 2024 Natural Language Inference Safety Alignment
Code Code Available 1FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction Mar 4, 2024 Articles Natural Language Inference
Code Code Available 1Citation-Enhanced Generation for LLM-based Chatbots Feb 25, 2024 Chatbot Citation Prediction
Code Code Available 1Pixel Sentence Representation Learning Feb 13, 2024 Natural Language Inference Representation Learning
Code Code Available 1MT-Ranker: Reference-free machine translation evaluation by inter-system ranking Jan 30, 2024 Machine Translation Natural Language Inference
Code Code Available 1Are self-explanations from Large Language Models faithful? Jan 15, 2024 counterfactual Faithfulness Critic
Code Code Available 1Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue Jan 9, 2024 Model Editing Natural Language Inference
Code Code Available 1Building Efficient Universal Classifiers with Natural Language Inference Dec 29, 2023 Classification Natural Language Inference
Code Code Available 1Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models Nov 13, 2023 Document-level Relation Extraction In-Context Learning
Code Code Available 1