Counterfactual Token Generation in Large Language Models Sep 25, 2024 Bias Detection counterfactual
Code Code Available 15 CREAM: Consistency Regularized Self-Rewarding Language Models Oct 16, 2024 Language Modeling Language Modelling
Code Code Available 15 RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling May 14, 2021 Dialogue Generation Language Modeling
Code Code Available 15 Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning May 12, 2025 Language Modeling Language Modelling
Code Code Available 15 JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model Apr 3, 2025 Language Modeling Language Modelling
Code Code Available 15 Counterfactual Data Augmentation for Neural Machine Translation Jun 1, 2021 counterfactual Data Augmentation
Code Code Available 15 Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE Feb 10, 2025 Diversity Language Modeling
Code Code Available 15 Cost-effective Instruction Learning for Pathology Vision and Language Analysis Jul 25, 2024 Few-Shot Learning Language Modelling
Code Code Available 15 A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19 Jun 19, 2020 Chatbot Language Modeling
Code Code Available 15 Discovering Autoregressive Orderings with Variational Inference Jan 1, 2021 Code Generation Image Captioning
Code Code Available 15 Borrowing Knowledge From Pre-trained Language Model: A New Data-efficient Visual Learning Paradigm Jan 1, 2023 Domain Generalization Few-Shot Learning
Code Code Available 15 JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata Feb 11, 2025 Language Modeling Language Modelling
Code Code Available 15 IvyGPT: InteractiVe Chinese pathwaY language model in medical domain Jul 20, 2023 Language Modeling Language Modelling
Code Code Available 15 cosFormer: Rethinking Softmax in Attention Feb 17, 2022 D4RL Language Modeling
Code Code Available 15 CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference Jun 25, 2024 Language Modeling Language Modelling
Code Code Available 15 CoS: Enhancing Personalization and Mitigating Bias with Context Steering May 2, 2024 Bayesian Inference Language Modelling
Code Code Available 15 Iterative Few-shot Semantic Segmentation from Image Label Text Mar 10, 2023 Few-Shot Semantic Segmentation Language Modeling
Code Code Available 15 CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models Feb 20, 2025 Blocking Language Modeling
Code Code Available 15 Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction May 4, 2020 Language Modeling Language Modelling
Code Code Available 15 Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning Jul 20, 2024 Contrastive Learning Diagnostic
Code Code Available 15 AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies Aug 13, 2024 Language Modelling Mixture-of-Experts
Code Code Available 15 Latin BERT: A Contextual Language Model for Classical Philology Sep 21, 2020 Language Modeling Language Modelling
Code Code Available 15 ITER: Iterative Transformer-based Entity Recognition and Relation Extraction Nov 11, 2024 GPU Language Modeling
Code Code Available 15 Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning Dec 2, 2023 Causal Language Modeling Contrastive Learning
Code Code Available 15 Dissecting Human and LLM Preferences Feb 17, 2024 Language Modelling Large Language Model
Code Code Available 15 DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling Mar 2, 2024 Language Modelling Large Language Model
Code Code Available 15 Copy Suppression: Comprehensively Understanding an Attention Head Oct 6, 2023 Language Modelling
Code Code Available 15 Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model May 1, 2024 Knowledge Distillation Language Modeling
Code Code Available 15 DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter Oct 2, 2019 Hate Speech Detection Knowledge Distillation
Code Code Available 15 Distillation and Refinement of Reasoning in Small Language Models for Document Re-ranking Apr 4, 2025 Document Ranking Information Retrieval
Code Code Available 15 Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling Oct 22, 2022 Abstractive Text Summarization Language Modeling
Code Code Available 15 Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections Nov 17, 2023 Language Modelling Large Language Model
Code Code Available 15 ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic Feb 20, 2024 ArabicMMLU Language Model Evaluation
Code Code Available 15 Distilling a Pretrained Language Model to a Multilingual ASR Model Jun 25, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 IterVM: Iterative Vision Modeling Module for Scene Text Recognition Apr 6, 2022 Language Modeling Language Modelling
Code Code Available 15 Creative Agents: Empowering Agents with Imagination for Creative Tasks Dec 5, 2023 Instruction Following Language Modelling
Code Code Available 15 Distilling Linguistic Context for Language Model Compression Sep 17, 2021 Knowledge Distillation Language Modeling
Code Code Available 15 Distilling Large Vision-Language Model with Out-of-Distribution Generalizability Jul 6, 2023 Few-Shot Image Classification Image Classification
Code Code Available 15 CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation Jul 9, 2024 Language Modeling Language Modelling
Code Code Available 15 DistilProtBert: A distilled protein language model used to distinguish between real proteins and their randomly shuffled counterparts May 10, 2022 Dimensionality Reduction Knowledge Distillation
Code Code Available 15 Arabisc: Context-Sensitive Neural Spelling Checker Dec 1, 2020 Language Modelling Sentence
Code Code Available 15 Diversified in-domain synthesis with efficient fine-tuning for few-shot classification Dec 5, 2023 Diversity Few-Shot Image Classification
Code Code Available 15 Condenser: a Pre-training Architecture for Dense Retrieval Apr 16, 2021 Language Modelling Retrieval
Code Code Available 15 DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints May 29, 2024 Diversity Language Modeling
Code Code Available 15 Learning Approximate Inference Networks for Structured Prediction Mar 9, 2018 Language Modeling Language Modelling
Code Code Available 15 Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models Jun 10, 2021 Language Modeling Language Modelling
Code Code Available 15 Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning Oct 10, 2024 Language Modelling Large Language Model
Code Code Available 15 DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects Oct 3, 2024 Benchmarking Imitation Learning
Code Code Available 15 AraELECTRA: Pre-Training Text Discriminators for Arabic Language Understanding Dec 31, 2020 Language Modeling Language Modelling
Code Code Available 15 ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing Mar 4, 2023 Diversity Image Captioning
Code Code Available 15