An Embarrassingly Simple Method to Mitigate Undesirable Properties of Pretrained Language Model Tokenizers May 1, 2022 Language Modeling Language Modelling
Code Code Available 15 DeepInception: Hypnotize Large Language Model to Be Jailbreaker Nov 6, 2023 Language Modeling Language Modelling
Code Code Available 15 Mitigating Fine-tuning based Jailbreak Attack with Backdoor Enhanced Safety Alignment Feb 22, 2024 Backdoor Attack Language Modelling
Code Code Available 15 BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models Apr 5, 2024 Factual probe General Knowledge
Code Code Available 15 XG-NID: Dual-Modality Network Intrusion Detection using a Heterogeneous Graph Neural Network and Large Language Model Aug 27, 2024 Graph Neural Network Intrusion Detection
Code Code Available 15 Measuring Implicit Bias in Explicitly Unbiased Large Language Models Feb 6, 2024 Decision Making Diagnostic
Code Code Available 15 A Cheaper and Better Diffusion Language Model with Soft-Masked Noise Apr 10, 2023 Denoising Image Generation
Code Code Available 15 MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems Oct 17, 2024 Answer Generation Language Modeling
Code Code Available 15 DeeperImpact: Optimizing Sparse Learned Index Structures May 27, 2024 Language Modeling Language Modelling
Code Code Available 15 XLM-K: Improving Cross-Lingual Language Model Pre-training with Multilingual Knowledge Sep 26, 2021 Language Modeling Language Modelling
Code Code Available 15 Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding Jan 3, 2025 Hallucination Language Modeling
Code Code Available 15 Mixture of Attention Heads: Selecting Attention Heads Per Token Oct 11, 2022 Computational Efficiency Language Modeling
Code Code Available 15 An Efficient Self-Supervised Cross-View Training For Sentence Embedding Nov 6, 2023 Contrastive Learning Language Modeling
Code Code Available 15 MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection Dec 20, 2024 Cancer Classification Chatbot
Code Code Available 15 End-to-End Beam Retrieval for Multi-Hop Question Answering Aug 17, 2023 Language Modelling Large Language Model
Code Code Available 15 An Efficient Multilingual Language Model Compression through Vocabulary Trimming May 24, 2023 Language Modeling Language Modelling
Code Code Available 15 BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models Sep 23, 2023 Code Completion Hallucination
Code Code Available 15 Deep contextualized word representations Feb 15, 2018 Citation Intent Classification Conversational Response Selection
Code Code Available 15 Decrypting Cryptic Crosswords: Semantically Complex Wordplay Puzzles as a Target for NLP Apr 17, 2021 Language Modelling
Code Code Available 15 Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection Dec 15, 2022 Deep Learning Graph Learning
Code Code Available 15 Decoding-Time Language Model Alignment with Multiple Objectives Jun 27, 2024 Language Modeling Language Modelling
Code Code Available 15 Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity Apr 22, 2024 GPU Language Modeling
Code Code Available 15 Decoding Speculative Decoding Feb 2, 2024 Language Modeling Language Modelling
Code Code Available 15 Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences Jan 19, 2024 Language Modeling Language Modelling
Code Code Available 15 Decoupled Visual Interpretation and Linguistic Reasoning for Math Problem Solving May 23, 2025 Language Modeling Language Modelling
Code Code Available 15 DUMA: Reading Comprehension with Transposition Thinking Jan 26, 2020 Language Modeling Language Modelling
Code Code Available 15 Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models Jul 24, 2024 ARC Inductive Bias
Code Code Available 15 Decouple knowledge from parameters for plug-and-play language modeling May 19, 2023 Domain Adaptation Language Modeling
Code Code Available 15 BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla May 23, 2022 Conditional Text Generation Dialogue Generation
Code Code Available 15 Debiasing the Cloze Task in Sequential Recommendation with Bidirectional Transformers Jan 22, 2023 Language Modeling Language Modelling
Code Code Available 15 Deep Equilibrium Models Sep 3, 2019 Language Modeling Language Modelling
Code Code Available 15 DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation Mar 30, 2024 Dataset Distillation In-Context Learning
Code Code Available 15 Memory-Based Model Editing at Scale Jun 13, 2022 counterfactual Dialogue Generation
Code Code Available 15 MetaICL: Learning to Learn In Context Oct 29, 2021 Few-Shot Learning In-Context Learning
Code Code Available 15 Modelling Suspense in Short Stories as Uncertainty Reduction over Neural Representation Apr 30, 2020 Language Modeling Language Modelling
Code Code Available 15 Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs Jul 5, 2024 General Knowledge Instruction Following
Code Code Available 15 Language Model Alignment in Multilingual Trolley Problems Jul 2, 2024 Decision Making Ethics
Code Code Available 15 ZC3: Zero-Shot Cross-Language Code Clone Detection Aug 26, 2023 Clone Detection Language Modelling
Code Code Available 15 On Faithfulness and Factuality in Abstractive Summarization May 2, 2020 Abstractive Text Summarization Document Summarization
Code Code Available 15 Merging Feed-Forward Sublayers for Compressed Transformers Jan 10, 2025 image-classification Image Classification
Code Code Available 15 Pre-Trained Language Models Augmented with Synthetic Scanpaths for Natural Language Understanding Oct 23, 2023 Language Modeling Language Modelling
Code Code Available 15 Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers Nov 13, 2024 Language Modeling Language Modelling
Code Code Available 15 An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification Sep 5, 2024 Data Augmentation Diversity
Code Code Available 05 Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting Feb 19, 2024 Language Modeling Language Modelling
Code Code Available 05 MILL: Mutual Verification with Large Language Models for Zero-Shot Query Expansion Oct 29, 2023 Information Retrieval Language Modelling
Code Code Available 05 Bayesian Neural Network Language Modeling for Speech Recognition Aug 28, 2022 Data Augmentation Language Modeling
Code Code Available 05 An EcoSage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant Jan 10, 2024 Dialogue Generation Language Modelling
Code Code Available 05 MIMO: A Medical Vision Language Model with Visual Referring Multimodal Input and Pixel Grounding Multimodal Output Jan 1, 2025 Instruction Following Language Modeling
Code Code Available 05 Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM May 16, 2025 Language Modeling Language Modelling
Code Code Available 05 Logit Separability-Driven Samples and Multiple Class-Related Words Selection for Advancing In-Context Learning Jun 16, 2024 In-Context Learning Language Modeling
Code Code Available 05