A Tensorized Transformer for Language Modeling Jun 24, 2019 Decoder Language Modeling
Code Code Available 1EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling Oct 7, 2023 Language Modeling Language Modelling
Code Code Available 1Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation Mar 20, 2022 Knowledge Distillation Language Modelling
Code Code Available 1Emergence of Social Norms in Generative Agent Societies: Principles and Architecture Mar 13, 2024 Language Modelling Large Language Model
Code Code Available 1ConfliBERT: A Language Model for Political Conflict Dec 19, 2024 Language Modeling Language Modelling
Code Code Available 1OPI@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text using RoBERTa Pre-trained Language Models May 1, 2022 Depression Detection Language Modeling
Code Code Available 1AESOP: Paraphrase Generation with Adaptive Syntactic Control Nov 1, 2021 Data Augmentation Language Modeling
Code Code Available 1Connecting Language and Vision for Natural Language-Based Vehicle Retrieval May 31, 2021 Language Modelling Management
Code Code Available 1Emergent Analogical Reasoning in Large Language Models Dec 19, 2022 Language Modeling Language Modelling
Code Code Available 1EmojiLM: Modeling the New Emoji Language Nov 3, 2023 Language Modeling Language Modelling
Code Code Available 1Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators Jun 4, 2021 Language Modeling Language Modelling
Code Code Available 1Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications Feb 5, 2025 In-Context Learning Language Modeling
Code Code Available 1Evolving Deep Neural Networks Mar 1, 2017 Deep Learning Image Captioning
Code Code Available 1ELI5: Long Form Question Answering Jul 22, 2019 Form Language Modeling
Code Code Available 1Elephants Never Forget: Testing Language Models for Memorization of Tabular Data Mar 11, 2024 Language Modelling Memorization
Code Code Available 1Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt Verbalizer Jan 14, 2022 Classification Contrastive Learning
Code Code Available 1OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition Nov 30, 2023 Descriptive Language Modelling
Code Code Available 1Ouroboros: On Accelerating Training of Transformer-Based Language Models Sep 14, 2019 Language Modeling Language Modelling
Code Code Available 1ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation Oct 24, 2022 Language Modeling Language Modelling
Code Code Available 1Outline to Story: Fine-grained Controllable Story Generation from Cascaded Events Jan 4, 2021 Keyword Extraction Language Modeling
Code Code Available 1ELECTRAMed: a new pre-trained language representation model for biomedical NLP Apr 19, 2021 Drug–drug Interaction Extraction Language Modeling
Code Code Available 1Elastic Weight Removal for Faithful and Abstractive Dialogue Generation Mar 30, 2023 Dialogue Generation Language Modelling
Code Code Available 1ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators Mar 23, 2020 GPU Language Modeling
Code Code Available 1Efficient recurrent architectures through activity sparsity and sparse back-propagation through time Jun 13, 2022 Gesture Recognition Language Modeling
Code Code Available 1Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models Apr 9, 2024 Few-Shot Learning Language Modelling
Code Code Available 1Augmenting Interpretable Models with LLMs during Training Sep 23, 2022 Additive models Language Modelling
Code Code Available 1PaLI-3 Vision Language Models: Smaller, Faster, Stronger Oct 13, 2023 Chart Question Answering Cross-Modal Retrieval
Code Code Available 1EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning Oct 14, 2022 Caption Generation Knowledge Distillation
Code Code Available 1Efficient Online Data Mixing For Language Model Pre-Training Dec 5, 2023 Language Modeling Language Modelling
Code Code Available 1Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking Dec 15, 2022 Language Modeling Language Modelling
Code Code Available 1Atla Selene Mini: A General Purpose Evaluation Model Jan 27, 2025 Language Modeling Language Modelling
Code Code Available 1Efficient Nearest Neighbor Language Models Sep 9, 2021 Domain Adaptation Language Modeling
Code Code Available 1Efficient Neural Architecture Search via Parameter Sharing Feb 9, 2018 GPU Language Modelling
Code Code Available 1Human Sentence Processing: Recurrence or Attention? May 19, 2020 Language Modelling Retrieval
Code Code Available 1Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition Oct 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks Mar 31, 2022 Language Modeling Language Modelling
Code Code Available 1Efficiently Modeling Long Sequences with Structured State Spaces Oct 31, 2021 Data Augmentation Language Modeling
Code Code Available 1Efficient OCR for Building a Diverse Digital History Apr 5, 2023 Diversity Image Retrieval
Code Code Available 1Parameter-Efficient Fine-Tuning of State Space Models Oct 11, 2024 Language Modeling Language Modelling
Code Code Available 1Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained Language Models Mar 2, 2022 Language Modeling Language Modelling
Code Code Available 1EGFI: Drug-Drug Interaction Extraction and Generation with Fusion of Enriched Entity and Sentence Information Jan 25, 2021 Classification Drug–drug Interaction Extraction
Code Code Available 1BiasEdit: Debiasing Stereotyped Language Models via Model Editing Mar 11, 2025 counterfactual Language Modeling
Code Code Available 1Parsing as Pretraining Feb 5, 2020 Dependency Parsing Language Modeling
Code Code Available 1ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization Nov 22, 2023 GPU Language Modelling
Code Code Available 1Efficient Hierarchical Domain Adaptation for Pretrained Language Models Dec 16, 2021 Domain Adaptation Language Modeling
Code Code Available 1Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought Mar 8, 2024 Language Modeling Language Modelling
Code Code Available 1Efficient Content-Based Sparse Attention with Routing Transformers Mar 12, 2020 Image Generation Language Modeling
Code Code Available 1Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs Jul 31, 2024 Hallucination Image Comprehension
Code Code Available 1Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation Apr 4, 2025 Clustering Hallucination
Code Code Available 1Effect of Pre-Training Scale on Intra- and Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images May 31, 2021 Few-Shot Learning Image Classification
Code Code Available 1