| Protoformer: Embedding Prototypes for Transformers | Jun 25, 2022 | ClassificationGeneral Classification | CodeCode Available | 1 |
| Using cognitive psychology to understand GPT-3 | Jun 21, 2022 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Know your audience: specializing grounded language models with listener subtraction | Jun 16, 2022 | Language ModellingLarge Language Model | —Unverified | 0 |
| Putting GPT-3's Creativity to the (Alternative Uses) Test | Jun 10, 2022 | ArticlesLanguage Modeling | CodeCode Available | 0 |
| Automatic Generation of Programming Exercises and Code Explanations using Large Language Models | Jun 3, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning | Jun 3, 2022 | Image Paragraph CaptioningLanguage Modeling | —Unverified | 0 |
| Happenstance: Utilizing Semantic Search to Track Russian State Media Narratives about the Russo-Ukrainian War On Reddit | May 28, 2022 | ArticlesFact Checking | —Unverified | 0 |
| Differentially Private Decoding in Large Language Models | May 26, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Housekeep: Tidying Virtual Households using Commonsense Reasoning | May 22, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RankGen: Improving Text Generation with Large Ranking Models | May 19, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning | May 6, 2022 | In-Context LearningLanguage Modelling | CodeCode Available | 1 |
| Combining Extraction and Generation for Constructing Belief-Consequence Causal Links | May 1, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis | Mar 25, 2022 | Code GenerationHumanEval | CodeCode Available | 6 |
| Extraction of Sleep Information from Clinical Notes of Patients with Alzheimer's Disease Using Natural Language Processing | Mar 8, 2022 | Language ModellingLarge Language Model | —Unverified | 0 |
| Fast-R2D2: A Pretrained Recursive Neural Network based on Pruned CKY for Grammar Induction and Text Representation | Mar 1, 2022 | Constituency Grammar InductionLanguage Modeling | CodeCode Available | 1 |
| Pop Quiz! Can a Large Language Model Help With Reverse Engineering? | Feb 2, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hardness Masking via Auto-Regressive Language Model | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Imagined versus Remembered Stories: Quantifying Differences in Narrative Flow | Jan 7, 2022 | Language ModellingLarge Language Model | —Unverified | 0 |
| MacBERTh: Development and Evaluation of a Historically Pre-trained Language Model for English (1450-1950) | Dec 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic | Nov 29, 2021 | Contrastive LearningDescriptive | CodeCode Available | 1 |
| Adaptive Testing and Debugging of NLP Models | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SynthBio: A Case Study in Human-AI Collaborative Curation of Text Datasets | Nov 11, 2021 | AttributeLanguage Modeling | —Unverified | 0 |
| The Klarna Product Page Dataset: Web Element Nomination with Graph Neural Networks and Large Language Models | Nov 3, 2021 | ClassificationLanguage Modelling | CodeCode Available | 1 |
| bert2BERT: Towards Reusable Pretrained Language Models | Oct 14, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts | Oct 4, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generate, Annotate, and Learn: Generative Models Advance Self-Training and Knowledge Distillation | Sep 29, 2021 | Few-Shot LearningKnowledge Distillation | —Unverified | 0 |
| Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning | Sep 13, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Jul 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss | Jun 20, 2021 | DiversityLanguage Modeling | CodeCode Available | 0 |
| Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis | Apr 26, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Transfer training from smaller language model | Apr 23, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Arabic Compact Language Modelling for Resource Limited Devices | Apr 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Globalizing BERT-based Transformer Architectures for Long Document Summarization | Apr 1, 2021 | ArticlesDocument Summarization | —Unverified | 0 |
| Story Centaur: Large Language Model Few Shot Learning as a Creative Writing Tool | Apr 1, 2021 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation | Jan 2, 2021 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 |
| Graphmax for Text Generation | Jan 1, 2021 | Language ModellingLarge Language Model | —Unverified | 0 |
| A review of on-device fully neural end-to-end automatic speech recognition algorithms | Dec 14, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Supervised Contrastive Learning for Pre-trained Language Model Fine-tuning | Nov 3, 2020 | Contrastive LearningData Augmentation | CodeCode Available | 1 |
| Plug-and-Play Conversational Models | Oct 9, 2020 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Plug-and-Play Conversational Models | Jul 23, 2020 | AttributeLanguage Modeling | —Unverified | 0 |
| Challenge Closed-book Science Exam: A Meta-learning Based Question Answering System | Apr 26, 2020 | AI2 Reasoning ChallengeARC | —Unverified | 0 |
| Explaining Relationships Between Scientific Documents | Feb 2, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Compressing Language Models using Doped Kronecker Products | Jan 24, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Paraphrasing with Large Language Models | Nov 21, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fast Transformer Decoding: One Write-Head is All You Need | Nov 6, 2019 | AllLanguage Modelling | CodeCode Available | 4 |
| Enhancing Clinical Concept Extraction with Contextual Embeddings | Feb 22, 2019 | Clinical Concept ExtractionLanguage Modelling | —Unverified | 0 |