| Describing image focused in cognitive and visual details for visually impaired people: An approach to generating inclusive paragraphs | Feb 10, 2022 | Dense CaptioningImage Captioning | —Unverified | 0 |
| Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations | Feb 9, 2022 | ClusteringLanguage Modeling | CodeCode Available | 1 |
| Using a Language Model in a Kiosk Recommender System at Fast-Food Restaurants | Feb 8, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TimeLMs: Diachronic Language Models from Twitter | Feb 8, 2022 | Continual LearningLanguage Modeling | CodeCode Available | 2 |
| Differentiable N-gram Objective on Abstractive Summarization | Feb 8, 2022 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 0 |
| HistBERT: A Pre-trained Language Model for Diachronic Lexical Semantic Analysis | Feb 8, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Cedille: A large autoregressive French language model | Feb 7, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 2 |
| OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework | Feb 7, 2022 | Image Captioningimage-classification | CodeCode Available | 0 |
| Self-Supervised Representation Learning for Speech Using Visual Grounding and Masked Language Modeling | Feb 7, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Prompt-Guided Injection of Conformation to Pre-trained Protein Model | Feb 7, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Data Scaling Laws in NMT: The Effect of Noise and Architecture | Feb 4, 2022 | DecoderLanguage Modeling | —Unverified | 0 |
| From Discrimination to Generation: Knowledge Graph Completion with Generative Transformer | Feb 4, 2022 | Knowledge Graph CompletionLanguage Modeling | CodeCode Available | 0 |
| Formal Mathematics Statement Curriculum Learning | Feb 3, 2022 | Automated Theorem ProvingLanguage Modeling | CodeCode Available | 2 |
| mSLAM: Massively multilingual joint pre-training for speech and text | Feb 3, 2022 | cross-modal alignmentintent-classification | —Unverified | 0 |
| Pre-Trained Language Models for Interactive Decision-Making | Feb 3, 2022 | Decision MakingImitation Learning | CodeCode Available | 2 |
| GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records | Feb 2, 2022 | Clinical Concept ExtractionLanguage Modeling | —Unverified | 0 |
| What Has Been Enhanced in my Knowledge-Enhanced Language Model? | Feb 2, 2022 | Graph AttentionLanguage Modeling | CodeCode Available | 1 |
| Pop Quiz! Can a Large Language Model Help With Reverse Engineering? | Feb 2, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unified Scaling Laws for Routed Language Models | Feb 2, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Regression Transformer: Concurrent sequence regression and generation for molecular language modeling | Feb 1, 2022 | Conditional Text GenerationInductive Bias | CodeCode Available | 1 |
| BEA-Base: A Benchmark for ASR of Spontaneous Hungarian | Feb 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Examining Scaling and Transfer of Language Model Architectures for Machine Translation | Feb 1, 2022 | DecoderLanguage Modeling | —Unverified | 0 |
| Disaster Tweets Classification using BERT-Based Language Model | Jan 31, 2022 | ClassificationLanguage Modeling | —Unverified | 0 |
| Does Transliteration Help Multilingual Language Modeling? | Jan 29, 2022 | DiversityLanguage Modeling | CodeCode Available | 0 |
| MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage Learning | Jan 29, 2022 | Image-text matchingLanguage Modeling | CodeCode Available | 1 |
| Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval | Jan 28, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Jan 28, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 3 |
| Multiple-Source Domain Adaptation via Coordinated Domain Encoders and Paired Classifiers | Jan 28, 2022 | Cross-Domain Text ClassificationDomain Adaptation | CodeCode Available | 0 |
| Neural-FST Class Language Model for End-to-End Speech Recognition | Jan 28, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | Jan 28, 2022 | Common Sense ReasoningGSM8K | CodeCode Available | 6 |
| Impact of representation matching with neural machine translation | Jan 26, 2022 | DecoderLanguage Modeling | CodeCode Available | 0 |
| FiNCAT: Financial Numeral Claim Analysis Tool | Jan 26, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR | Jan 26, 2022 | DecoderLanguage Modeling | —Unverified | 0 |
| A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language Model | Jan 26, 2022 | Grammatical Error CorrectionLanguage Modeling | CodeCode Available | 0 |
| On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR | Jan 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Synchromesh: Reliable code generation from pre-trained language models | Jan 26, 2022 | Code GenerationLanguage Modeling | CodeCode Available | 2 |
| Neural Grapheme-to-Phoneme Conversion with Pre-trained Grapheme Models | Jan 26, 2022 | Grapheme-to-Phoneme ConversionLanguage Modeling | CodeCode Available | 1 |
| Multimodal data matters: language model pre-training over structured and unstructured electronic health records | Jan 25, 2022 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection | Jan 25, 2022 | ArticlesLanguage Modeling | —Unverified | 0 |
| BERTHA: Video Captioning Evaluation Via Transfer-Learned Human Assessment | Jan 25, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Relational Memory Augmented Language Models | Jan 24, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Large and Diverse Arabic Corpus for Language Modeling | Jan 23, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An Application of Pseudo-Log-Likelihoods to Natural Language Scoring | Jan 23, 2022 | Common Sense ReasoningGPU | —Unverified | 0 |
| Chinese Word Segmentation with Heterogeneous Graph Neural Network | Jan 22, 2022 | Chinese Word SegmentationGraph Neural Network | —Unverified | 0 |
| A Comparative Study on Language Models for Task-Oriented Dialogue Systems | Jan 21, 2022 | Dialogue State TrackingHallucination | CodeCode Available | 0 |
| Nearest Class-Center Simplification through Intermediate Layers | Jan 21, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Text Style Transfer for Bias Mitigation using Masked Language Modeling | Jan 21, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AstBERT: Enabling Language Model for Financial Code Understanding with Abstract Syntax Trees | Jan 20, 2022 | Clone DetectionCode Search | —Unverified | 0 |
| LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training | Jan 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| TourBERT: A pretrained language model for the tourism industry | Jan 19, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |