| Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval | Jan 28, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Jan 28, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 3 |
| Multiple-Source Domain Adaptation via Coordinated Domain Encoders and Paired Classifiers | Jan 28, 2022 | Cross-Domain Text ClassificationDomain Adaptation | CodeCode Available | 0 |
| Neural-FST Class Language Model for End-to-End Speech Recognition | Jan 28, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | Jan 28, 2022 | Common Sense ReasoningGSM8K | CodeCode Available | 6 |
| Impact of representation matching with neural machine translation | Jan 26, 2022 | DecoderLanguage Modeling | CodeCode Available | 0 |
| FiNCAT: Financial Numeral Claim Analysis Tool | Jan 26, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR | Jan 26, 2022 | DecoderLanguage Modeling | —Unverified | 0 |
| A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language Model | Jan 26, 2022 | Grammatical Error CorrectionLanguage Modeling | CodeCode Available | 0 |
| On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR | Jan 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Synchromesh: Reliable code generation from pre-trained language models | Jan 26, 2022 | Code GenerationLanguage Modeling | CodeCode Available | 2 |
| Neural Grapheme-to-Phoneme Conversion with Pre-trained Grapheme Models | Jan 26, 2022 | Grapheme-to-Phoneme ConversionLanguage Modeling | CodeCode Available | 1 |
| Multimodal data matters: language model pre-training over structured and unstructured electronic health records | Jan 25, 2022 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection | Jan 25, 2022 | ArticlesLanguage Modeling | —Unverified | 0 |
| BERTHA: Video Captioning Evaluation Via Transfer-Learned Human Assessment | Jan 25, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Relational Memory Augmented Language Models | Jan 24, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Large and Diverse Arabic Corpus for Language Modeling | Jan 23, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An Application of Pseudo-Log-Likelihoods to Natural Language Scoring | Jan 23, 2022 | Common Sense ReasoningGPU | —Unverified | 0 |
| Chinese Word Segmentation with Heterogeneous Graph Neural Network | Jan 22, 2022 | Chinese Word SegmentationGraph Neural Network | —Unverified | 0 |
| A Comparative Study on Language Models for Task-Oriented Dialogue Systems | Jan 21, 2022 | Dialogue State TrackingHallucination | CodeCode Available | 0 |
| Nearest Class-Center Simplification through Intermediate Layers | Jan 21, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Text Style Transfer for Bias Mitigation using Masked Language Modeling | Jan 21, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AstBERT: Enabling Language Model for Financial Code Understanding with Abstract Syntax Trees | Jan 20, 2022 | Clone DetectionCode Search | —Unverified | 0 |
| LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training | Jan 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| TourBERT: A pretrained language model for the tourism industry | Jan 19, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |