| DATScore: Evaluating Translation with Data Augmented Translations | Oct 12, 2022 | Data Augmentation, Language Modeling | Unverified | 0 |
| Foundation Transformers | Oct 12, 2022 | Language Modeling | Code Available | 1 |
| Context Generation Improves Open Domain Question Answering | Oct 12, 2022 | Language Modeling | Unverified | 0 |
| AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-Tuning | Oct 12, 2022 | Language Modeling | Code Available | 1 |
| A context-aware knowledge transferring strategy for CTC-based ASR | Oct 12, 2022 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| MedJEx: A Medical Jargon Extraction Model with Wiki's Hyperlink Span and Contextualized Masked Language Model Score | Oct 12, 2022 | Articles, Language Modeling | Code Available | 0 |
| Zero-Shot Prompting for Implicit Intent Prediction and Recommendation with Commonsense Reasoning | Oct 12, 2022 | Language Modeling | Code Available | 0 |
| Predictive Querying for Autoregressive Neural Sequence Models | Oct 12, 2022 | Language Modeling | Code Available | 0 |
| Designing Robust Transformers using Robust Kernel Density Estimation | Oct 11, 2022 | Density Estimation, Image Classification | Unverified | 0 |
| Cross-Lingual Speaker Identification Using Distant Supervision | Oct 11, 2022 | Language Modeling | Code Available | 0 |
| Decoupled Context Processing for Context Augmented Language Modeling | Oct 11, 2022 | Decoder, Language Modeling | Unverified | 0 |
| Mixture of Attention Heads: Selecting Attention Heads Per Token | Oct 11, 2022 | Computational Efficiency, Language Modeling | Code Available | 1 |
| Retrieval Augmentation for T5 Re-ranker using External Sources | Oct 11, 2022 | Language Modeling | Unverified | 0 |
| Revisiting and Advancing Chinese Natural Language Understanding with Accelerated Heterogeneous Knowledge Pre-training | Oct 11, 2022 | GPU, Knowledge Graphs | Unverified | 0 |
| Mind's Eye: Grounded Language Model Reasoning through Simulation | Oct 11, 2022 | Language Modeling | Unverified | 0 |
| Word Sense Induction with Hierarchical Clustering and Mutual Information Maximization | Oct 11, 2022 | Clustering, Language Modeling | Unverified | 0 |
| Understanding the Failure of Batch Normalization for Transformers in NLP | Oct 11, 2022 | Image Classification | Code Available | 1 |
| MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model | Oct 11, 2022 | Contrastive Learning, Image-Text Matching | Code Available | 1 |
| Multilingual BERT has an accent: Evaluating English influences on fluency in multilingual models | Oct 11, 2022 | Language Modeling | Unverified | 0 |
| Like a bilingual baby: The advantage of visually grounding a bilingual language model | Oct 11, 2022 | Language Modeling | Unverified | 0 |
| A Kernel-Based View of Language Model Fine-Tuning | Oct 11, 2022 | Language Modeling | Code Available | 1 |
| Instance Regularization for Discriminative Language Model Pre-training | Oct 11, 2022 | Denoising, Language Modeling | Code Available | 0 |
| CORE: A Retrieve-then-Edit Framework for Counterfactual Data Generation | Oct 10, 2022 | Counterfactual, Data Augmentation | Code Available | 0 |
| Leveraging Key Information Modeling to Improve Less-Data Constrained News Headline Generation via Duality Fine-Tuning | Oct 10, 2022 | Decoder, Headline Generation | Unverified | 0 |
| Do Children Texts Hold The Key To Commonsense Knowledge? | Oct 10, 2022 | Language Modeling | Unverified | 0 |