| DATScore: Evaluating Translation with Data Augmented Translations | Oct 12, 2022 | Data Augmentation, Language Modeling | Unverified | 0 |
| Foundation Transformers | Oct 12, 2022 | Language Modeling | Code Available | 1 |
| Context Generation Improves Open Domain Question Answering | Oct 12, 2022 | Language Modeling | Unverified | 0 |
| AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-Tuning | Oct 12, 2022 | Language Modeling | Code Available | 1 |
| A context-aware knowledge transferring strategy for CTC-based ASR | Oct 12, 2022 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| MedJEx: A Medical Jargon Extraction Model with Wiki's Hyperlink Span and Contextualized Masked Language Model Score | Oct 12, 2022 | Articles, Language Modeling | Code Available | 0 |
| Zero-Shot Prompting for Implicit Intent Prediction and Recommendation with Commonsense Reasoning | Oct 12, 2022 | Language Modeling | Code Available | 0 |
| Predictive Querying for Autoregressive Neural Sequence Models | Oct 12, 2022 | Language Modeling | Code Available | 0 |
| Designing Robust Transformers using Robust Kernel Density Estimation | Oct 11, 2022 | Density Estimation, image-classification | Unverified | 0 |
| Cross-Lingual Speaker Identification Using Distant Supervision | Oct 11, 2022 | Language Modeling | Code Available | 0 |
| Decoupled Context Processing for Context Augmented Language Modeling | Oct 11, 2022 | Decoder, Language Modeling | Unverified | 0 |
| Mixture of Attention Heads: Selecting Attention Heads Per Token | Oct 11, 2022 | Computational Efficiency, Language Modeling | Code Available | 1 |
| Retrieval Augmentation for T5 Re-ranker using External Sources | Oct 11, 2022 | Language Modeling | Unverified | 0 |
| Revisiting and Advancing Chinese Natural Language Understanding with Accelerated Heterogeneous Knowledge Pre-training | Oct 11, 2022 | GPU, Knowledge Graphs | Unverified | 0 |
| Mind's Eye: Grounded Language Model Reasoning through Simulation | Oct 11, 2022 | Language Modeling | Unverified | 0 |
| Word Sense Induction with Hierarchical Clustering and Mutual Information Maximization | Oct 11, 2022 | Clustering, Language Modeling | Unverified | 0 |
| Understanding the Failure of Batch Normalization for Transformers in NLP | Oct 11, 2022 | Image Classification | Code Available | 1 |
| MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model | Oct 11, 2022 | Contrastive Learning, Image-text matching | Code Available | 1 |
| Multilingual BERT has an accent: Evaluating English influences on fluency in multilingual models | Oct 11, 2022 | Language Modeling | Unverified | 0 |
| Like a bilingual baby: The advantage of visually grounding a bilingual language model | Oct 11, 2022 | Language Modeling | Unverified | 0 |
| A Kernel-Based View of Language Model Fine-Tuning | Oct 11, 2022 | Language Modeling | Code Available | 1 |
| Instance Regularization for Discriminative Language Model Pre-training | Oct 11, 2022 | Denoising, Language Modeling | Code Available | 0 |
| CORE: A Retrieve-then-Edit Framework for Counterfactual Data Generation | Oct 10, 2022 | counterfactual, Data Augmentation | Code Available | 0 |
| Leveraging Key Information Modeling to Improve Less-Data Constrained News Headline Generation via Duality Fine-Tuning | Oct 10, 2022 | Decoder, Headline Generation | Unverified | 0 |
| Do Children Texts Hold The Key To Commonsense Knowledge? | Oct 10, 2022 | Language Modeling | Unverified | 0 |
| Bridging CLIP and StyleGAN through Latent Alignment for Image Editing | Oct 10, 2022 | Image Generation, Image Manipulation | Unverified | 0 |
| Scaling Up Probabilistic Circuits by Latent Variable Distillation | Oct 10, 2022 | Language Modeling | Code Available | 0 |
| QAScore -- An Unsupervised Unreferenced Metric for the Question Generation Evaluation | Oct 9, 2022 | Language Modeling | Unverified | 0 |
| Better Pre-Training by Reducing Representation Confusion | Oct 9, 2022 | Language Modeling | Unverified | 0 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language Modeling | Code Available | 1 |
| Controllable Dialogue Simulation with In-Context Learning | Oct 9, 2022 | Data Augmentation, In-Context Learning | Code Available | 1 |
| InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings | Oct 8, 2022 | Contrastive Learning, Language Modeling | Code Available | 1 |
| AlphaTuning: Quantization-Aware Parameter-Efficient Adaptation of Large-Scale Pre-Trained Language Models | Oct 8, 2022 | Language Modeling | Unverified | 0 |
| Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal Modeling | Oct 8, 2022 | Language Modeling | Code Available | 1 |
| Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts | Oct 7, 2022 | Articles, Language Modeling | Code Available | 2 |
| Novice Type Error Diagnosis with Natural Language Models | Oct 7, 2022 | Language Modeling | Unverified | 0 |
| Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding | Oct 7, 2022 | Chart Question Answering, Diversity | Code Available | 2 |
| PQLM -- Multilingual Decentralized Portable Quantum Language Model for Privacy Protection | Oct 6, 2022 | Language Modeling | Unverified | 0 |
| Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models | Oct 6, 2022 | Language Modeling | Unverified | 0 |
| Improving Large-scale Paraphrase Acquisition and Generation | Oct 6, 2022 | Language Modeling | Unverified | 0 |
| Improving the Sample Efficiency of Prompt Tuning with Domain Adaptation | Oct 6, 2022 | Domain Adaptation, Language Modeling | Code Available | 0 |
| Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners | Oct 6, 2022 | Common Sense Reasoning, Coreference Resolution | Code Available | 1 |
| Conversational Semantic Role Labeling with Predicate-Oriented Latent Graph | Oct 6, 2022 | Language Modeling | Unverified | 0 |
| Honest Students from Untrusted Teachers: Learning an Interpretable Question-Answering Pipeline from a Pretrained Language Model | Oct 5, 2022 | In-Context Learning, Language Modeling | Unverified | 0 |
| GLM-130B: An Open Bilingual Pre-trained Model | Oct 5, 2022 | Language Modeling | Code Available | 6 |
| CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations | Oct 5, 2022 | Automatic Speech Recognition (ASR), Clustering | Code Available | 1 |
| Bayesian Prompt Learning for Image-Language Model Generalization | Oct 5, 2022 | Few-Shot Learning, Language Modeling | Code Available | 1 |
| Towards Improving Faithfulness in Abstractive Summarization | Oct 4, 2022 | Abstractive Text Summarization, Decoder | Code Available | 1 |
| The Surprising Computational Power of Nondeterministic Stack RNNs | Oct 4, 2022 | Language Modeling | Code Available | 1 |
| Less is More: Task-aware Layer-wise Distillation for Language Model Compression | Oct 4, 2022 | Language Modeling | Code Available | 1 |