| A Measure-Theoretic Characterization of Tight Language Models | Dec 20, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Parameter-efficient Zero-shot Transfer for Cross-Language Dense Retrieval with Adapters | Dec 20, 2022 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization | Dec 20, 2022 | Dialogue GenerationLanguage Modeling | CodeCode Available | 2 |
| Dissecting Transformer Length Extrapolation via the Lens of Receptive Field Analysis | Dec 20, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Precise Zero-Shot Dense Retrieval without Relevance Labels | Dec 20, 2022 | Fact VerificationInstruction Following | CodeCode Available | 2 |
| Toward Human-Like Evaluation for Natural Language Generation with Error Analysis | Dec 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PLUE: Language Understanding Evaluation Benchmark for Privacy Policies in English | Dec 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Optimizing Prompts for Text-to-Image Generation | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Natural Language to Code Generation in Interactive Data Science Notebooks | Dec 19, 2022 | Code GenerationDiversity | —Unverified | 0 |
| Visconde: Multi-document QA with GPT-3 and Neural Reranking | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Mu^2SLAM: Multitask, Multilingual Speech and Language Models | Dec 19, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MANTIS at TSAR-2022 Shared Task: Improved Unsupervised Lexical Simplification with Pretrained Encoders | Dec 19, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improved Long-Form Spoken Language Translation with Large Language Models | Dec 19, 2022 | FormLanguage Modeling | —Unverified | 0 |
| Python Code Generation by Asking Clarification Questions | Dec 19, 2022 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| Evaluating Human-Language Model Interaction | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Explanation Regeneration via Information Bottleneck | Dec 19, 2022 | Explanation GenerationLanguage Modeling | CodeCode Available | 0 |
| Emergent Analogical Reasoning in Large Language Models | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Discovering Language Model Behaviors with Model-Written Evaluations | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning | Dec 19, 2022 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Very Large Language Model as a Unified Methodology of Text Mining | Dec 19, 2022 | ClusteringLanguage Modeling | CodeCode Available | 0 |
| Reasoning with Language Model Prompting: A Survey | Dec 19, 2022 | Arithmetic ReasoningCommon Sense Reasoning | CodeCode Available | 3 |
| Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering | Dec 19, 2022 | Chart Question AnsweringData Summarization | —Unverified | 0 |
| Language model acceptability judgements are not always robust to context | Dec 18, 2022 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model | Dec 18, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale | Dec 18, 2022 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation | Dec 17, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Claim Optimization in Computational Argumentation | Dec 17, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| POIBERT: A Transformer-based Model for the Tour Recommendation Problem | Dec 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ALERT: Adapting Language Models to Reasoning Tasks | Dec 16, 2022 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| LegalRelectra: Mixed-domain Language Modeling for Long-range Legal Text Comprehension | Dec 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Investigation of Japanese PnG BERT language model in text-to-speech synthesis for pitch accent language | Dec 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation | Dec 16, 2022 | Answer GenerationDecoder | CodeCode Available | 1 |
| Improving Chess Commentaries by Combining Language Models with Symbolic Reasoning Engines | Dec 15, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking | Dec 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference | Dec 15, 2022 | DecoderLanguage Modeling | —Unverified | 0 |
| Attention as a Guide for Simultaneous Speech Translation | Dec 15, 2022 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Joint processing of linguistic properties in brains and language models | Dec 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Efficient Long Sequence Modeling via State Space Augmented Transformer | Dec 15, 2022 | Computational EfficiencyDecoder | CodeCode Available | 1 |
| The Effects of In-domain Corpus Size on pre-training BERT | Dec 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning | Dec 15, 2022 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling | Dec 14, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cross-Modal Similarity-Based Curriculum Learning for Image Captioning | Dec 14, 2022 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| The Challenges of HTR Model Training: Feedback from the Project Donner le gout de l'archive a l'ere numerique | Dec 13, 2022 | Handwriting RecognitionHandwritten Text Recognition | —Unverified | 0 |
| Technical Report -- Competition Solution for Prompt Tuning using Pretrained Language Model | Dec 13, 2022 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| Deep Image Style Transfer from Freeform Text | Dec 13, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages | Dec 13, 2022 | Code SummarizationLanguage Modeling | CodeCode Available | 6 |
| Do Text-to-Text Multi-Task Learners Suffer from Task Conflict? | Dec 13, 2022 | DecoderLanguage Modeling | CodeCode Available | 0 |
| CNO-LSTM: A Chaotic Neural Oscillatory Long Short-Term Memory Model for Text Classification | Dec 12, 2022 | ClassificationGPU | —Unverified | 0 |
| Prompting Is Programming: A Query Language for Large Language Models | Dec 12, 2022 | Code GenerationLanguage Modeling | CodeCode Available | 3 |