| Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale | Dec 18, 2022 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation | Dec 17, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Claim Optimization in Computational Argumentation | Dec 17, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| POIBERT: A Transformer-based Model for the Tour Recommendation Problem | Dec 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ALERT: Adapting Language Models to Reasoning Tasks | Dec 16, 2022 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| LegalRelectra: Mixed-domain Language Modeling for Long-range Legal Text Comprehension | Dec 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Investigation of Japanese PnG BERT language model in text-to-speech synthesis for pitch accent language | Dec 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation | Dec 16, 2022 | Answer GenerationDecoder | CodeCode Available | 1 |
| Improving Chess Commentaries by Combining Language Models with Symbolic Reasoning Engines | Dec 15, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking | Dec 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference | Dec 15, 2022 | DecoderLanguage Modeling | —Unverified | 0 |
| Attention as a Guide for Simultaneous Speech Translation | Dec 15, 2022 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Joint processing of linguistic properties in brains and language models | Dec 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Efficient Long Sequence Modeling via State Space Augmented Transformer | Dec 15, 2022 | Computational EfficiencyDecoder | CodeCode Available | 1 |
| The Effects of In-domain Corpus Size on pre-training BERT | Dec 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning | Dec 15, 2022 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling | Dec 14, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cross-Modal Similarity-Based Curriculum Learning for Image Captioning | Dec 14, 2022 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| The Challenges of HTR Model Training: Feedback from the Project Donner le gout de l'archive a l'ere numerique | Dec 13, 2022 | Handwriting RecognitionHandwritten Text Recognition | —Unverified | 0 |
| Technical Report -- Competition Solution for Prompt Tuning using Pretrained Language Model | Dec 13, 2022 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| Deep Image Style Transfer from Freeform Text | Dec 13, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages | Dec 13, 2022 | Code SummarizationLanguage Modeling | CodeCode Available | 6 |
| Do Text-to-Text Multi-Task Learners Suffer from Task Conflict? | Dec 13, 2022 | DecoderLanguage Modeling | CodeCode Available | 0 |
| CNO-LSTM: A Chaotic Neural Oscillatory Long Short-Term Memory Model for Text Classification | Dec 12, 2022 | ClassificationGPU | —Unverified | 0 |
| Prompting Is Programming: A Query Language for Large Language Models | Dec 12, 2022 | Code GenerationLanguage Modeling | CodeCode Available | 3 |