| Rethinking Masked Language Modeling for Chinese Spelling Correction | May 28, 2023 | Diversity, Domain Generalization | Code Available | 1 |
| Matrix Information Theory for Self-Supervised Learning | May 27, 2023 | Contrastive Learning, GSM8K | Code Available | 1 |
| Query-Efficient Black-Box Red Teaming via Bayesian Optimization | May 27, 2023 | Bayesian Optimization, Language Modeling | Code Available | 1 |
| Improving Generalization in Language Model-Based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-Based Techniques | May 27, 2023 | Domain Generalization, Language Modeling | Code Available | 1 |
| Backpack Language Models | May 26, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Language Models Implement Simple Word2Vec-style Vector Arithmetic | May 25, 2023 | In-Context Learning, Language Modeling | Code Available | 1 |
| ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst | May 25, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Meta-Learning Online Adaptation of Language Models | May 24, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | May 24, 2023 | GPU, Language Modeling | Code Available | 1 |
| An Efficient Multilingual Language Model Compression through Vocabulary Trimming | May 24, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning | May 24, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| PIVOINE: Instruction Tuning for Open-world Information Extraction | May 24, 2023 | Instruction Following, Language Modeling | Code Available | 1 |
| Text-Augmented Open Knowledge Graph Completion via Pre-Trained Language Models | May 24, 2023 | Knowledge Graph Completion, Language Modeling | Code Available | 1 |
| FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| VisorGPT: Learning Visual Prior via Generative Pre-Training | May 23, 2023 | Image Generation, Language Modeling | Code Available | 1 |
| Automatic Model Selection with Large Language Models for Reasoning | May 23, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 1 |
| CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | May 23, 2023 | Decoder, Language Modeling | Code Available | 1 |
| Aligning Large Language Models through Synthetic Feedback | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings | May 23, 2023 | Community Detection, Contrastive Learning | Code Available | 1 |
| Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| A Frustratingly Simple Decoding Method for Neural Text Generation | May 22, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints | May 22, 2023 | Decoder, Language Modeling | Code Available | 1 |
| How Language Model Hallucinations Can Snowball | May 22, 2023 | Hallucination, Language Modeling | Code Available | 1 |
| Making Language Models Better Tool Learners with Execution Feedback | May 22, 2023 | Language Modeling, Language Modelling | Code Available | 1 |