| Rethinking Masked Language Modeling for Chinese Spelling Correction | May 28, 2023 | DiversityDomain Generalization | CodeCode Available | 1 |
| Matrix Information Theory for Self-Supervised Learning | May 27, 2023 | Contrastive LearningGSM8K | CodeCode Available | 1 |
| Query-Efficient Black-Box Red Teaming via Bayesian Optimization | May 27, 2023 | Bayesian OptimizationLanguage Modeling | CodeCode Available | 1 |
| Improving Generalization in Language Model-Based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-Based Techniques | May 27, 2023 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 |
| Backpack Language Models | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst | May 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Language Models Implement Simple Word2Vec-style Vector Arithmetic | May 25, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| An Efficient Multilingual Language Model Compression through Vocabulary Trimming | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | May 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| Meta-Learning Online Adaptation of Language Models | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |