| Block-Recurrent Transformers | Mar 11, 2022 | Language Modeling | Code Available | 2 |
| LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models | Mar 4, 2022 | Decoder, GPU | Code Available | 2 |
| Contextual Semantic Embeddings for Ontology Subsumption Prediction | Feb 20, 2022 | Knowledge Graph Embeddings, Language Modeling | Code Available | 2 |
| Online Decision Transformer | Feb 11, 2022 | D4RL, Efficient Exploration | Code Available | 2 |
| ProteinBERT: a universal deep-learning model of protein sequence and function | Feb 10, 2022 | Language Modeling | Code Available | 2 |
| TimeLMs: Diachronic Language Models from Twitter | Feb 8, 2022 | Continual Learning, Language Modeling | Code Available | 2 |
| Cedille: A large autoregressive French language model | Feb 7, 2022 | Few-Shot Learning, Language Modeling | Code Available | 2 |
| Pre-Trained Language Models for Interactive Decision-Making | Feb 3, 2022 | Decision Making, Imitation Learning | Code Available | 2 |
| Formal Mathematics Statement Curriculum Learning | Feb 3, 2022 | Automated Theorem Proving, Language Modeling | Code Available | 2 |
| Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval | Jan 28, 2022 | Language Modeling | Code Available | 2 |