| CodeGen2: Lessons for Training LLMs on Programming and Natural Languages | May 3, 2023 | Causal Language Modeling, Decoder | Code Available | 5 |
| AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model | Aug 2, 2022 | Causal Language Modeling, Common Sense Reasoning | Code Available | 2 |
| GPT or BERT: why not both? | Oct 31, 2024 | Causal Language Modeling, Language Modeling | Code Available | 2 |
| Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling | May 25, 2022 | Causal Language Modeling, Language Modeling | Code Available | 1 |
| GRITHopper: Decomposition-Free Multi-Hop Dense Retrieval | Mar 10, 2025 | Causal Language Modeling, Language Modeling | Code Available | 1 |
| Self-Supervised Learning of Brain Dynamics from Broad Neuroimaging Data | Jun 22, 2022 | Causal Language Modeling, Language Modeling | Code Available | 1 |
| Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning | Dec 2, 2023 | Causal Language Modeling, Contrastive Learning | Code Available | 1 |
| Interpretable Language Modeling via Induction-head Ngram Models | Oct 31, 2024 | Causal Language Modeling, Human fMRI response prediction | Code Available | 1 |
| Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles | Sep 16, 2024 | Causal Language Modeling, Language Modeling | Code Available | 1 |
| Video Pre-trained Transformer: A Multimodal Mixture of Pre-trained Experts | Mar 24, 2023 | Causal Language Modeling, Language Modeling | Code Available | 1 |