| Merging Text Transformer Models from Different Initializations | Mar 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Resonance RoPE: Improving Context Length Generalization of Large Language Models | Feb 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings | Feb 29, 2024 | Conditional Text GenerationDecoder | CodeCode Available | 1 |
| Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CogBench: a large language model walks into a psychology lab | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model | Feb 28, 2024 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| Grounding Language Models for Visual Entity Recognition | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection | Feb 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long Documents | Feb 27, 2024 | Document ClassificationLanguage Modeling | CodeCode Available | 1 |
| A Language Model based Framework for New Concept Placement in Ontologies | Feb 27, 2024 | Contrastive LearningEntity Linking | CodeCode Available | 1 |