| InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation | Dec 2, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CREAM: Consistency Regularized Self-Rewarding Language Models | Oct 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings | Oct 8, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models | Oct 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| InforMask: Unsupervised Informative Masking for Language Model Pretraining | Oct 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Algorithmic progress in language models | Mar 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| A Tensorized Transformer for Language Modeling | Jun 24, 2019 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CriticEval: Evaluating Large Language Model as Critic | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model | Jul 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |