| MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property | Feb 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding | Jun 3, 2021 | Conversational Response SelectionLanguage Modeling | CodeCode Available | 1 | 5 |
| LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling | Jun 14, 2022 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| Length Generalization of Causal Transformers without Position Encoding | Apr 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| M-RewardBench: Evaluating Reward Models in Multilingual Settings | Oct 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| mRNA2vec: mRNA Embedding with Language Model in the 5'UTR-CDS for mRNA Design | Aug 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LLaRA: Large Language-Recommendation Assistant | Dec 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation | Dec 23, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining | Oct 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Epidemic Modeling with Generative Agents | Jul 11, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| A Tensorized Transformer for Language Modeling | Jun 24, 2019 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Large language models are good medical coders, if provided with tools | Jul 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identificationLanguage Modeling | CodeCode Available | 1 | 5 |
| Establishing baselines for generative discovery of inorganic crystals | Jan 4, 2025 | Band GapLanguage Modeling | CodeCode Available | 1 | 5 |
| Espresso: A Fast End-to-end Neural Speech Recognition Toolkit | Sep 18, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| COMET: Learning Cardinality Constrained Mixture of Experts with Trees and Local Search | Jun 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Multi-Level Knowledge Distillation for Out-of-Distribution Detection in Text | Nov 21, 2022 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 | 5 |
| Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation | Sep 19, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 | 5 |
| CommitBERT: Commit Message Generation Using Pre-Trained Programming Language Model | May 29, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| CommitBERT: Commit Message Generation Using Pre-Trained Programming Language Model | Aug 1, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 | 5 |
| Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning | Jan 27, 2023 | Few-Shot LearningGSM8K | CodeCode Available | 1 | 5 |
| Causal Distillation for Language Models | Dec 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |