| CDLM: Cross-Document Language Modeling | Jan 2, 2021 | Citation RecommendationCoreference Resolution | CodeCode Available | 1 | 5 |
| Improved Hierarchical Patient Classification with Language Model Pretraining over Clinical Notes | Sep 6, 2019 | General ClassificationLanguage Modeling | CodeCode Available | 1 | 5 |
| Improving Aspect Sentiment Quad Prediction via Template-Order Data Augmentation | Oct 19, 2022 | Aspect-Based Sentiment Analysis (ABSA)Data Augmentation | CodeCode Available | 1 | 5 |
| Improving Language Understanding by Generative Pre-Training | Jun 11, 2018 | Cloze TestDocument Classification | CodeCode Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training | Sep 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Implicit Language Models are RNNs: Balancing Parallelization and Expressivity | Feb 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease PredictionLanguage Modeling | CodeCode Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze TestLanguage Modeling | CodeCode Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 | 5 |
| BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation | Jun 19, 2024 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 | 5 |
| SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks | Mar 31, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Bilinear MLPs enable weight-based mechanistic interpretability | Oct 10, 2024 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Epidemic Modeling with Generative Agents | Jul 11, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts | Dec 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Jul 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LLMSTEP: LLM proofstep suggestions in Lean | Oct 27, 2023 | CPUGPU | CodeCode Available | 1 | 5 |
| Implementing contextual biasing in GPU decoder for online ASR | Jun 23, 2023 | CPUDecoder | CodeCode Available | 1 | 5 |
| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactualData Augmentation | CodeCode Available | 1 | 5 |
| Entity Tracking in Language Models | May 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement | Feb 9, 2024 | Code GenerationDecision Making | CodeCode Available | 1 | 5 |
| Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation | Apr 5, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| ImaginaryNet: Learning Object Detectors without Real Images and Annotations | Oct 13, 2022 | Image GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| AuditWen:An Open-Source Large Language Model for Audit | Oct 9, 2024 | Answer GenerationLanguage Modeling | CodeCode Available | 1 | 5 |