| Improving Contrastive Learning of Sentence Embeddings with Case-Augmented Positives and Retrieved Negatives | Jun 6, 2022 | AttributeContrastive Learning | CodeCode Available | 1 | 5 |
| Improving Conversational Recommendation Systems' Quality with Context-Aware Item Meta Information | Dec 15, 2021 | Conversational RecommendationLanguage Modeling | CodeCode Available | 1 | 5 |
| MarianCG: a code generation transformer model inspired by machine translation | Nov 22, 2022 | Code GenerationCode Translation | CodeCode Available | 1 | 5 |
| Mark My Words: Analyzing and Evaluating Language Model Watermarks | Dec 1, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Improving End-to-End SLU performance with Prosodic Attention and Distillation | May 14, 2023 | intent-classificationIntent Classification | CodeCode Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| An Interpretable Ensemble of Graph and Language Models for Improving Search Relevance in E-Commerce | Mar 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models | Jun 18, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering | Jul 22, 2023 | Graph Representation LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| ExpertQA: Expert-Curated Questions and Attributed Answers | Sep 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze TestLanguage Modeling | CodeCode Available | 1 | 5 |
| Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation | Jul 26, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| Explaining Datasets in Words: Statistical Models with Natural Language Parameters | Sep 13, 2024 | ClusteringLanguage Modeling | CodeCode Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 | 5 |
| Improving Biomedical Pretrained Language Models with Knowledge | Apr 21, 2021 | Entity LinkingLanguage Modeling | CodeCode Available | 1 | 5 |
| Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Jul 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| "Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation | Sep 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| AudioBERT: Audio Knowledge Augmented Language Model | Sep 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Blank Language Models | Feb 8, 2020 | Ancient Text RestorationLanguage Modeling | CodeCode Available | 1 | 5 |
| Adaptive Attention Span in Transformers | May 19, 2019 | 8kLanguage Modeling | CodeCode Available | 1 | 5 |
| Improved training of end-to-end attention models for speech recognition | May 8, 2018 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference | Jan 21, 2020 | Few-Shot Text ClassificationGeneral Classification | CodeCode Available | 1 | 5 |
| Improving antibody language models with native pairing | Aug 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease PredictionLanguage Modeling | CodeCode Available | 1 | 5 |