| Counterfactual Token Generation in Large Language Models | Sep 25, 2024 | Bias Detection, counterfactual | Code Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease Prediction, Language Modeling | Code Available | 1 | 5 |
| Just One Byte (per gradient): A Note on Low-Bandwidth Decentralized Language Model Finetuning Using Shared Randomness | Jun 16, 2023 | Distributed Optimization, Language Modeling | Code Available | 1 | 5 |
| JiuZhang: A Chinese Pre-trained Language Model for Mathematical Problem Understanding | Jun 13, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RL, Language Modeling | Code Available | 1 | 5 |
| JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset | Mar 26, 2024 | Dialogue State Tracking, Language Modeling | Code Available | 1 | 5 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| JobBERT: Understanding Job Titles through Skills | Sep 20, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactual, Data Augmentation | Code Available | 1 | 5 |
| A Kernel-Based View of Language Model Fine-Tuning | Oct 11, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze Test, Language Modeling | Code Available | 1 | 5 |
| ASR2K: Speech Recognition for Around 2000 Languages without Audio | Sep 6, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | Blocking, Language Modeling | Code Available | 1 | 5 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text Summarization, Language Modeling | Code Available | 1 | 5 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image Captioning, Image Description | Code Available | 1 | 5 |
| Joint Entity and Relation Extraction Based on Table Labeling Using Convolutional Neural Networks | May 1, 2022 | Joint Entity and Relation Extraction, Language Modeling | Code Available | 1 | 5 |
| Knowledge-Augmented Language Model Verification | Oct 19, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Jun 10, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing | Mar 4, 2023 | Diversity, Image Captioning | Code Available | 1 | 5 |
| IvyGPT: InteractiVe Chinese pathwaY language model in medical domain | Jul 20, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| ITER: Iterative Transformer-based Entity Recognition and Relation Extraction | Nov 11, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |
| Aspect-Controlled Neural Argument Generation | Apr 30, 2020 | Data Augmentation, Language Modeling | Code Available | 1 | 5 |
| IterVM: Iterative Vision Modeling Module for Scene Text Recognition | Apr 6, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models | Feb 20, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| A Bayesian Flow Network Framework for Chemistry Tasks | Jul 28, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |