| Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change | Oct 31, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Critic-Guided Decoding for Controlled Text Generation | Dec 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering | Jul 22, 2023 | Graph Representation LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| CriticEval: Evaluating Large Language Model as Critic | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | May 9, 2024 | Data VisualizationLanguage Modeling | CodeCode Available | 1 | 5 |
| CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | May 5, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| An Interpretable Ensemble of Graph and Language Models for Improving Search Relevance in E-Commerce | Mar 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models | Jun 18, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation | Jul 26, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| Improving Passage Retrieval with Zero-Shot Question Generation | Apr 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Translation | Mar 18, 2021 | Bilingual Lexicon InductionLanguage Modeling | CodeCode Available | 1 | 5 |
| Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference | Jan 21, 2020 | Few-Shot Text ClassificationGeneral Classification | CodeCode Available | 1 | 5 |
| CREAM: Consistency Regularized Self-Rewarding Language Models | Oct 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| MemCap: Memorizing Style Knowledge for Image Captioning | Apr 3, 2020 | Image CaptioningLanguage Modeling | CodeCode Available | 1 | 5 |
| "Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation | Sep 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Improving Multi-Party Dialogue Discourse Parsing via Domain Integration | Oct 9, 2021 | Discourse ParsingDomain Adaptation | CodeCode Available | 1 | 5 |
| MemeSem:A Multi-modal Framework for Sentimental Analysis of Meme via Transfer Learning | Jun 12, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| BLADE: Benchmarking Language Model Agents for Data-Driven Science | Aug 19, 2024 | BenchmarkingDecision Making | CodeCode Available | 1 | 5 |
| AudioBERT: Audio Knowledge Augmented Language Model | Sep 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Blank Language Models | Feb 8, 2020 | Ancient Text RestorationLanguage Modeling | CodeCode Available | 1 | 5 |
| Adaptive Attention Span in Transformers | May 19, 2019 | 8kLanguage Modeling | CodeCode Available | 1 | 5 |
| Improving Mandarin Speech Recogntion with Block-augmented Transformer | Jul 24, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Improving NER's Performance with Massive financial corpus | Jul 31, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Merging Feed-Forward Sublayers for Compressed Transformers | Jan 10, 2025 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Exploring Large Language Model for Graph Data Understanding in Online Job Recommendations | Jul 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Dec 2, 2024 | cross-modal alignmentKnowledge Distillation | CodeCode Available | 1 | 5 |
| Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators | Mar 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Exploring Quantization for Efficient Pre-Training of Transformer Language Models | Jul 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model | Apr 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Exploring Stochastic Autoregressive Image Modeling for Visual Representation | Dec 3, 2022 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| Exploring the Limits of Language Modeling | Feb 7, 2016 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CDLM: Cross-Document Language Modeling | Jan 2, 2021 | Citation RecommendationCoreference Resolution | CodeCode Available | 1 | 5 |
| Improving Neural Machine Translation Models with Monolingual Data | Nov 20, 2015 | Cross-Lingual Bitext MiningDecoder | CodeCode Available | 1 | 5 |
| Improving Transformer Optimization Through Better Initialization | Jan 1, 2020 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze TestLanguage Modeling | CodeCode Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 1 | 5 |
| Adaptive Attention Span in Computer Vision | Apr 18, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease PredictionLanguage Modeling | CodeCode Available | 1 | 5 |
| Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit | Aug 19, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| RARR: Researching and Revising What Language Models Say, Using Language Models | Oct 17, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models | Feb 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models | Jul 31, 2024 | Dictionary LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding | Mar 27, 2025 | FormLanguage Modeling | CodeCode Available | 1 | 5 |
| BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision | Jun 28, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Extensive Self-Contrast Enables Feedback-Free Language Model Alignment | Mar 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Improving Generalization in Language Model-Based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-Based Techniques | May 27, 2023 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 | 5 |