| InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training | Jul 15, 2020 | Contrastive Learning, Cross-Lingual Transfer | Code Available | 1 | 5 |
| AttentionRank: Unsupervised Keyphrase Extraction using Self and Cross Attentions | Nov 1, 2021 | Keyphrase Extraction, Language Modeling | Code Available | 1 | 5 |
| DALE: Generative Data Augmentation for Low-Resource Legal NLP | Oct 24, 2023 | Data Augmentation, Decoder | Code Available | 1 | 5 |
| Declaration-based Prompt Tuning for Visual Question Answering | May 5, 2022 | Image-text matching, Language Modeling | Code Available | 1 | 5 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image Captioning, Image Description | Code Available | 1 | 5 |
| Generating Query Focused Summaries from Query-Free Resources | Dec 29, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations | Sep 1, 2021 | Emotion Classification, Language Modeling | Code Available | 1 | 5 |
| INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model | Jul 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| A Generalizable Approach to Learning Optimizers | Jun 2, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| ∞-former: Infinite Memory Transformer | Sep 1, 2021 | Dialogue Generation, Language Modeling | Code Available | 1 | 5 |
| Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection | Apr 10, 2023 | Action Detection, Language Modeling | Code Available | 1 | 5 |
| Multi-Task Learning for Front-End Text Processing in TTS | Jan 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata | Feb 11, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Critic-Guided Decoding for Controlled Text Generation | Dec 21, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Aligning Large Language Models through Synthetic Feedback | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization | Sep 10, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning | Oct 26, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Fly-Swat or Cannon? Cost-Effective Language Model Choice via Meta-Modeling | Aug 11, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Inductive Relation Prediction by BERT | Mar 12, 2021 | Few-Shot Learning, Inductive Learning | Code Available | 1 | 5 |
| Attention-based Contextual Language Model Adaptation for Speech Recognition | Jun 2, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Aligning Knowledge Concepts to Whole Slide Images for Precise Histopathology Image Analysis | Nov 27, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | May 5, 2025 | Diversity, Language Modeling | Code Available | 1 | 5 |
| Fluent dreaming for language models | Jan 24, 2024 | Adversarial Attack, Language Modeling | Code Available | 1 | 5 |
| CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model | Apr 28, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model | Sep 6, 2024 | Attribute, AutoML | Code Available | 1 | 5 |
| FocusLLM: Precise Understanding of Long Context by Dynamic Condensing | Aug 21, 2024 | 8k, Decoder | Code Available | 1 | 5 |
| A general-purpose material property data extraction pipeline from large polymer corpora using Natural Language Processing | Sep 27, 2022 | Articles, Language Modeling | Code Available | 1 | 5 |
| Bring Your Own Data! Self-Supervised Evaluation for Large Language Models | Jun 23, 2023 | Chatbot, Language Modeling | Code Available | 1 | 5 |
| Forcing Diffuse Distributions out of Language Models | Apr 16, 2024 | Dataset Generation, Diversity | Code Available | 1 | 5 |
| Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations | Oct 2, 2023 | In-Context Learning, Instruction Following | Code Available | 1 | 5 |
| CriticEval: Evaluating Large Language Model as Critic | Feb 21, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | Feb 24, 2024 | Descriptive, Language Modeling | Code Available | 1 | 5 |
| Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring | May 14, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control | Jul 12, 2024 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Incorporating External POS Tagger for Punctuation Restoration | Jun 12, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| A Generative Approach for Script Event Prediction via Contrastive Fine-tuning | Dec 7, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CREAM: Consistency Regularized Self-Rewarding Language Models | Oct 16, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Neural Abstractive Text Summarization with Sequence-to-Sequence Models | Dec 5, 2018 | Abstractive Text Summarization, Language Modeling | Code Available | 1 | 5 |
| From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection | Dec 13, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Building A Coding Assistant via the Retrieval-Augmented Language Model | Oct 21, 2024 | Code Completion, Code Generation | Code Available | 1 | 5 |
| Neural Language Correction with Character-Based Attention | Mar 31, 2016 | Decoder, Language Modeling | Code Available | 1 | 5 |
| FANformer: Improving Large Language Models Through Effective Periodicity Modeling | Feb 28, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| NeuralLog: Natural Language Inference with Joint Neural and Logical Reasoning | May 29, 2021 | Deep Learning, Language Modeling | Code Available | 1 | 5 |
| Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation | Oct 6, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| From Distillation to Hard Negative Sampling: Making Sparse Neural IR Models More Effective | May 10, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Incorporating Large Language Models into Production Systems for Enhanced Task Automation and Flexibility | Jul 11, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | May 9, 2024 | Data Visualization, Language Modeling | Code Available | 1 | 5 |