| CTRL: A Conditional Transformer Language Model for Controllable Generation | Sep 11, 2019 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations | Sep 1, 2021 | Emotion Classification, Language Modeling | Code Available | 1 | 5 |
| A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models | Oct 16, 2021 | Image Captioning, Language Modeling | Code Available | 1 | 5 |
| FinVis-GPT: A Multimodal Large Language Model for Financial Chart Analysis | Jul 31, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation | Apr 2, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| FIRE: Fact-checking with Iterative Retrieval and Verification | Oct 17, 2024 | Claim Verification, Fact Checking | Code Available | 1 | 5 |
| LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering | Jun 7, 2024 | Graph Question Answering, Language Modeling | Code Available | 1 | 5 |
| CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies | Apr 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images | Oct 22, 2023 | Diagnostic, Language Modeling | Code Available | 1 | 5 |
| CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model | Apr 9, 2023 | Cross-Part Crowd Counting, Crowd Counting | Code Available | 1 | 5 |
| AuditWen:An Open-Source Large Language Model for Audit | Oct 9, 2024 | Answer Generation, Language Modeling | Code Available | 1 | 5 |
| Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering | May 2, 2020 | Knowledge Graphs, Language Modeling | Code Available | 1 | 5 |
| 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Jan 14, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CrowdVLM-R1: Expanding R1 Ability to Vision Language Model for Crowd Counting using Fuzzy Group Relative Policy Reward | Mar 31, 2025 | Crowd Counting, Language Modeling | Code Available | 1 | 5 |
| Data Augmentation using Pre-trained Transformer Models | Mar 4, 2020 | Data Augmentation, Diversity | Code Available | 1 | 5 |
| LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task | Jul 16, 2024 | image-classification, Image Classification | Code Available | 1 | 5 |
| LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain | Apr 2, 2024 | Argument Mining, Decision Making | Code Available | 1 | 5 |
| FLEX: Unifying Evaluation for Few-Shot NLP | Jul 15, 2021 | Few-Shot Learning, Language Modeling | Code Available | 1 | 5 |
| AraGPT2: Pre-Trained Transformer for Arabic Language Generation | Dec 31, 2020 | Articles, Language Modeling | Code Available | 1 | 5 |
| Cross-model Control: Improving Multiple Large Language Models in One-time Training | Oct 23, 2024 | Instruction Following, Language Modeling | Code Available | 1 | 5 |
| Linear Recurrent Units for Sequential Recommendation | Oct 3, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model | Mar 11, 2024 | Binary Classification, Language Modeling | Code Available | 1 | 5 |
| Fluent dreaming for language models | Jan 24, 2024 | Adversarial Attack, Language Modeling | Code Available | 1 | 5 |
| Fly-Swat or Cannon? Cost-Effective Language Model Choice via Meta-Modeling | Aug 11, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| AraELECTRA: Pre-Training Text Discriminators for Arabic Language Understanding | Dec 31, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Gradient-Based Constrained Sampling from Language Models | May 25, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training | Sep 15, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications | Mar 11, 2024 | Attribute, Descriptive | Code Available | 1 | 5 |
| LightLM: A Lightweight Deep and Narrow Language Model for Generative Recommendation | Oct 26, 2023 | Hallucination, Language Modeling | Code Available | 1 | 5 |
| Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs | Apr 15, 2024 | Hallucination, Language Modeling | Code Available | 1 | 5 |
| Librispeech Transducer Model with Internal Language Model Prior Correction | Apr 7, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach | Aug 14, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Likelihood-Based Diffusion Language Models | May 30, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Foundation Transformers | Oct 12, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding | Mar 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Construction Repetition Reduces Information Rate in Dialogue | Oct 15, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Pre-training via Paraphrasing | Jun 26, 2020 | Document Summarization, Document Translation | Code Available | 1 | 5 |
| Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing | Jul 28, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Content-based Controls For Music Large Language Modeling | Oct 26, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Content Planning for Neural Story Generation with Aristotelian Rescoring | Sep 21, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Adaptive KalmanNet: Data-Driven Kalman Filter with Fast Adaptation | Sep 13, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Free and Customizable Code Documentation with LLMs: A Fine-Tuning Approach | Dec 1, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Probabilistic Generative Transformer Language models for Generative Design of Molecules | Sep 20, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Probing Across Time: What Does RoBERTa Know and When? | Apr 16, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CDLM: Cross-Document Language Modeling | Jan 2, 2021 | Citation Recommendation, Coreference Resolution | Code Available | 1 | 5 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | May 9, 2024 | Data Visualization, Language Modeling | Code Available | 1 | 5 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image Captioning, Image Description | Code Available | 1 | 5 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language Model | Jan 19, 2025 | In-Context Learning, Language Modeling | Code Available | 1 | 5 |
| L-GreCo: Layerwise-Adaptive Gradient Compression for Efficient and Accurate Deep Learning | Oct 31, 2022 | image-classification, Image Classification | Code Available | 1 | 5 |