| Exploring the Limits of Language Modeling | Feb 7, 2016 | Language Modeling | Code Available | 1 |
| Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels | Mar 23, 2023 | Language Modeling | Code Available | 1 |
| Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer | May 6, 2021 | Data Augmentation, Decoder | Code Available | 1 |
| Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer Learning | Apr 8, 2020 | Language Modeling | Code Available | 1 |
| Extensive Self-Contrast Enables Feedback-Free Language Model Alignment | Mar 31, 2024 | Language Modeling | Code Available | 1 |
| Hello, It's GPT-2 -- How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems | Jul 12, 2019 | Decision Making, Language Modeling | Code Available | 1 |
| Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning | Feb 9, 2024 | Instruction Following, Language Modeling | Code Available | 1 |
| Accelerating Vision-Language Pretraining with Free Language Modeling | Mar 24, 2023 | GPU, Language Modeling | Code Available | 1 |
| HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims | Oct 16, 2024 | Fact Checking, Language Modeling | Code Available | 1 |
| Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source) | Apr 6, 2025 | Language Modeling | Code Available | 1 |
| Heterogeneous Graph Reasoning for Fact Checking over Texts and Tables | Feb 20, 2024 | Fact Checking, Graph Neural Network | Code Available | 1 |
| Exploring Large Language Model for Graph Data Understanding in Online Job Recommendations | Jul 10, 2023 | Language Modeling | Code Available | 1 |
| ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Nov 6, 2023 | Decision Making, Language Modeling | Code Available | 1 |
| Exploring Quantization for Efficient Pre-Training of Transformer Language Models | Jul 16, 2024 | Language Modeling | Code Available | 1 |
| Hierarchical Transformers Are More Efficient Language Models | Oct 26, 2021 | Image Generation, Language Modeling | Code Available | 1 |
| FLIP: Fine-grained Alignment between ID-based Models and Pretrained Language Models for CTR Prediction | Oct 30, 2023 | Click-Through Rate Prediction, Contrastive Learning | Code Available | 1 |
| Backpack Language Models | May 26, 2023 | Language Modeling | Code Available | 1 |
| Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable | Mar 1, 2025 | Language Modeling | Code Available | 1 |
| Exploring Stochastic Autoregressive Image Modeling for Visual Representation | Dec 3, 2022 | Decoder, Language Modeling | Code Available | 1 |
| Extracting and Inferring Personal Attributes from Dialogue | Sep 26, 2021 | Attribute, Language Modeling | Code Available | 1 |
| AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation | Aug 1, 2024 | Diversity, Language Modeling | Code Available | 1 |
| Housekeep: Tidying Virtual Households using Commonsense Reasoning | May 22, 2022 | Language Modeling | Code Available | 1 |
| How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model | Apr 30, 2023 | Language Modeling | Code Available | 1 |
| FALL-E: A Foley Sound Synthesis Model and Strategies | Jun 16, 2023 | Diversity, Language Modeling | Code Available | 1 |
| Exploiting Novel GPT-4 APIs | Dec 21, 2023 | Language Modeling | Code Available | 1 |