| Gated Linear Attention Transformers with Hardware-Efficient Training | Dec 11, 2023 | 2kLanguage Modeling | CodeCode Available | 1 |
| GenAug: Data Augmentation for Finetuning Text Generators | Oct 5, 2020 | Data AugmentationDiversity | CodeCode Available | 1 |
| Gandalf the Red: Adaptive Security for LLMs | Jan 14, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 |
| gaBERT -- an Irish Language Model | Jul 27, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Masked Structural Growth for 2x Faster Language Model Pre-training | May 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GAP: A Graph-aware Language Model Framework for Knowledge Graph-to-Text Generation | Apr 13, 2022 | Data-to-Text GenerationGraph Attention | CodeCode Available | 1 |
| Multi-modal vision-language model for generalizable annotation-free pathology localization and clinical diagnosis | Jan 4, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| InferCept: Efficient Intercept Support for Augmented Large Language Model Inference | Feb 2, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| FuzzCoder: Byte-level Fuzzing Test via Large Language Model | Sep 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning | Jan 1, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ApiQ: Finetuning of 2-Bit Quantized Large Language Model | Feb 7, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data | Aug 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FVEval: Understanding Language Model Capabilities in Formal Verification of Digital Hardware | Oct 15, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| Frustratingly Simple Pretraining Alternatives to Masked Language Modeling | Sep 4, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese | Oct 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| 2SSP: A Two-Stage Framework for Structured Pruning of LLMs | Jan 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek | Nov 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Tensorized Transformer for Language Modeling | Jun 24, 2019 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Fusing Context Into Knowledge Graph for Commonsense Question Answering | Dec 9, 2020 | Common Sense ReasoningKnowledge Graphs | CodeCode Available | 1 |
| Generalization through Memorization: Nearest Neighbor Language Models | Nov 1, 2019 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| From Text to Pixel: Advancing Long-Context Understanding in MLLMs | May 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network | Aug 22, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| From Distillation to Hard Negative Sampling: Making Sparse Neural IR Models More Effective | May 10, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning | Sep 30, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| Free and Customizable Code Documentation with LLMs: A Fine-Tuning Approach | Dec 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| f-PO: Generalizing Preference Optimization with f-divergence Minimization | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection | Dec 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Generative Language Model for Few-shot Aspect-Based Sentiment Analysis | Apr 11, 2022 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 |
| APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs | Feb 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Forecasting Future World Events with Neural Networks | Jun 30, 2022 | Decision MakingDiversity | CodeCode Available | 1 |
| Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations | Oct 2, 2023 | In-Context LearningInstruction Following | CodeCode Available | 1 |
| FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications | Mar 11, 2024 | AttributeDescriptive | CodeCode Available | 1 |
| Forcing Diffuse Distributions out of Language Models | Apr 16, 2024 | Dataset GenerationDiversity | CodeCode Available | 1 |
| FocusLLM: Precise Understanding of Long Context by Dynamic Condensing | Aug 21, 2024 | 8kDecoder | CodeCode Available | 1 |
| A Generative Approach for Script Event Prediction via Contrastive Fine-tuning | Dec 7, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FOLIO: Natural Language Reasoning with First-Order Logic | Sep 2, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FonBund: A Library for Combining Cross-lingual Phonological Segment Data | May 1, 2018 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Foundation Transformers | Oct 12, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Frustratingly Easy Edit-based Linguistic Steganography with a Masked Language Model | Apr 20, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain | Apr 2, 2024 | Argument MiningDecision Making | CodeCode Available | 1 |
| A general-purpose material property data extraction pipeline from large polymer corpora using Natural Language Processing | Sep 27, 2022 | ArticlesLanguage Modeling | CodeCode Available | 1 |
| AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model | Sep 6, 2024 | AttributeAutoML | CodeCode Available | 1 |
| AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model | Sep 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language Model | Jan 21, 2025 | HallucinationImage Captioning | CodeCode Available | 1 |
| FLEX: Unifying Evaluation for Few-Shot NLP | Jul 15, 2021 | Few-Shot LearningLanguage Modeling | CodeCode Available | 1 |
| FinVis-GPT: A Multimodal Large Language Model for Financial Chart Analysis | Jul 31, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FIRE: Fact-checking with Iterative Retrieval and Verification | Oct 17, 2024 | Claim VerificationFact Checking | CodeCode Available | 1 |
| FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text Compression | Sep 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |