| Filtering Noisy Parallel Corpus using Transformers with Proxy Task Learning | Nov 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain | Apr 2, 2024 | Argument MiningDecision Making | CodeCode Available | 1 |
| FADE: Few-shot/zero-shot Anomaly Detection Engine using Large Vision-Language Model | Aug 31, 2024 | Anomaly DetectionAnomaly Segmentation | CodeCode Available | 1 |
| Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments | Jun 17, 2024 | FairnessLanguage Modeling | CodeCode Available | 1 |
| Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation | Mar 19, 2024 | Gloss-free Sign Language TranslationLanguage Modeling | CodeCode Available | 1 |
| Facilitating large language model Russian adaptation with Learned Embedding Propagation | Dec 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Factorized Neural Transducer for Efficient Language Model Adaptation | Sep 27, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration | Oct 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| fairseq: A Fast, Extensible Toolkit for Sequence Modeling | Apr 1, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing | Sep 29, 2020 | Inductive BiasLanguage Modeling | CodeCode Available | 1 |
| Analysing The Impact of Sequence Composition on Language Model Pre-Training | Feb 21, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain | Oct 26, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GREEK-BERT: The Greeks visiting Sesame Street | Aug 27, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction | Dec 19, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 |
| Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning | Sep 9, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GRENADE: Graph-Centric Language Model for Self-Supervised Representation Learning on Text-Attributed Graphs | Oct 23, 2023 | Contrastive LearningGraph Neural Network | CodeCode Available | 1 |
| Extracting Latent Steering Vectors from Pretrained Language Models | May 10, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CORAL: Expert-Curated medical Oncology Reports to Advance Language Model Inference | Aug 7, 2023 | Event DetectionLanguage Modeling | CodeCode Available | 1 |
| Extracting Cultural Commonsense Knowledge at Scale | Oct 14, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Extracting Definienda in Mathematical Scholarly Articles with Transformers | Nov 21, 2023 | ArticlesLanguage Modeling | CodeCode Available | 1 |
| Extracting Training Data from Large Language Models | Dec 14, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models | Feb 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving | May 13, 2025 | 3D visual groundingAutonomous Driving | CodeCode Available | 1 |
| gzip Predicts Data-dependent Scaling Laws | May 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Unifying Segment Anything in Microscopy with Multimodal Large Language Model | May 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Exploring the Limits of Language Modeling | Feb 7, 2016 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels | Mar 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer | May 6, 2021 | Data AugmentationDecoder | CodeCode Available | 1 |
| Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer Learning | Apr 8, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Extensive Self-Contrast Enables Feedback-Free Language Model Alignment | Mar 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Hello, It's GPT-2 -- How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems | Jul 12, 2019 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning | Feb 9, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| Accelerating Vision-Language Pretraining with Free Language Modeling | Mar 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims | Oct 16, 2024 | Fact CheckingLanguage Modeling | CodeCode Available | 1 |
| Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source) | Apr 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Heterogeneous Graph Reasoning for Fact Checking over Texts and Tables | Feb 20, 2024 | Fact CheckingGraph Neural Network | CodeCode Available | 1 |
| Exploring Large Language Model for Graph Data Understanding in Online Job Recommendations | Jul 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Nov 6, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Exploring Quantization for Efficient Pre-Training of Transformer Language Models | Jul 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Hierarchical Transformers Are More Efficient Language Models | Oct 26, 2021 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| FLIP: Fine-grained Alignment between ID-based Models and Pretrained Language Models for CTR Prediction | Oct 30, 2023 | Click-Through Rate PredictionContrastive Learning | CodeCode Available | 1 |
| Backpack Language Models | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable | Mar 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Exploring Stochastic Autoregressive Image Modeling for Visual Representation | Dec 3, 2022 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Extracting and Inferring Personal Attributes from Dialogue | Sep 26, 2021 | AttributeLanguage Modeling | CodeCode Available | 1 |
| AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation | Aug 1, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Housekeep: Tidying Virtual Households using Commonsense Reasoning | May 22, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model | Apr 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FALL-E: A Foley Sound Synthesis Model and Strategies | Jun 16, 2023 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Exploiting Novel GPT-4 APIs | Dec 21, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |