| MatSciBERT: A Materials Domain Language Model for Text Mining and Information Extraction | Sep 30, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Data-to-Text Generation with Iterative Text Editing | Nov 3, 2020 | Data-to-Text GenerationDomain Adaptation | CodeCode Available | 1 | 5 |
| Approaching Deep Learning through the Spectral Dynamics of Weights | Aug 21, 2024 | Deep Learningimage-classification | CodeCode Available | 1 | 5 |
| MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation | May 12, 2021 | Adversarial TextData Augmentation | CodeCode Available | 1 | 5 |
| Materials Informatics Transformer: A Language Model for Interpretable Materials Properties Prediction | Aug 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach | Aug 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Data Efficient Masked Language Modeling for Vision and Language | Sep 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection | Apr 10, 2023 | Action DetectionLanguage Modeling | CodeCode Available | 1 | 5 |
| Matching Networks for One Shot Learning | Jun 13, 2016 | Few-Shot Image ClassificationFew-Shot Learning | CodeCode Available | 1 | 5 |
| Cross-Thought for Sentence Encoder Pre-training | Oct 7, 2020 | Information RetrievalLanguage Modeling | CodeCode Available | 1 | 5 |
| MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics Education | Jun 2, 2021 | Knowledge TracingLanguage Modeling | CodeCode Available | 1 | 5 |
| Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training | Jun 1, 2022 | Contrastive LearningCross-Lingual Transfer | CodeCode Available | 1 | 5 |
| Data Augmentation using Pre-trained Transformer Models | Mar 4, 2020 | Data AugmentationDiversity | CodeCode Available | 1 | 5 |
| SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains | Feb 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CrowdVLM-R1: Expanding R1 Ability to Vision Language Model for Crowd Counting using Fuzzy Group Relative Policy Reward | Mar 31, 2025 | Crowd CountingLanguage Modeling | CodeCode Available | 1 | 5 |
| Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling | Jan 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Massive Editing for Large Language Models via Meta Learning | Nov 8, 2023 | Fact CheckingLanguage Modeling | CodeCode Available | 1 | 5 |
| data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup | Nov 2, 2022 | Automatic Speech Recognition (ASR)Language Modeling | CodeCode Available | 1 | 5 |
| Debiasing Methods in Natural Language Understanding Make Bias More Accessible | Sep 9, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Interpreting Language Models with Contrastive Explanations | Feb 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations | Sep 1, 2021 | Emotion ClassificationLanguage Modeling | CodeCode Available | 1 | 5 |
| Investigating Fairness Disparities in Peer Review: A Language Model Enhanced Approach | Nov 7, 2022 | FairnessLanguage Modeling | CodeCode Available | 1 | 5 |
| Analysing The Impact of Sequence Composition on Language Model Pre-Training | Feb 21, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| CTRAN: CNN-Transformer-based Network for Natural Language Understanding | Mar 19, 2023 | DecoderIntent Detection | CodeCode Available | 1 | 5 |
| Mass-Producing Failures of Multimodal Systems with Language Models | Jun 21, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CTRL: A Conditional Transformer Language Model for Controllable Generation | Sep 11, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes | Oct 22, 2024 | GSM8KLanguage Modeling | CodeCode Available | 1 | 5 |
| Towards Evaluating Generalist Agents: An Automated Benchmark in Open World | Oct 12, 2023 | BenchmarkingDiversity | CodeCode Available | 1 | 5 |
| Markovian Transformers for Informative Language Modeling | Apr 29, 2024 | GSM8KInformativeness | CodeCode Available | 1 | 5 |
| LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction | Dec 19, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 | 5 |
| Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning | Sep 9, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains | Jul 16, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Jul 12, 2024 | Document Layout Analysisdocument understanding | CodeCode Available | 1 | 5 |
| Invariant Language Modeling | Oct 16, 2021 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 | 5 |
| A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization | May 22, 2025 | Combinatorial OptimizationLanguage Modeling | CodeCode Available | 1 | 5 |
| CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies | Apr 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| InvestLM: A Large Language Model for Investment using Financial Domain Instruction Tuning | Sep 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model Recommendation | May 12, 2023 | FairnessLanguage Modeling | CodeCode Available | 1 | 5 |
| IoT-LM: Large Multisensory Language Models for the Internet of Things | Jul 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| IPA-CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language Modeling | Apr 3, 2025 | Grapheme-to-Phoneme ConversionLanguage Modeling | CodeCode Available | 1 | 5 |
| Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Jul 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks | Jun 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medical Visual Question Answering | Jul 11, 2023 | Language ModelingMedical Visual Question Answering | CodeCode Available | 1 | 5 |
| Mapping Memes to Words for Multimodal Hateful Meme Classification | Oct 12, 2023 | Hateful Meme ClassificationLanguage Modeling | CodeCode Available | 1 | 5 |
| DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA | Dec 6, 2024 | counterfactualLanguage Model Evaluation | CodeCode Available | 1 | 5 |
| Talking-Heads Attention | Mar 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| MarianCG: a code generation transformer model inspired by machine translation | Nov 22, 2022 | Code GenerationCode Translation | CodeCode Available | 1 | 5 |
| DALE: Generative Data Augmentation for Low-Resource Legal NLP | Oct 24, 2023 | Data AugmentationDecoder | CodeCode Available | 1 | 5 |
| TAPEX: Table Pre-training via Learning a Neural SQL Executor | Jul 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Jun 6, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 | 5 |