| A Multi-Grained Self-Interpretable Symbolic-Neural Model For Single/Multi-Labeled Text Classification | Mar 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CTRAN: CNN-Transformer-based Network for Natural Language Understanding | Mar 19, 2023 | DecoderIntent Detection | CodeCode Available | 1 | 5 |
| Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Jul 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Counterfactual Token Generation in Large Language Models | Sep 25, 2024 | Bias Detectioncounterfactual | CodeCode Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze TestLanguage Modeling | CodeCode Available | 1 | 5 |
| Housekeep: Tidying Virtual Households using Commonsense Reasoning | May 22, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| AdaVAE: Exploring Adaptive GPT-2s in Variational Auto-Encoders for Language Modeling | May 12, 2022 | Conditional Text GenerationDecoder | CodeCode Available | 1 | 5 |
| IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model | Jul 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images | Oct 22, 2023 | DiagnosticLanguage Modeling | CodeCode Available | 1 | 5 |
| CycleFormer : TSP Solver Based on Language Modeling | May 30, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactualData Augmentation | CodeCode Available | 1 | 5 |
| DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling | Sep 25, 2024 | Data AugmentationDiversity | CodeCode Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 | 5 |
| DALE: Generative Data Augmentation for Low-Resource Legal NLP | Oct 24, 2023 | Data AugmentationDecoder | CodeCode Available | 1 | 5 |
| hmBERT: Historical Multilingual Language Models for Named Entity Recognition | May 31, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Jul 12, 2024 | Document Layout Analysisdocument understanding | CodeCode Available | 1 | 5 |
| Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Jul 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ | Sep 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues | May 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval | Mar 29, 2025 | AllLanguage Modeling | CodeCode Available | 1 | 5 |
| How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model | Apr 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Data Augmentation using Pre-trained Transformer Models | Mar 4, 2020 | Data AugmentationDiversity | CodeCode Available | 1 | 5 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 | 5 |
| Data Efficient Masked Language Modeling for Vision and Language | Sep 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Hierarchical Transformers Are More Efficient Language Models | Oct 26, 2021 | Image GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| High-Dimension Human Value Representation in Large Language Models | Apr 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Debiasing Methods in Natural Language Understanding Make Bias More Accessible | Sep 9, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RLLanguage Modeling | CodeCode Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Jul 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Autonomous Microscopy Experiments through Large Language Model Agents | Dec 18, 2024 | BenchmarkingExperimental Design | CodeCode Available | 1 | 5 |
| AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein Engineering | Nov 7, 2024 | AutoMLHyperparameter Optimization | CodeCode Available | 1 | 5 |
| A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NER | Aug 28, 2023 | Contrastive Learningfew-shot-ner | CodeCode Available | 1 | 5 |
| Enhancing Monocular 3D Scene Completion with Diffusion Model | Mar 2, 2025 | 3D Reconstruction3D Scene Reconstruction | CodeCode Available | 1 | 5 |
| AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-Tuning | Oct 12, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DistilProtBert: A distilled protein language model used to distinguish between real proteins and their randomly shuffled counterparts | May 10, 2022 | Dimensionality ReductionKnowledge Distillation | CodeCode Available | 1 | 5 |
| Decoding Speculative Decoding | Feb 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Copy Is All You Need | Jul 13, 2023 | AllDomain Adaptation | CodeCode Available | 1 | 5 |
| History Matters: Temporal Knowledge Editing in Large Language Model | Dec 9, 2023 | knowledge editingLanguage Modeling | CodeCode Available | 1 | 5 |
| Decoding-Time Language Model Alignment with Multiple Objectives | Jun 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| How does the pre-training objective affect what large language models learn about linguistic properties? | Mar 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Heterogeneous Graph Reasoning for Fact Checking over Texts and Tables | Feb 20, 2024 | Fact CheckingGraph Neural Network | CodeCode Available | 1 | 5 |
| Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source) | Apr 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| HetSeq: Distributed GPU Training on Heterogeneous Infrastructure | Sep 25, 2020 | GPUimage-classification | CodeCode Available | 1 | 5 |
| Deep Equilibrium Models | Sep 3, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Improving NER's Performance with Massive financial corpus | Jul 31, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Automated Spinal MRI Labelling from Reports Using a Large Language Model | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Jun 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |