| Guiding Attention for Self-Supervised Learning with Transformers | Oct 6, 2020 | Language Modeling | Code Available | 1 | 5 |
| LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis | Oct 23, 2023 | In-Context Learning, Language Modeling | Code Available | 1 | 5 |
| LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models | Nov 11, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Jul 16, 2021 | Language Modeling | Code Available | 1 | 5 |
| Control Prefixes for Parameter-Efficient Text Generation | Oct 15, 2021 | Abstractive Text Summarization, Attribute | Code Available | 1 | 5 |
| LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital Twins | May 28, 2024 | Decision Making, Language Modeling | Code Available | 1 | 5 |
| DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Jul 12, 2024 | Document Layout Analysis, Document Understanding | Code Available | 1 | 5 |
| RoBERTa: A Robustly Optimized BERT Pretraining Approach | Jul 26, 2019 | Common Sense Reasoning, Document Image Classification | Code Available | 1 | 5 |
| Hallucinations in Large Multilingual Translation Models | Mar 28, 2023 | Language Modeling | Code Available | 1 | 5 |
| LOGO -- Long cOntext aliGnment via efficient preference Optimization | Oct 24, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |
| LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention | Oct 2, 2020 | Common Sense Reasoning, Entity Typing | Code Available | 1 | 5 |
| Conversational Recommender System and Large Language Model Are Made for Each Other in E-commerce Pre-sales Dialogue | Oct 23, 2023 | Language Modeling | Code Available | 1 | 5 |
| HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics | Oct 13, 2024 | Language Modeling | Code Available | 1 | 5 |
| Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity | Apr 8, 2020 | AMR-to-Text Generation, Data-to-Text Generation | Code Available | 1 | 5 |
| Robust Optimization in Protein Fitness Landscapes Using Reinforcement Learning in Latent Space | May 29, 2024 | Decoder, Language Modeling | Code Available | 1 | 5 |
| Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model Merging | Apr 8, 2024 | Language Modeling | Code Available | 1 | 5 |
| RoChBert: Towards Robust BERT Fine-tuning for Chinese | Oct 28, 2022 | Data Augmentation, Language Modeling | Code Available | 1 | 5 |
| Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation? | Feb 17, 2025 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| CycleFormer: TSP Solver Based on Language Modeling | May 30, 2024 | Decoder, Language Modeling | Code Available | 1 | 5 |
| Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | May 23, 2025 | Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19 | Jun 19, 2020 | Chatbot, Language Modeling | Code Available | 1 | 5 |
| Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source) | Apr 6, 2025 | Language Modeling | Code Available | 1 | 5 |
| HYTREL: Hypergraph-enhanced Tabular Data Representation Learning | Jul 14, 2023 | Language Modeling | Code Available | 1 | 5 |
| HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims | Oct 16, 2024 | Fact Checking, Language Modeling | Code Available | 1 | 5 |
| DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling | Sep 25, 2024 | Data Augmentation, Diversity | Code Available | 1 | 5 |