| Less is More: Task-aware Layer-wise Distillation for Language Model Compression | Oct 4, 2022 | Language Modeling | Code Available | 1 |
| Let's Stop Incorrect Comparisons in End-to-end Relation Extraction! | Sep 22, 2020 | Articles, Language Modeling | Code Available | 1 |
| Leveraging Label Correlations in a Multi-label Setting: A Case Study in Emotion | Oct 28, 2022 | Emotion Recognition, Language Modeling | Code Available | 1 |
| Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval | Nov 10, 2023 | Language Modeling | Code Available | 1 |
| LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development | May 12, 2023 | Knowledge Probing, Language Modeling | Code Available | 1 |
| Lexicon-Level Contrastive Visual-Grounding Improves Language Modeling | Mar 21, 2024 | Grounded Language Learning, Language Acquisition | Code Available | 1 |
| Extracting Cultural Commonsense Knowledge at Scale | Oct 14, 2022 | Language Modeling | Code Available | 1 |
| Fast Vocabulary Transfer for Language Model Compression | Feb 15, 2024 | Language Modeling | Code Available | 1 |
| Evolutionary Large Language Model for Automated Feature Transformation | May 25, 2024 | Efficient Exploration, Evolutionary Algorithms | Code Available | 1 |
| Event Causality Identification via Derivative Prompt Joint Learning | Oct 1, 2022 | Event Causality Identification, Language Modeling | Code Available | 1 |
| Evolving Deep Neural Networks | Mar 1, 2017 | Deep Learning, Image Captioning | Code Available | 1 |
| BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation | Jun 19, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks | Mar 31, 2022 | Language Modeling | Code Available | 1 |
| Bilinear MLPs enable weight-based mechanistic interpretability | Oct 10, 2024 | Image Classification | Code Available | 1 |
| Evaluation Benchmarks for Spanish Sentence Representations | Apr 15, 2022 | Language Modeling | Code Available | 1 |
| Supervised Learning and Large Language Model Benchmarks on Mental Health Datasets: Cognitive Distortions and Suicidal Risks in Chinese Social Media | Sep 7, 2023 | Classification, Language Modeling | Code Available | 1 |
| Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family | Mar 14, 2023 | Knowledge Base Question Answering, Language Modeling | Code Available | 1 |
| Linking Emergent and Natural Languages via Corpus Transfer | Mar 24, 2022 | Attribute, Disentanglement | Code Available | 1 |
| Evaluating Morphological Alignment of Tokenizers in 70 Languages | Jul 8, 2025 | Language Modeling | Code Available | 1 |
| Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation | Jul 26, 2024 | Diversity, Language Modeling | Code Available | 1 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Apr 21, 2024 | GPU, Language Modeling | Code Available | 1 |
| ExaRanker: Explanation-Augmented Neural Ranker | Jan 25, 2023 | Language Modeling | Code Available | 1 |
| Evaluating Language Model Finetuning Techniques for Low-resource Languages | Jun 30, 2019 | Language Modeling | Code Available | 1 |
| "Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation | Sep 4, 2024 | Language Modeling | Code Available | 1 |
| Binary Black-box Evasion Attacks Against Deep Learning-based Static Malware Detectors with Adversarial Byte-Level Language Model | Dec 14, 2020 | Deep Learning, Feature Engineering | Code Available | 1 |
| Lite Transformer with Long-Short Range Attention | Apr 24, 2020 | Abstractive Text Summarization, AutoML | Code Available | 1 |
| Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula | Aug 8, 2024 | GSM8K, Language Modeling | Code Available | 1 |
| AudioBERT: Audio Knowledge Augmented Language Model | Sep 12, 2024 | Language Modeling | Code Available | 1 |
| Adaptive Attention Span in Transformers | May 19, 2019 | 8k, Language Modeling | Code Available | 1 |
| BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model | Apr 8, 2022 | Entity Linking, Language Modeling | Code Available | 1 |
| Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction | Jul 4, 2024 | Language Modeling | Code Available | 1 |
| Evaluating Language Models as Synthetic Data Generators | Dec 4, 2024 | Language Modeling | Code Available | 1 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | Chatbot, Language Modeling | Code Available | 1 |
| Bioformer: an efficient transformer language model for biomedical text mining | Feb 3, 2023 | Articles, Document Classification | Code Available | 1 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Dec 2, 2024 | Cross-Modal Alignment, Knowledge Distillation | Code Available | 1 |
| Evaluating Human-Language Model Interaction | Dec 19, 2022 | Language Modeling | Code Available | 1 |
| Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators | Mar 25, 2024 | Language Modeling | Code Available | 1 |
| Euphemistic Phrase Detection by Masked Language Model | Sep 10, 2021 | Language Modeling | Code Available | 1 |
| Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication | Dec 4, 2023 | Language Modeling | Code Available | 1 |
| Espresso: A Fast End-to-end Neural Speech Recognition Toolkit | Sep 18, 2019 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identification, Language Modeling | Code Available | 1 |
| ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation | Aug 4, 2023 | Abstractive Text Summarization, Language Modeling | Code Available | 1 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary Classification, Language Modeling | Code Available | 1 |
| Adaptive Attention Span in Computer Vision | Apr 18, 2020 | Language Modeling | Code Available | 1 |
| EscapeBench: Pushing Language Models to Think Outside the Box | Dec 18, 2024 | Language Modeling | Code Available | 1 |
| Biomedical Event Extraction with Hierarchical Knowledge Graphs | Sep 20, 2020 | Event Extraction, Language Modeling | Code Available | 1 |
| Establishing baselines for generative discovery of inorganic crystals | Jan 4, 2025 | Band Gap, Language Modeling | Code Available | 1 |
| Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit | Aug 19, 2024 | Decoder, Language Modeling | Code Available | 1 |
| RARR: Researching and Revising What Language Models Say, Using Language Models | Oct 17, 2022 | Few-Shot Learning, Language Modeling | Code Available | 1 |
| Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models | Jul 31, 2024 | Dictionary Learning, Language Modeling | Code Available | 1 |