| Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language Model | Nov 3, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Legilimens: Practical and Unified Content Moderation for Large Language Model Services | Aug 28, 2024 | Data Augmentation, Language Modeling | Code Available | 1 |
| Length Generalization of Causal Transformers without Position Encoding | Apr 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation | Sep 19, 2023 | Language Model Evaluation, Language Modeling | Code Available | 1 |
| Euphemistic Phrase Detection by Masked Language Model | Sep 10, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| LESA: Linguistic Encapsulation and Semantic Amalgamation Based Generalised Claim Detection from Online Content | Jan 28, 2021 | Argument Mining, Language Modeling | Code Available | 1 |
| ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation | Aug 4, 2023 | Abstractive Text Summarization, Language Modeling | Code Available | 1 |
| Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation | Jul 26, 2024 | Diversity, Language Modeling | Code Available | 1 |
| Leveraging Label Correlations in a Multi-label Setting: A Case Study in Emotion | Oct 28, 2022 | Emotion Recognition, Language Modeling | Code Available | 1 |
| Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval | Nov 10, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Lexicon-Level Contrastive Visual-Grounding Improves Language Modeling | Mar 21, 2024 | Grounded language learning, Language Acquisition | Code Available | 1 |
| BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation | Jun 19, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks | Mar 31, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Bilinear MLPs enable weight-based mechanistic interpretability | Oct 10, 2024 | image-classification, Image Classification | Code Available | 1 |
| Establishing baselines for generative discovery of inorganic crystals | Jan 4, 2025 | Band Gap, Language Modeling | Code Available | 1 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | Chatbot, Language Modeling | Code Available | 1 |
| Evaluating Language Models as Synthetic Data Generators | Dec 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| "Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation | Sep 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| EscapeBench: Pushing Language Models to Think Outside the Box | Dec 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| AudioBERT: Audio Knowledge Augmented Language Model | Sep 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Adaptive Attention Span in Transformers | May 19, 2019 | 8k, Language Modeling | Code Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identification, Language Modeling | Code Available | 1 |
| Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement | Feb 9, 2024 | Code Generation, Decision Making | Code Available | 1 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Dec 2, 2024 | cross-modal alignment, Knowledge Distillation | Code Available | 1 |
| Epidemic Modeling with Generative Agents | Jul 11, 2023 | Decision Making, Language Modeling | Code Available | 1 |