| Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula | Aug 8, 2024 | GSM8KLanguage Modeling | CodeCode Available | 1 |
| Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction | Jul 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AcTune: Uncertainty-aware Active Self-Training for Semi-Supervised Active Learning with Pretrained Language Models | Dec 16, 2021 | Active LearningLanguage Modeling | CodeCode Available | 1 |
| Evaluating Language Model Finetuning Techniques for Low-resource Languages | Jun 30, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark | Oct 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework | Nov 7, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention | Mar 2, 2024 | 16kCPU | CodeCode Available | 1 |
| Evaluating Morphological Alignment of Tokenizers in 70 Languages | Jul 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Non-Exchangeable Conformal Language Generation with Nearest Neighbors | Feb 1, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 1 |
| Nonparametric Masked Language Modeling | Dec 2, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Event Causality Identification via Derivative Prompt Joint Learning | Oct 1, 2022 | Event Causality IdentificationLanguage Modeling | CodeCode Available | 1 |
| Euphemistic Phrase Detection by Masked Language Model | Sep 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models | Dec 20, 2022 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Not All Memories are Created Equal: Learning to Expire | Jan 1, 2021 | AllLanguage Modeling | CodeCode Available | 1 |
| Atla Selene Mini: A General Purpose Evaluation Model | Jan 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | ChatbotLanguage Modeling | CodeCode Available | 1 |
| Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation | Sep 19, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 |
| 2SSP: A Two-Stage Framework for Structured Pruning of LLMs | Jan 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language Model | Nov 3, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese | Oct 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Evaluating Human-Language Model Interaction | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling | Oct 14, 2022 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| Espresso: A Fast End-to-end Neural Speech Recognition Toolkit | Sep 18, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identificationLanguage Modeling | CodeCode Available | 1 |
| ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation | Aug 4, 2023 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 |
| Mask-Predict: Parallel Decoding of Conditional Masked Language Models | Apr 19, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Cascaded Head-colliding Attention | May 31, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Establishing baselines for generative discovery of inorganic crystals | Jan 4, 2025 | Band GapLanguage Modeling | CodeCode Available | 1 |
| InferCept: Efficient Intercept Support for Augmented Large Language Model Inference | Feb 2, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| On Diversified Preferences of Large Language Model Alignment | Dec 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Exploring the Limits of Language Modeling | Feb 7, 2016 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement | Feb 9, 2024 | Code GenerationDecision Making | CodeCode Available | 1 |
| On Faithfulness and Factuality in Abstractive Summarization | May 2, 2020 | Abstractive Text SummarizationDocument Summarization | CodeCode Available | 1 |
| Entity Tracking in Language Models | May 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| On Measuring Social Biases in Prompt-Based Multi-Task Learning | May 23, 2022 | FormLanguage Modeling | CodeCode Available | 1 |
| Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment | Dec 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Epidemic Modeling with Generative Agents | Jul 11, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Enhancing Vision-Language Model with Unmasked Token Alignment | May 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability | Mar 12, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| Enhancing the Protein Tertiary Structure Prediction by Multiple Sequence Alignment Generation | Jun 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus | Jan 27, 2023 | Language AcquisitionLanguage Modeling | CodeCode Available | 1 |
| DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts | May 7, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval | Oct 4, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 |
| On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation | May 3, 2020 | Cross-Lingual TransferLanguage Modeling | CodeCode Available | 1 |
| ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation | Dec 23, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| On the Sentence Embeddings from Pre-trained Language Models | Nov 2, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Algorithmic progress in language models | Mar 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Tensorized Transformer for Language Modeling | Jun 24, 2019 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications | Feb 5, 2025 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning | Sep 19, 2024 | Change DetectionDecoder | CodeCode Available | 1 |