| Critic-Guided Decoding for Controlled Text Generation | Dec 21, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception | Mar 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles | Sep 16, 2024 | Causal Language Modeling, Language Modeling | Code Available | 1 | 5 |
| Entity Tracking in Language Models | May 3, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CDLM: Cross-Document Language Modeling | Jan 2, 2021 | Citation Recommendation, Coreference Resolution | Code Available | 1 | 5 |
| CriticEval: Evaluating Large Language Model as Critic | Feb 21, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Debiasing the Cloze Task in Sequential Recommendation with Bidirectional Transformers | Jan 22, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Learning Passage Impacts for Inverted Indexes | Apr 24, 2021 | Information Retrieval, Language Modeling | Code Available | 1 | 5 |
| Librispeech Transducer Model with Internal Language Model Prior Correction | Apr 7, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis | Oct 23, 2023 | In-Context Learning, Language Modeling | Code Available | 1 | 5 |
| A Tensorized Transformer for Language Modeling | Jun 24, 2019 | Decoder, Language Modeling | Code Available | 1 | 5 |
| Multi-Task Learning for Front-End Text Processing in TTS | Jan 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Algorithmic progress in language models | Mar 9, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| EscapeBench: Pushing Language Models to Think Outside the Box | Dec 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Espresso: A Fast End-to-end Neural Speech Recognition Toolkit | Sep 18, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation | Aug 4, 2023 | Abstractive Text Summarization, Language Modeling | Code Available | 1 | 5 |
| Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation | Sep 19, 2023 | Language Model Evaluation, Language Modeling | Code Available | 1 | 5 |
| COMET: Learning Cardinality Constrained Mixture of Experts with Trees and Local Search | Jun 5, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Euphemistic Phrase Detection by Masked Language Model | Sep 10, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | Chatbot, Language Modeling | Code Available | 1 | 5 |
| CommitBERT: Commit Message Generation Using Pre-Trained Programming Language Model | May 29, 2021 | Decoder, Language Modeling | Code Available | 1 | 5 |
| CommitBERT: Commit Message Generation Using Pre-Trained Programming Language Model | Aug 1, 2021 | Decoder, Language Modeling | Code Available | 1 | 5 |
| Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training | Oct 23, 2020 | Data-to-Text Generation, Language Modeling | Code Available | 1 | 5 |
| Mask-Predict: Parallel Decoding of Conditional Masked Language Models | Apr 19, 2019 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning | Jul 20, 2024 | Contrastive Learning, Diagnostic | Code Available | 1 | 5 |
| Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction | Jul 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| NaturalProver: Grounded Mathematical Proof Generation with Language Models | May 25, 2022 | Automated Theorem Proving, Language Modeling | Code Available | 1 | 5 |
| Causal Distillation for Language Models | Dec 5, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Causal Discovery with Language Models as Imperfect Experts | Jul 5, 2023 | Causal Discovery, Decision Making | Code Available | 1 | 5 |
| Catwalk: A Unified Language Model Evaluation Framework for Many Datasets | Dec 15, 2023 | In-Context Learning, Language Model Evaluation | Code Available | 1 | 5 |
| Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula | Aug 8, 2024 | GSM8K, Language Modeling | Code Available | 1 | 5 |
| CREAM: Consistency Regularized Self-Rewarding Language Models | Oct 16, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model | Apr 13, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Atla Selene Mini: A General Purpose Evaluation Model | Jan 27, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Evaluating Morphological Alignment of Tokenizers in 70 Languages | Jul 8, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Neural Implicit Vision-Language Feature Fields | Mar 20, 2023 | Image Segmentation, Language Modeling | Code Available | 1 | 5 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Apr 21, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |
| A Realistic Threat Model for Large Language Model Jailbreaks | Oct 21, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CAT-LM: Training Language Models on Aligned Code And Tests | Oct 2, 2023 | Code Generation, Language Modeling | Code Available | 1 | 5 |
| Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family | Mar 14, 2023 | Knowledge Base Question Answering, Language Modeling | Code Available | 1 | 5 |
| Evaluation Benchmarks for Spanish Sentence Representations | Apr 15, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze Test, Language Modeling | Code Available | 1 | 5 |
| 4-bit Shampoo for Memory-Efficient Network Training | May 28, 2024 | image-classification, Image Classification | Code Available | 1 | 5 |
| Event Causality Identification via Derivative Prompt Joint Learning | Oct 1, 2022 | Event Causality Identification, Language Modeling | Code Available | 1 | 5 |
| Newswire: A Large-Scale Structured Database of a Century of Historical News | Jun 13, 2024 | Articles, Entity Disambiguation | Code Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease Prediction, Language Modeling | Code Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | Decoder, Denoising | Code Available | 1 | 5 |
| Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Jul 15, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Counterfactual Token Generation in Large Language Models | Sep 25, 2024 | Bias Detection, counterfactual | Code Available | 1 | 5 |