| Personalized Abstractive Summarization by Tri-agent Generation Pipeline | May 4, 2023 | Abstractive Text Summarization, Language Modeling | Code Available | 0 |
| Generalizing and Hybridizing Count-based and Neural Language Models | Jun 1, 2016 | Language Modeling, Language Modelling | Code Available | 0 |
| Alternative structures for character-level RNNs | Nov 19, 2015 | Language Modeling, Language Modelling | Code Available | 0 |
| Alternating Synthetic and Real Gradients for Neural Language Modeling | Feb 27, 2019 | Language Modeling, Language Modelling | Code Available | 0 |
| InRanker: Distilled Rankers for Zero-shot Information Retrieval | Jan 12, 2024 | Information Retrieval, Language Modeling | Code Available | 0 |
| Generalizing Visual Question Answering from Synthetic to Human-Written Questions via a Chain of QA with a Large Language Model | Jan 12, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| General Point Model Pretraining with Autoencoding and Autoregressive | Jan 1, 2024 | Decoder, Language Modeling | Code Available | 0 |
| Automatic benchmarking of large multimodal models via iterative experiment programming | Jun 18, 2024 | Benchmarking, Language Modeling | Code Available | 0 |
| CHARTOM: A Visual Theory-of-Mind Benchmark for Multimodal Large Language Models | Aug 26, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| A Neural Language Model for Dynamically Representing the Meanings of Unknown Words and Entities in a Discourse | Sep 6, 2017 | Language Modeling, Language Modelling | Code Available | 0 |
| Adaptively Truncating Backpropagation Through Time to Control Gradient Bias | May 17, 2019 | Language Modeling, Language Modelling | Code Available | 0 |
| DPPA: Pruning Method for Large Language Model to Model Merging | Mar 5, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Instance Regularization for Discriminative Language Model Pre-training | Oct 11, 2022 | Denoising, Language Modeling | Code Available | 0 |
| LG-CAV: Train Any Concept Activation Vector with Language Guidance | Oct 14, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| DP-GAN: Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text | Feb 5, 2018 | Dialogue Generation, Diversity | Code Available | 0 |
| Generate then Refine: Data Augmentation for Zero-shot Intent Detection | Oct 2, 2024 | Data Augmentation, Diversity | Code Available | 0 |
| Do You Have the Right Scissors? Tailoring Pre-trained Language Models via Monte-Carlo Methods | Jul 13, 2020 | Language Modeling, Language Modelling | Code Available | 0 |
| ChartFormer: A Large Vision Language Model for Converting Chart Images into Tactile Accessible SVGs | May 29, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Character-Level Language Modeling with Deeper Self-Attention | Aug 9, 2018 | Language Modeling, Language Modelling | Code Available | 0 |
| Character-Level Incremental Speech Recognition with Recurrent Neural Networks | Jan 25, 2016 | Language Modeling, Language Modelling | Code Available | 0 |
| Do Vision-Language Models Understand Compound Nouns? | Mar 30, 2024 | Image Retrieval, Language Modeling | Code Available | 0 |
| Characterizing Learning Curves During Language Model Pre-Training: Learning, Forgetting, and Stability | Aug 29, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition | Aug 17, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| A City of Millions: Mapping Literary Social Networks At Scale | Feb 26, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| DagoBERT: Generating Derivational Morphology with a Pretrained Language Model | May 2, 2020 | General Classification, Language Modeling | Code Available | 0 |
| Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding | Jan 10, 2024 | Decoder, Diversity | Code Available | 0 |
| MCRanker: Generating Diverse Criteria On-the-Fly to Improve Point-wise LLM Rankers | Apr 18, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| AlphaZip: Neural Network-Enhanced Lossless Text Compression | Sep 23, 2024 | Benchmarking, Data Compression | Code Available | 0 |
| Generating event descriptions under syntactic and semantic constraints | Dec 24, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Language Modeling Using Tensor Trains | May 7, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Characterizing and Understanding the Behavior of Quantized Models for Reliable Deployment | Apr 8, 2022 | Image to text, Language Modeling | Code Available | 0 |
| Generating Hypothetical Events for Abductive Inference | Jun 7, 2021 | Language Modeling, Language Modelling | Code Available | 0 |
| Character-based Neural Networks for Sentence Pair Modeling | May 21, 2018 | Language Modeling, Language Modelling | Code Available | 0 |
| Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies? | Oct 21, 2022 | Image-text matching, Language Modeling | Code Available | 0 |
| Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning | May 30, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Do Text-to-Text Multi-Task Learners Suffer from Task Conflict? | Dec 13, 2022 | Decoder, Language Modeling | Code Available | 0 |
| ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters | Feb 6, 2025 | Decoder, Language Modeling | Code Available | 0 |
| Generating Memorable Mnemonic Encodings of Numbers | May 7, 2017 | Language Modeling, Language Modelling | Code Available | 0 |
| Automated title and abstract screening for scoping reviews using the GPT-4 Large Language Model | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| A Low-Resource Approach to the Grammatical Error Correction of Ukrainian | May 5, 2023 | Grammatical Error Correction, Language Modeling | Code Available | 0 |
| NegatER: Unsupervised Discovery of Negatives in Commonsense Knowledge Bases | Nov 15, 2020 | Data Augmentation, Language Modeling | Code Available | 0 |
| ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation | Jan 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Generating Prototypes for Contradiction Detection Using Large Language Models and Linguistic Rules | Oct 23, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Generating Question-Answer Hierarchies | Jun 6, 2019 | Language Modeling, Language Modelling | Code Available | 0 |
| Generating Repetitions with Appropriate Repeated Words | Jul 3, 2022 | Language Modeling, Language Modelling | Code Available | 0 |
| Generating Sentences by Editing Prototypes | Sep 26, 2017 | Language Modeling, Language Modelling | Code Available | 0 |
| Do RNNs learn human-like abstract word order preferences? | Nov 5, 2018 | Language Modeling, Language Modelling | Code Available | 0 |
| Challenges in Measuring Bias via Open-Ended Language Generation | May 23, 2022 | Language Modeling, Language Modelling | Code Available | 0 |
| Chain-of-Model Learning for Language Model | May 17, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Generating Synthetic Free-text Medical Records with Low Re-identification Risk using Masked Language Modeling | Sep 15, 2024 | Causal Language Modeling, De-identification | Code Available | 0 |