| MASS: Masked Sequence to Sequence Pre-training for Language Generation | May 7, 2019 | Conversational Response GenerationDecoder | CodeCode Available | 2 |
| Toward Controlled Generation of Text | Mar 2, 2017 | AttributeSentence | CodeCode Available | 2 |
| Mitigating Object Hallucinations via Sentence-Level Early Intervention | Jul 16, 2025 | HallucinationMM-Vet | CodeCode Available | 1 |
| TokAlign: Efficient Vocabulary Adaptation via Token Alignment | Jun 4, 2025 | SentenceText Compression | CodeCode Available | 1 |
| Sentinel: Attention Probing of Proxy Models for LLM Context Compression with an Understanding Perspective | May 29, 2025 | DecoderRAG | CodeCode Available | 1 |
| A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations | May 20, 2025 | SentenceSentence Classification | CodeCode Available | 1 |
| Words That Unite The World: A Unified Framework for Deciphering Central Bank Communications Globally | May 15, 2025 | BenchmarkingSentence | CodeCode Available | 1 |
| Chronocept: Instilling a Sense of Time in Machines | May 12, 2025 | Fact CheckingRAG | CodeCode Available | 1 |
| GOAL: Global-local Object Alignment Learning | Mar 22, 2025 | DescriptiveObject | CodeCode Available | 1 |
| LongAttn: Selecting Long-context Training Data via Token-level Attention | Feb 24, 2025 | Sentence | CodeCode Available | 1 |
| FanChuan: A Multilingual and Graph-Structured Benchmark For Parody Detection and Analysis | Feb 23, 2025 | SentenceSentence Embedding | CodeCode Available | 1 |
| Rethinking Evaluation Metrics for Grammatical Error Correction: Why Use a Different Evaluation Process than Human? | Feb 13, 2025 | Grammatical Error CorrectionSentence | CodeCode Available | 1 |
| AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements | Feb 10, 2025 | Sentence | CodeCode Available | 1 |
| SimMark: A Robust Sentence-Level Similarity-Based Watermarking Algorithm for Large Language Models | Feb 5, 2025 | SentenceSentence Embeddings | CodeCode Available | 1 |
| Enhancing Biomedical Relation Extraction with Directionality | Jan 23, 2025 | BenchmarkingDocument-level Relation Extraction | CodeCode Available | 1 |
| FlanEC: Exploring Flan-T5 for Post-ASR Error Correction | Jan 22, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| ChartInsighter: An Approach for Mitigating Hallucination in Time-series Chart Summary Generation with A Benchmark Dataset | Jan 16, 2025 | HallucinationSentence | CodeCode Available | 1 |
| Enhancing Automated Interpretability with Output-Centric Feature Descriptions | Jan 14, 2025 | Sentence | CodeCode Available | 1 |
| CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers | Dec 26, 2024 | Backdoor AttackSentence | CodeCode Available | 1 |
| Fine-tuning Whisper on Low-Resource Languages for Real-World Applications | Dec 20, 2024 | FormSentence | CodeCode Available | 1 |
| Assessing the Limitations of Large Language Models in Clinical Fact Decomposition | Dec 17, 2024 | Fact VerificationSentence | CodeCode Available | 1 |
| EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation | Dec 17, 2024 | Question AnsweringRAG | CodeCode Available | 1 |
| CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval | Dec 17, 2024 | Contrastive LearningInformation Retrieval | CodeCode Available | 1 |
| Robust Multi-bit Text Watermark with LLM-based Paraphrasers | Dec 4, 2024 | DecoderSentence | CodeCode Available | 1 |
| Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension | Dec 4, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 |