| MASS: Masked Sequence to Sequence Pre-training for Language Generation | May 7, 2019 | Conversational Response GenerationDecoder | CodeCode Available | 2 |
| Toward Controlled Generation of Text | Mar 2, 2017 | AttributeSentence | CodeCode Available | 2 |
| Mitigating Object Hallucinations via Sentence-Level Early Intervention | Jul 16, 2025 | HallucinationMM-Vet | CodeCode Available | 1 |
| TokAlign: Efficient Vocabulary Adaptation via Token Alignment | Jun 4, 2025 | SentenceText Compression | CodeCode Available | 1 |
| Sentinel: Attention Probing of Proxy Models for LLM Context Compression with an Understanding Perspective | May 29, 2025 | DecoderRAG | CodeCode Available | 1 |
| A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations | May 20, 2025 | SentenceSentence Classification | CodeCode Available | 1 |
| Words That Unite The World: A Unified Framework for Deciphering Central Bank Communications Globally | May 15, 2025 | BenchmarkingSentence | CodeCode Available | 1 |
| Chronocept: Instilling a Sense of Time in Machines | May 12, 2025 | Fact CheckingRAG | CodeCode Available | 1 |
| GOAL: Global-local Object Alignment Learning | Mar 22, 2025 | DescriptiveObject | CodeCode Available | 1 |
| LongAttn: Selecting Long-context Training Data via Token-level Attention | Feb 24, 2025 | Sentence | CodeCode Available | 1 |
| FanChuan: A Multilingual and Graph-Structured Benchmark For Parody Detection and Analysis | Feb 23, 2025 | SentenceSentence Embedding | CodeCode Available | 1 |
| Rethinking Evaluation Metrics for Grammatical Error Correction: Why Use a Different Evaluation Process than Human? | Feb 13, 2025 | Grammatical Error CorrectionSentence | CodeCode Available | 1 |
| AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements | Feb 10, 2025 | Sentence | CodeCode Available | 1 |
| SimMark: A Robust Sentence-Level Similarity-Based Watermarking Algorithm for Large Language Models | Feb 5, 2025 | SentenceSentence Embeddings | CodeCode Available | 1 |
| Enhancing Biomedical Relation Extraction with Directionality | Jan 23, 2025 | BenchmarkingDocument-level Relation Extraction | CodeCode Available | 1 |
| FlanEC: Exploring Flan-T5 for Post-ASR Error Correction | Jan 22, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| ChartInsighter: An Approach for Mitigating Hallucination in Time-series Chart Summary Generation with A Benchmark Dataset | Jan 16, 2025 | HallucinationSentence | CodeCode Available | 1 |
| Enhancing Automated Interpretability with Output-Centric Feature Descriptions | Jan 14, 2025 | Sentence | CodeCode Available | 1 |
| CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers | Dec 26, 2024 | Backdoor AttackSentence | CodeCode Available | 1 |
| Fine-tuning Whisper on Low-Resource Languages for Real-World Applications | Dec 20, 2024 | FormSentence | CodeCode Available | 1 |
| Assessing the Limitations of Large Language Models in Clinical Fact Decomposition | Dec 17, 2024 | Fact VerificationSentence | CodeCode Available | 1 |
| EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation | Dec 17, 2024 | Question AnsweringRAG | CodeCode Available | 1 |
| CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval | Dec 17, 2024 | Contrastive LearningInformation Retrieval | CodeCode Available | 1 |
| Robust Multi-bit Text Watermark with LLM-based Paraphrasers | Dec 4, 2024 | DecoderSentence | CodeCode Available | 1 |
| Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension | Dec 4, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 |
| Multi-label Sequential Sentence Classification via Large Language Model | Nov 23, 2024 | Contrastive LearningExtractive Summarization | CodeCode Available | 1 |
| Gumbel Counterfactual Generation From Language Models | Nov 11, 2024 | counterfactualCounterfactual Reasoning | CodeCode Available | 1 |
| CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research | Nov 2, 2024 | Line DetectionSemantic Similarity | CodeCode Available | 1 |
| Fine-Grained and Multi-Dimensional Metrics for Document-Level Machine Translation | Oct 28, 2024 | Document Level Machine TranslationMachine Translation | CodeCode Available | 1 |
| Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction | Oct 24, 2024 | Representation LearningSentence | CodeCode Available | 1 |
| LESS: Label-Efficient and Single-Stage Referring 3D Segmentation | Oct 17, 2024 | cross-modal alignmentInstance Segmentation | CodeCode Available | 1 |
| A Closer Look at Machine Unlearning for Large Language Models | Oct 10, 2024 | DiversityMachine Unlearning | CodeCode Available | 1 |
| ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time | Oct 9, 2024 | Sentence | CodeCode Available | 1 |
| Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes | Oct 8, 2024 | ArticlesClassification | CodeCode Available | 1 |
| CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation | Oct 3, 2024 | Abstractive Text SummarizationHallucination | CodeCode Available | 1 |
| FactAlign: Long-form Factuality Alignment of Large Language Models | Oct 2, 2024 | FormHallucination | CodeCode Available | 1 |
| RisingBALLER: A player is a token, a match is a sentence, A path towards a foundational model for football players data analytics | Oct 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MEXMA: Token-level objectives improve sentence representations | Sep 19, 2024 | Sentence | CodeCode Available | 1 |
| Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference | Sep 2, 2024 | Computational EfficiencySentence | CodeCode Available | 1 |
| Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language | Sep 2, 2024 | Lip ReadingSentence | CodeCode Available | 1 |
| Balancing Diversity and Risk in LLM Sampling: How to Select Your Method and Parameter for Open-Ended Text Generation | Aug 24, 2024 | DiversitySentence | CodeCode Available | 1 |
| Multimodal Causal Reasoning Benchmark: Challenging Vision Large Language Models to Infer Causal Links Between Siamese Images | Aug 15, 2024 | Image GenerationSentence | CodeCode Available | 1 |
| FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Aug 12, 2024 | Answer GenerationDecoder | CodeCode Available | 1 |
| SentenceVAE: Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context | Aug 1, 2024 | DecoderSentence | CodeCode Available | 1 |
| Can Editing LLMs Inject Harm? | Jul 29, 2024 | FairnessGeneral Knowledge | CodeCode Available | 1 |
| ClinicRealm: Re-evaluating Large Language Models with Conventional Machine Learning for Non-Generative Clinical Prediction Tasks | Jul 26, 2024 | BenchmarkingModel Selection | CodeCode Available | 1 |
| AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description | Jul 22, 2024 | Sentence | CodeCode Available | 1 |
| Multi-Grained Query-Guided Set Prediction Network for Grounded Multimodal Named Entity Recognition | Jul 17, 2024 | Grounded Multimodal Named Entity RecognitionMachine Reading Comprehension | CodeCode Available | 1 |
| AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning | Jul 9, 2024 | Keyword ExtractionSentence | CodeCode Available | 1 |
| FineSurE: Fine-grained Summarization Evaluation using LLMs | Jul 1, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |