| Enhancing Retrieval-Augmented Generation: A Study of Best Practices | Jan 13, 2025 | In-Context LearningRAG | CodeCode Available | 2 |
| Diversified Augmentation with Domain Adaptation for Debiased Video Temporal Grounding | Jan 12, 2025 | Data AugmentationDomain Adaptation | —Unverified | 0 |
| AFRIDOC-MT: Document-level MT Corpus for African Languages | Jan 10, 2025 | Machine TranslationNMT | CodeCode Available | 0 |
| Bridging Dialects: Translating Standard Bangla to Regional Variants Using Neural Models | Jan 10, 2025 | DiversityMachine Translation | —Unverified | 0 |
| ParaRev: Building a dataset for Scientific Paragraph Revision annotated with revision instruction | Jan 9, 2025 | Sentence | —Unverified | 0 |
| Biomedical Relation Extraction via Adaptive Document-Relation Cross-Mapping and Concept Unique Identifier | Jan 9, 2025 | RAGRelation | —Unverified | 0 |
| Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction | Jan 9, 2025 | MathSentence | CodeCode Available | 0 |
| Enhancing Plagiarism Detection in Marathi with a Weighted Ensemble of TF-IDF and BERT Embeddings for Low-Resource Language Processing | Jan 9, 2025 | Paraphrase IdentificationSentence | CodeCode Available | 0 |
| Advancing Retrieval-Augmented Generation for Persian: Development of Language Models, Comprehensive Benchmarks, and Best Practices for Optimization | Jan 8, 2025 | BenchmarkingGeneral Knowledge | —Unverified | 0 |
| Multi-label Cross-lingual automatic music genre classification from lyrics with Sentence BERT | Jan 7, 2025 | ClassificationGenre classification | —Unverified | 0 |
| Progressive Document-level Text Simplification via Large Language Models | Jan 7, 2025 | Document SummarizationSentence | —Unverified | 0 |
| Interactive Information Need Prediction with Intent and Context | Jan 5, 2025 | PredictionRetrieval | —Unverified | 0 |
| Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks | Jan 5, 2025 | Adversarial RobustnessBenchmarking | CodeCode Available | 0 |
| Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation | Jan 2, 2025 | SentenceSpeech Separation | —Unverified | 0 |
| Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection | Jan 2, 2025 | HallucinationSentence | —Unverified | 0 |
| Embedding-based Approaches to Hyperpartisan News Detection | Jan 2, 2025 | SentenceSentiment Analysis | —Unverified | 0 |
| Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT | Jan 2, 2025 | Polyphone disambiguationSentence | —Unverified | 0 |
| DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning | Jan 1, 2025 | Document Layout AnalysisImage Segmentation | —Unverified | 0 |
| Incremental Dialogue Management: Survey, Discussion, and Implications for HRI | Jan 1, 2025 | Dialogue ManagementManagement | —Unverified | 0 |
| KnowRA: Knowledge Retrieval Augmented Method for Document-level Relation Extraction with Comprehensive Reasoning Abilities | Dec 31, 2024 | Common Sense ReasoningDocument-level Relation Extraction | —Unverified | 0 |
| Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria | Dec 30, 2024 | Sentence | —Unverified | 0 |
| AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models | Dec 29, 2024 | RelationRelation Classification | —Unverified | 0 |
| Counterfactual Samples Constructing and Training for Commonsense Statements Estimation | Dec 29, 2024 | counterfactualSentence | —Unverified | 0 |
| Advancing LLM detection in the ALTA 2024 Shared Task: Techniques and Analysis | Dec 26, 2024 | ArticlesSentence | —Unverified | 0 |
| CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers | Dec 26, 2024 | Backdoor AttackSentence | CodeCode Available | 1 |