| AFRIDOC-MT: Document-level MT Corpus for African Languages | Jan 10, 2025 | Machine Translation, NMT | Code Available | 0 |
| Enhancing Plagiarism Detection in Marathi with a Weighted Ensemble of TF-IDF and BERT Embeddings for Low-Resource Language Processing | Jan 9, 2025 | Paraphrase Identification, Sentence | Code Available | 0 |
| Biomedical Relation Extraction via Adaptive Document-Relation Cross-Mapping and Concept Unique Identifier | Jan 9, 2025 | RAG, Relation | Unverified | 0 |
| ParaRev: Building a dataset for Scientific Paragraph Revision annotated with revision instruction | Jan 9, 2025 | Sentence | Unverified | 0 |
| Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction | Jan 9, 2025 | Math, Sentence | Code Available | 0 |
| Advancing Retrieval-Augmented Generation for Persian: Development of Language Models, Comprehensive Benchmarks, and Best Practices for Optimization | Jan 8, 2025 | Benchmarking, General Knowledge | Unverified | 0 |
| Multi-label Cross-lingual automatic music genre classification from lyrics with Sentence BERT | Jan 7, 2025 | Classification, Genre classification | Unverified | 0 |
| Progressive Document-level Text Simplification via Large Language Models | Jan 7, 2025 | Document Summarization, Sentence | Unverified | 0 |
| Interactive Information Need Prediction with Intent and Context | Jan 5, 2025 | Prediction, Retrieval | Unverified | 0 |
| Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks | Jan 5, 2025 | Adversarial Robustness, Benchmarking | Code Available | 0 |