| Enhancing Retrieval-Augmented Generation: A Study of Best Practices | Jan 13, 2025 | In-Context Learning, RAG | Code Available | 2 |
| Diversified Augmentation with Domain Adaptation for Debiased Video Temporal Grounding | Jan 12, 2025 | Data Augmentation, Domain Adaptation | Unverified | 0 |
| AFRIDOC-MT: Document-level MT Corpus for African Languages | Jan 10, 2025 | Machine Translation, NMT | Code Available | 0 |
| Bridging Dialects: Translating Standard Bangla to Regional Variants Using Neural Models | Jan 10, 2025 | Diversity, Machine Translation | Unverified | 0 |
| Biomedical Relation Extraction via Adaptive Document-Relation Cross-Mapping and Concept Unique Identifier | Jan 9, 2025 | RAG, Relation | Unverified | 0 |
| Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction | Jan 9, 2025 | Math, Sentence | Code Available | 0 |
| Enhancing Plagiarism Detection in Marathi with a Weighted Ensemble of TF-IDF and BERT Embeddings for Low-Resource Language Processing | Jan 9, 2025 | Paraphrase Identification, Sentence | Code Available | 0 |
| ParaRev: Building a dataset for Scientific Paragraph Revision annotated with revision instruction | Jan 9, 2025 | Sentence | Unverified | 0 |
| Advancing Retrieval-Augmented Generation for Persian: Development of Language Models, Comprehensive Benchmarks, and Best Practices for Optimization | Jan 8, 2025 | Benchmarking, General Knowledge | Unverified | 0 |
| Multi-label Cross-lingual automatic music genre classification from lyrics with Sentence BERT | Jan 7, 2025 | Classification, Genre classification | Unverified | 0 |
| Progressive Document-level Text Simplification via Large Language Models | Jan 7, 2025 | Document Summarization, Sentence | Unverified | 0 |
| Interactive Information Need Prediction with Intent and Context | Jan 5, 2025 | Prediction, Retrieval | Unverified | 0 |
| Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks | Jan 5, 2025 | Adversarial Robustness, Benchmarking | Code Available | 0 |
| Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation | Jan 2, 2025 | Sentence, Speech Separation | Unverified | 0 |
| Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection | Jan 2, 2025 | Hallucination, Sentence | Unverified | 0 |
| Embedding-based Approaches to Hyperpartisan News Detection | Jan 2, 2025 | Sentence, Sentiment Analysis | Unverified | 0 |
| Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT | Jan 2, 2025 | Polyphone disambiguation, Sentence | Unverified | 0 |
| DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning | Jan 1, 2025 | Document Layout Analysis, Image Segmentation | Unverified | 0 |
| Incremental Dialogue Management: Survey, Discussion, and Implications for HRI | Jan 1, 2025 | Dialogue Management, Management | Unverified | 0 |
| KnowRA: Knowledge Retrieval Augmented Method for Document-level Relation Extraction with Comprehensive Reasoning Abilities | Dec 31, 2024 | Common Sense Reasoning, Document-level Relation Extraction | Unverified | 0 |
| Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria | Dec 30, 2024 | Sentence | Unverified | 0 |
| Counterfactual Samples Constructing and Training for Commonsense Statements Estimation | Dec 29, 2024 | counterfactual, Sentence | Unverified | 0 |
| AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models | Dec 29, 2024 | Relation, Relation Classification | Unverified | 0 |
| Advancing LLM detection in the ALTA 2024 Shared Task: Techniques and Analysis | Dec 26, 2024 | Articles, Sentence | Unverified | 0 |
| CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers | Dec 26, 2024 | Backdoor Attack, Sentence | Code Available | 1 |
| Towards Expressive Video Dubbing with Multiscale Multimodal Context Interaction | Dec 25, 2024 | Graph Attention, Sentence | Code Available | 0 |
| Simple is not Enough: Document-level Text Simplification using Readability and Coherence | Dec 24, 2024 | Sentence, Text Simplification | Unverified | 0 |
| Multiple References with Meaningful Variations Improve Literary Machine Translation | Dec 24, 2024 | Machine Translation, Semantic Similarity | Unverified | 0 |
| Investigating Length Issues in Document-level Machine Translation | Dec 23, 2024 | Document Level Machine Translation, Machine Translation | Unverified | 0 |
| A Dual-Perspective Metaphor Detection Framework Using Large Language Models | Dec 23, 2024 | Decision Making, Knowledge Graphs | Code Available | 0 |
| ERUPD -- English to Roman Urdu Parallel Dataset | Dec 23, 2024 | Machine Translation, Prompt Engineering | Unverified | 0 |
| Underutilization of Syntactic Processing by Chinese Learners of English in Comprehending English Sentences, Evidenced from Adapted Garden-Path Ambiguity Experiment | Dec 21, 2024 | Descriptive, Sentence | Unverified | 0 |
| Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding | Dec 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| EMPRA: Embedding Perturbation Rank Attack against Neural Ranking Models | Dec 20, 2024 | Adversarial Text, Information Retrieval | Code Available | 0 |
| Fine-tuning Whisper on Low-Resource Languages for Real-World Applications | Dec 20, 2024 | Form, Sentence | Code Available | 1 |
| Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network | Dec 20, 2024 | Sentence, Temporal Sentence Grounding | Unverified | 0 |
| EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation | Dec 17, 2024 | Question Answering, RAG | Code Available | 1 |
| Improving Explainability of Sentence-level Metrics via Edit-level Attribution for Grammatical Error Correction | Dec 17, 2024 | Attribute, Grammatical Error Correction | Code Available | 0 |
| Transferable and Forecastable User Targeting Foundation Model | Dec 17, 2024 | Marketing, model | Unverified | 0 |
| Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation | Dec 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval | Dec 17, 2024 | Contrastive Learning, Information Retrieval | Code Available | 1 |
| Assessing the Limitations of Large Language Models in Clinical Fact Decomposition | Dec 17, 2024 | Fact Verification, Sentence | Code Available | 1 |
| Detecting Document-level Paraphrased Machine Generated Content: Mimicking Human Writing Style and Involving Discourse Features | Dec 17, 2024 | Misinformation, Sentence | Unverified | 0 |
| An Incremental Clustering Baseline for Event Detection on Twitter | Dec 16, 2024 | Clustering, Event Detection | Code Available | 0 |
| Emergence of Abstractions: Concept Encoding and Decoding Mechanism for In-Context Learning in Transformers | Dec 16, 2024 | In-Context Learning, Sentence | Unverified | 0 |
| Token Prepending: A Training-Free Approach for Eliciting Better Sentence Embeddings from LLMs | Dec 16, 2024 | Prompt Engineering, Semantic Textual Similarity | Unverified | 0 |
| Feature engineering vs. deep learning for paper section identification: Toward applications in Chinese medical literature | Dec 15, 2024 | Deep Learning, Feature Engineering | Unverified | 0 |
| Quantifying Positional Biases in Text Embedding Models | Dec 13, 2024 | Information Retrieval, Position | Code Available | 0 |
| No Free Lunch for Defending Against Prefilling Attack by In-Context Learning | Dec 13, 2024 | In-Context Learning, Safety Alignment | Unverified | 0 |
| The role of inhibitory control in garden-path sentence processing: A Chinese-English bilingual perspective | Dec 13, 2024 | Sentence | Unverified | 0 |