| Generate, Discriminate, Evolve: Enhancing Context Faithfulness via Fine-Grained Sentence-Level Self-Evolution | Mar 3, 2025 | counterfactualDomain Adaptation | —Unverified | 0 |
| OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment | Mar 3, 2025 | Anomaly LocalizationClassification | CodeCode Available | 0 |
| Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference | Mar 1, 2025 | Sentence | —Unverified | 0 |
| Autoencoder-Based Framework to Capture Vocabulary Quality in NLP | Feb 28, 2025 | DiversitySentence | —Unverified | 0 |
| The Noisy Path from Source to Citation: Measuring How Scholars Engage with Past Research | Feb 27, 2025 | Sentence | —Unverified | 0 |
| Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models | Feb 27, 2025 | counterfactualLanguage Modeling | —Unverified | 0 |
| Revisiting Word Embeddings in the LLM Era | Feb 26, 2025 | DecoderSentence | —Unverified | 0 |
| A Cooperative Multi-Agent Framework for Zero-Shot Named Entity Recognition | Feb 25, 2025 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 0 |
| Implicit Word Reordering with Knowledge Distillation for Cross-Lingual Dependency Parsing | Feb 24, 2025 | Cross-Lingual TransferDependency Parsing | —Unverified | 0 |
| Dependency Parsing with the Structuralized Prompt Template | Feb 24, 2025 | Dependency ParsingSentence | —Unverified | 0 |
| A Hybrid Approach to Information Retrieval and Answer Generation for Regulatory Texts | Feb 24, 2025 | Answer GenerationInformation Retrieval | CodeCode Available | 0 |
| LongAttn: Selecting Long-context Training Data via Token-level Attention | Feb 24, 2025 | Sentence | CodeCode Available | 1 |
| Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences | Feb 24, 2025 | Adversarial AttackAdversarial Robustness | —Unverified | 0 |
| DISC: DISC: Dynamic Decomposition Improves LLM Inference Scaling | Feb 23, 2025 | Computational EfficiencyMath | —Unverified | 0 |
| FanChuan: A Multilingual and Graph-Structured Benchmark For Parody Detection and Analysis | Feb 23, 2025 | SentenceSentence Embedding | CodeCode Available | 1 |
| OrderSum: Semantic Sentence Ordering for Extractive Summarization | Feb 22, 2025 | Extractive SummarizationSentence | CodeCode Available | 0 |
| Single-pass Detection of Jailbreaking Input in Large Language Models | Feb 21, 2025 | Sentence | —Unverified | 0 |
| Med-gte-hybrid: A contextual embedding transformer model for extracting actionable information from clinical texts | Feb 21, 2025 | Contrastive LearningDecision Making | —Unverified | 0 |
| Evolutionary Algorithms Approach For Search Based On Semantic Document Similarity | Feb 20, 2025 | Cloud ComputingDistributed Computing | —Unverified | 0 |
| Entity Framing and Role Portrayal in the News | Feb 20, 2025 | ArticlesSentence | —Unverified | 0 |
| Sentence Smith: Formally Controllable Text Transformation and its Application to Evaluation of Text Embedding Models | Feb 20, 2025 | BenchmarkingSentence | —Unverified | 0 |
| Mitigating Lost-in-Retrieval Problems in Retrieval Augmented Multi-Hop Question Answering | Feb 20, 2025 | Answer GenerationMulti-hop Question Answering | —Unverified | 0 |
| Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity | Feb 20, 2025 | GPULanguage Modeling | CodeCode Available | 0 |
| Optimal word order for non-causal text generation with Large Language Models: the Spanish case | Feb 20, 2025 | DecoderSentence | —Unverified | 0 |
| Multi-Scale and Multi-Objective Optimization for Cross-Lingual Aspect-Based Sentiment Analysis | Feb 19, 2025 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | —Unverified | 0 |
| Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models | Feb 19, 2025 | Contrastive LearningSentence | CodeCode Available | 2 |
| Task-agnostic Prompt Compression with Context-aware Sentence Embedding and Reward-guided Task Descriptor | Feb 19, 2025 | SentenceSentence Embedding | —Unverified | 0 |
| Meaning Beyond Truth Conditions: Evaluating Discourse Level Understanding via Anaphora Accessibility | Feb 19, 2025 | DiagnosticNatural Language Understanding | —Unverified | 0 |
| SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin | Feb 19, 2025 | GPULogical Reasoning | —Unverified | 0 |
| Complex Ontology Matching with Large Language Model Embeddings | Feb 19, 2025 | Graph MatchingLanguage Modeling | —Unverified | 0 |
| A Fuzzy Evaluation of Sentence Encoders on Grooming Risk Classification | Feb 18, 2025 | SentenceTAG | —Unverified | 0 |
| Contrast-Unity for Partially-Supervised Temporal Sentence Grounding | Feb 18, 2025 | Contrastive LearningDenoising | —Unverified | 0 |
| Performance Evaluation of Sentiment Analysis on Text and Emoji Data Using End-to-End, Transfer Learning, Distributed and Explainable AI Models | Feb 18, 2025 | MarketingSentence | —Unverified | 0 |
| Towards Practical First-Order Model Counting | Feb 17, 2025 | C++ codemodel | —Unverified | 0 |
| Aligning Sentence Simplification with ESL Learner's Proficiency for Language Acquisition | Feb 17, 2025 | DiversityLanguage Acquisition | CodeCode Available | 0 |
| Can Your Uncertainty Scores Detect Hallucinated Entity? | Feb 17, 2025 | HallucinationSentence | —Unverified | 0 |
| If Attention Serves as a Cognitive Model of Human Memory Retrieval, What is the Plausible Memory Representation? | Feb 17, 2025 | RetrievalSentence | —Unverified | 0 |
| EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models | Feb 17, 2025 | Automated Essay ScoringFeature Engineering | —Unverified | 0 |
| Evaluating Large language models on Understanding Korean indirect Speech acts | Feb 16, 2025 | Sentence | —Unverified | 0 |
| Probing Semantic Routing in Large Mixture-of-Expert Models | Feb 15, 2025 | Mixture-of-ExpertsSentence | —Unverified | 0 |
| PropNet: a White-Box and Human-Like Network for Sentence Representation | Feb 15, 2025 | Semantic Textual SimilaritySentence | —Unverified | 0 |
| Rethinking Evaluation Metrics for Grammatical Error Correction: Why Use a Different Evaluation Process than Human? | Feb 13, 2025 | Grammatical Error CorrectionSentence | CodeCode Available | 1 |
| When the LM misunderstood the human chuckled: Analyzing garden path effects in humans and language models | Feb 13, 2025 | Image GenerationSentence | —Unverified | 0 |
| SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models | Feb 13, 2025 | Long Form Question AnsweringQuestion Answering | CodeCode Available | 0 |
| ParetoRAG: Leveraging Sentence-Context Attention for Robust and Efficient Retrieval-Augmented Generation | Feb 12, 2025 | RAGRetrieval | —Unverified | 0 |
| Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples | Feb 12, 2025 | Distractor GenerationInformation Retrieval | —Unverified | 0 |
| Lexical Manifold Reconfiguration in Large Language Models: A Novel Architectural Approach for Contextual Modulation | Feb 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Making Language Models Robust Against Negation | Feb 11, 2025 | Natural Language UnderstandingNegation | —Unverified | 0 |
| A Large-Scale Benchmark for Vietnamese Sentence Paraphrases | Feb 11, 2025 | Paraphrase GenerationSentence | CodeCode Available | 0 |
| Perceived Confidence Scoring for Data Annotation with Zero-Shot LLMs | Feb 11, 2025 | Sentence | —Unverified | 0 |