| A Surprising Failure? Multimodal LLMs and the NLVR Challenge | Feb 26, 2024 | SentenceSpatial Reasoning | —Unverified | 0 |
| Semantic change detection for Slovene language: a novel dataset and an approach based on optimal transport | Feb 26, 2024 | Change DetectionSentence | CodeCode Available | 0 |
| Hitting "Probe"rty with Non-Linearity, and More | Feb 25, 2024 | Sentence | —Unverified | 0 |
| Likelihood-based Mitigation of Evaluation Bias in Large Language Models | Feb 25, 2024 | Grammatical Error CorrectionIn-Context Learning | CodeCode Available | 0 |
| Improving Sentence Embeddings with Automatic Generation of Training Data Using Few-shot Examples | Feb 23, 2024 | Dataset GenerationDecoder | CodeCode Available | 0 |
| Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency | Feb 23, 2024 | AttributeSentence | —Unverified | 0 |
| GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection? | Feb 23, 2024 | DiagnosticHate Speech Detection | CodeCode Available | 0 |
| DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators | Feb 23, 2024 | DecoderMachine Translation | CodeCode Available | 0 |
| Self-Adaptive Reconstruction with Contrastive Learning for Unsupervised Sentence Embeddings | Feb 23, 2024 | Contrastive LearningSentence | —Unverified | 0 |
| GATE X-E : A Challenge Set for Gender-Fair Translations from Weakly-Gendered Languages | Feb 22, 2024 | Machine TranslationNMT | —Unverified | 0 |
| Noise-BERT: A Unified Perturbation-Robust Framework with Noise Alignment Pre-training for Noisy Slot Filling Task | Feb 22, 2024 | Adversarial AttackContrastive Learning | —Unverified | 0 |
| Overview of the VLSP 2023 -- ComOM Shared Task: A Data Challenge for Comparative Opinion Mining from Vietnamese Product Reviews | Feb 21, 2024 | Opinion MiningSentence | —Unverified | 0 |
| Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions | Feb 21, 2024 | In-Context LearningKnowledge Distillation | —Unverified | 0 |
| Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment | Feb 21, 2024 | Sentence | CodeCode Available | 0 |
| Explaining Relationships Among Research Papers | Feb 20, 2024 | Sentence | —Unverified | 0 |
| Are ELECTRA's Sentence Embeddings Beyond Repair? The Case of Semantic Textual Similarity | Feb 20, 2024 | Semantic Textual SimilaritySentence | CodeCode Available | 0 |
| ChatEL: Entity Linking with Chatbots | Feb 20, 2024 | Entity LinkingSentence | CodeCode Available | 0 |
| More Discriminative Sentence Embeddings via Semantic Graph Smoothing | Feb 20, 2024 | ClusteringSentence | CodeCode Available | 0 |
| Simpson's Paradox and the Accuracy-Fluency Tradeoff in Translation | Feb 20, 2024 | SentenceTranslation | —Unverified | 0 |
| UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation | Feb 20, 2024 | Machine TranslationNatural Language Understanding | CodeCode Available | 0 |
| Ontology Enhanced Claim Detection | Feb 19, 2024 | SentenceSentence Embeddings | —Unverified | 0 |
| Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST! | Feb 19, 2024 | Sentence | CodeCode Available | 0 |
| NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms | Feb 19, 2024 | Machine TranslationNatural Language Understanding | CodeCode Available | 0 |
| Asynchronous and Segmented Bidirectional Encoding for NMT | Feb 19, 2024 | Machine TranslationNMT | —Unverified | 0 |
| Can Deception Detection Go Deeper? Dataset, Evaluation, and Benchmark for Deception Reasoning | Feb 18, 2024 | Deception DetectionSentence | —Unverified | 0 |
| Shaping Human-AI Collaboration: Varied Scaffolding Levels in Co-writing with Language Models | Feb 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Syntactic Language Change in English and German: Metrics, Parsers, and Convergences | Feb 18, 2024 | Optical Character Recognition (OCR)Sentence | CodeCode Available | 0 |
| Where is the answer? Investigating Positional Bias in Language Model Knowledge Extraction | Feb 16, 2024 | DenoisingLanguage Modeling | CodeCode Available | 0 |
| UMAIR-FPS: User-aware Multi-modal Animation Illustration Recommendation Fusion with Painting Style | Feb 16, 2024 | Image GenerationMulti-modal Recommendation | CodeCode Available | 0 |
| GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models | Feb 16, 2024 | RelationRelation Extraction | CodeCode Available | 0 |
| Neural paraphrasing by automatically crawled and aligned sentence pairs | Feb 16, 2024 | SentenceText Generation | —Unverified | 0 |
| Word Embeddings Revisited: Do LLMs Offer Something New? | Feb 16, 2024 | Document EmbeddingLanguage Modeling | —Unverified | 0 |
| `Keep it Together': Enforcing Cohesion in Extractive Summaries by Simulating Human Memory | Feb 16, 2024 | InformativenessSentence | —Unverified | 0 |
| Comparing Hallucination Detection Metrics for Multilingual Generation | Feb 16, 2024 | HallucinationNatural Language Inference | —Unverified | 0 |
| Grounding Language Model with Chunking-Free In-Context Retrieval | Feb 15, 2024 | ChunkingLanguage Modeling | —Unverified | 0 |
| The optimal placement of the head in the noun phrase. The case of demonstrative, numeral, adjective and noun | Feb 15, 2024 | Sentence | —Unverified | 0 |
| Integrating ChatGPT into Secure Hospital Networks: A Case Study on Improving Radiology Report Analysis | Feb 14, 2024 | Contrastive LearningKnowledge Distillation | —Unverified | 0 |
| OrderBkd: Textual backdoor attack through repositioning | Feb 12, 2024 | Backdoor AttackPOS | CodeCode Available | 0 |
| Text Detoxification as Style Transfer in English and Hindi | Feb 12, 2024 | Multi-Task LearningSentence | CodeCode Available | 0 |
| Auxiliary Tasks to Boost Biaffine Semantic Dependency Parsing | Feb 12, 2024 | Dependency ParsingSemantic Dependency Parsing | CodeCode Available | 0 |
| A Rational Analysis of the Speech-to-Song Illusion | Feb 10, 2024 | Sentence | —Unverified | 0 |
| Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method | Feb 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| FaBERT: Pre-training BERT on Persian Blogs | Feb 9, 2024 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| Zero-Shot Chain-of-Thought Reasoning Guided by Evolutionary Algorithms in Large Language Models | Feb 8, 2024 | Evolutionary AlgorithmsSentence | —Unverified | 0 |
| Phonetically rich corpus construction for a low-resourced language | Feb 8, 2024 | Sentence | —Unverified | 0 |
| Detecting Generated Native Ads in Conversational Search | Feb 7, 2024 | Conversational SearchSentence | CodeCode Available | 0 |
| Source Identification in Abstractive Summarization | Feb 7, 2024 | Abstractive Text SummarizationSentence | CodeCode Available | 0 |
| RankSum An unsupervised extractive text summarization based on rank fusion | Feb 7, 2024 | Extractive Text SummarizationSentence | —Unverified | 0 |
| Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification | Feb 6, 2024 | BenchmarkingMultiple-choice | —Unverified | 0 |
| Linguistic features for sentence difficulty prediction in ABSA | Feb 5, 2024 | Aspect-Based Sentiment AnalysisDiversity | —Unverified | 0 |