| Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency | Feb 23, 2024 | AttributeSentence | —Unverified | 0 |
| GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection? | Feb 23, 2024 | DiagnosticHate Speech Detection | CodeCode Available | 0 |
| Improving Sentence Embeddings with Automatic Generation of Training Data Using Few-shot Examples | Feb 23, 2024 | Dataset GenerationDecoder | CodeCode Available | 0 |
| Seeing is Believing: Mitigating Hallucination in Large Vision-Language Models via CLIP-Guided Decoding | Feb 23, 2024 | HallucinationObject | CodeCode Available | 1 |
| Self-Adaptive Reconstruction with Contrastive Learning for Unsupervised Sentence Embeddings | Feb 23, 2024 | Contrastive LearningSentence | —Unverified | 0 |
| DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators | Feb 23, 2024 | DecoderMachine Translation | CodeCode Available | 0 |
| 2D Matryoshka Sentence Embeddings | Feb 22, 2024 | RAGRepresentation Learning | CodeCode Available | 4 |
| GATE X-E : A Challenge Set for Gender-Fair Translations from Weakly-Gendered Languages | Feb 22, 2024 | Machine TranslationNMT | —Unverified | 0 |
| Noise-BERT: A Unified Perturbation-Robust Framework with Noise Alignment Pre-training for Noisy Slot Filling Task | Feb 22, 2024 | Adversarial AttackContrastive Learning | —Unverified | 0 |
| Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective | Feb 22, 2024 | HallucinationSentence | CodeCode Available | 2 |
| Overview of the VLSP 2023 -- ComOM Shared Task: A Data Challenge for Comparative Opinion Mining from Vietnamese Product Reviews | Feb 21, 2024 | Opinion MiningSentence | —Unverified | 0 |
| Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment | Feb 21, 2024 | Sentence | CodeCode Available | 0 |
| Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions | Feb 21, 2024 | In-Context LearningKnowledge Distillation | —Unverified | 0 |
| ChatEL: Entity Linking with Chatbots | Feb 20, 2024 | Entity LinkingSentence | CodeCode Available | 0 |
| Explaining Relationships Among Research Papers | Feb 20, 2024 | Sentence | —Unverified | 0 |
| UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation | Feb 20, 2024 | Machine TranslationNatural Language Understanding | CodeCode Available | 0 |
| SiLLM: Large Language Models for Simultaneous Machine Translation | Feb 20, 2024 | Machine TranslationSentence | CodeCode Available | 1 |
| Are ELECTRA's Sentence Embeddings Beyond Repair? The Case of Semantic Textual Similarity | Feb 20, 2024 | Semantic Textual SimilaritySentence | CodeCode Available | 0 |
| Simpson's Paradox and the Accuracy-Fluency Tradeoff in Translation | Feb 20, 2024 | SentenceTranslation | —Unverified | 0 |
| Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations | Feb 20, 2024 | Sentence | CodeCode Available | 2 |
| More Discriminative Sentence Embeddings via Semantic Graph Smoothing | Feb 20, 2024 | ClusteringSentence | CodeCode Available | 0 |
| TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization | Feb 20, 2024 | HallucinationNews Summarization | CodeCode Available | 1 |
| Asynchronous and Segmented Bidirectional Encoding for NMT | Feb 19, 2024 | Machine TranslationNMT | —Unverified | 0 |
| Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST! | Feb 19, 2024 | Sentence | CodeCode Available | 0 |
| Machine-Generated Text Localization | Feb 19, 2024 | Binary ClassificationMisinformation | CodeCode Available | 1 |
| NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms | Feb 19, 2024 | Machine TranslationNatural Language Understanding | CodeCode Available | 0 |
| Ontology Enhanced Claim Detection | Feb 19, 2024 | SentenceSentence Embeddings | —Unverified | 0 |
| Can Deception Detection Go Deeper? Dataset, Evaluation, and Benchmark for Deception Reasoning | Feb 18, 2024 | Deception DetectionSentence | —Unverified | 0 |
| Shaping Human-AI Collaboration: Varied Scaffolding Levels in Co-writing with Language Models | Feb 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Syntactic Language Change in English and German: Metrics, Parsers, and Convergences | Feb 18, 2024 | Optical Character Recognition (OCR)Sentence | CodeCode Available | 0 |
| Contrastive Instruction Tuning | Feb 17, 2024 | Sentence | CodeCode Available | 1 |
| Word Embeddings Revisited: Do LLMs Offer Something New? | Feb 16, 2024 | Document EmbeddingLanguage Modeling | —Unverified | 0 |
| Where is the answer? Investigating Positional Bias in Language Model Knowledge Extraction | Feb 16, 2024 | DenoisingLanguage Modeling | CodeCode Available | 0 |
| UMAIR-FPS: User-aware Multi-modal Animation Illustration Recommendation Fusion with Painting Style | Feb 16, 2024 | Image GenerationMulti-modal Recommendation | CodeCode Available | 0 |
| LongHeads: Multi-Head Attention is Secretly a Long Context Processor | Feb 16, 2024 | Sentence | CodeCode Available | 1 |
| Comparing Hallucination Detection Metrics for Multilingual Generation | Feb 16, 2024 | HallucinationNatural Language Inference | —Unverified | 0 |
| `Keep it Together': Enforcing Cohesion in Extractive Summaries by Simulating Human Memory | Feb 16, 2024 | InformativenessSentence | —Unverified | 0 |
| Neural paraphrasing by automatically crawled and aligned sentence pairs | Feb 16, 2024 | SentenceText Generation | —Unverified | 0 |
| GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models | Feb 16, 2024 | RelationRelation Extraction | CodeCode Available | 0 |
| The optimal placement of the head in the noun phrase. The case of demonstrative, numeral, adjective and noun | Feb 15, 2024 | Sentence | —Unverified | 0 |
| Grounding Language Model with Chunking-Free In-Context Retrieval | Feb 15, 2024 | ChunkingLanguage Modeling | —Unverified | 0 |
| Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking | Feb 14, 2024 | BenchmarkingLanguage Modelling | CodeCode Available | 1 |
| Integrating ChatGPT into Secure Hospital Networks: A Case Study on Improving Radiology Report Analysis | Feb 14, 2024 | Contrastive LearningKnowledge Distillation | —Unverified | 0 |
| SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages | Feb 13, 2024 | Semantic SimilaritySemantic Textual Similarity | CodeCode Available | 1 |
| Pixel Sentence Representation Learning | Feb 13, 2024 | Natural Language InferenceRepresentation Learning | CodeCode Available | 1 |
| Auxiliary Tasks to Boost Biaffine Semantic Dependency Parsing | Feb 12, 2024 | Dependency ParsingSemantic Dependency Parsing | CodeCode Available | 0 |
| Text Detoxification as Style Transfer in English and Hindi | Feb 12, 2024 | Multi-Task LearningSentence | CodeCode Available | 0 |
| OrderBkd: Textual backdoor attack through repositioning | Feb 12, 2024 | Backdoor AttackPOS | CodeCode Available | 0 |
| A Rational Analysis of the Speech-to-Song Illusion | Feb 10, 2024 | Sentence | —Unverified | 0 |
| Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method | Feb 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |