| Opera Graeca Adnotata: Building a 34M+ Token Multilayer Corpus for Ancient Greek | Mar 31, 2024 | LemmatizationSentence | CodeCode Available | 1 |
| U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation | Mar 29, 2024 | AttributeDisentanglement | CodeCode Available | 1 |
| SemEval-2024 Task 1: Semantic Textual Relatedness for African and Asian Languages | Mar 27, 2024 | Semantic SimilaritySemantic Textual Similarity | CodeCode Available | 1 |
| KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning | Mar 26, 2024 | Contrastive LearningKnowledge Distillation | CodeCode Available | 1 |
| Attribute First, then Generate: Locally-attributable Grounded Text Generation | Mar 25, 2024 | AttributeDocument Summarization | CodeCode Available | 1 |
| You Only Read Once: Constituency-Oriented Relational Graph Convolutional Network for Multi-Aspect Multi-Sentiment Classification | Mar 24, 2024 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 |
| Lexicon-Level Contrastive Visual-Grounding Improves Language Modeling | Mar 21, 2024 | Grounded language learningLanguage Acquisition | CodeCode Available | 1 |
| Reinforcement Learning with Token-level Feedback for Controllable Text Generation | Mar 18, 2024 | Attributereinforcement-learning | CodeCode Available | 1 |
| Hyper-CL: Conditioning Sentence Representations with Hypernetworks | Mar 14, 2024 | Computational EfficiencyContrastive Learning | CodeCode Available | 1 |
| CODE-ACCORD: A Corpus of Building Regulatory Data for Rule Generation towards Automatic Compliance Checking | Mar 4, 2024 | Relation ExtractionSentence | CodeCode Available | 1 |
| ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context | Mar 4, 2024 | In-Context LearningSentence | CodeCode Available | 1 |
| DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models | Mar 1, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| Meta-Task Prompting Elicits Embeddings from Large Language Models | Feb 28, 2024 | Semantic Textual SimilaritySentence | CodeCode Available | 1 |
| Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy | Feb 25, 2024 | HallucinationSentence | CodeCode Available | 1 |
| Seeing is Believing: Mitigating Hallucination in Large Vision-Language Models via CLIP-Guided Decoding | Feb 23, 2024 | HallucinationObject | CodeCode Available | 1 |
| TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization | Feb 20, 2024 | HallucinationNews Summarization | CodeCode Available | 1 |
| SiLLM: Large Language Models for Simultaneous Machine Translation | Feb 20, 2024 | Machine TranslationSentence | CodeCode Available | 1 |
| Machine-Generated Text Localization | Feb 19, 2024 | Binary ClassificationMisinformation | CodeCode Available | 1 |
| Contrastive Instruction Tuning | Feb 17, 2024 | Sentence | CodeCode Available | 1 |
| LongHeads: Multi-Head Attention is Secretly a Long Context Processor | Feb 16, 2024 | Sentence | CodeCode Available | 1 |
| Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking | Feb 14, 2024 | BenchmarkingLanguage Modelling | CodeCode Available | 1 |
| Pixel Sentence Representation Learning | Feb 13, 2024 | Natural Language InferenceRepresentation Learning | CodeCode Available | 1 |
| SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages | Feb 13, 2024 | Semantic SimilaritySemantic Textual Similarity | CodeCode Available | 1 |
| Alirector: Alignment-Enhanced Chinese Grammatical Error Corrector | Feb 7, 2024 | DecoderGrammatical Error Correction | CodeCode Available | 1 |
| TransLLaMa: LLM-based Simultaneous Translation System | Feb 7, 2024 | DecoderMachine Translation | CodeCode Available | 1 |