| Preference Tuning For Toxicity Mitigation Generalizes Across Languages | Jun 23, 2024 | RetrievalSentence | CodeCode Available | 1 |
| How to Compute the Probability of a Word | Jun 20, 2024 | Sentence | CodeCode Available | 1 |
| Voice Disorder Analysis: a Transformer-based Approach | Jun 20, 2024 | Data AugmentationDiversity | CodeCode Available | 1 |
| SuperGLEBer: German Language Understanding Evaluation Benchmark | Jun 20, 2024 | Document ClassificationNatural Language Understanding | CodeCode Available | 1 |
| Agent-SiMT: Agent-assisted Simultaneous Machine Translation with Large Language Models | Jun 11, 2024 | Machine TranslationSentence | CodeCode Available | 1 |
| MaskLID: Code-Switching Language Identification through Iterative Masking | Jun 10, 2024 | Language IdentificationSentence | CodeCode Available | 1 |
| MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation | Jun 9, 2024 | DiversitySentence | CodeCode Available | 1 |
| Document-level Claim Extraction and Decontextualisation for Fact-Checking | Jun 5, 2024 | Extractive SummarizationFact Checking | CodeCode Available | 1 |
| CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks | Jun 4, 2024 | Document SummarizationSentence | CodeCode Available | 1 |
| Reward-based Input Construction for Cross-document Relation Extraction | May 31, 2024 | RelationRelation Extraction | CodeCode Available | 1 |
| Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization | May 31, 2024 | SentenceVideo Captioning | CodeCode Available | 1 |
| Unsupervised Extractive Dialogue Summarization in Hyperdimensional Space | May 16, 2024 | ClusteringExtractive Summarization | CodeCode Available | 1 |
| Revisiting Character-level Adversarial Attacks for Language Models | May 7, 2024 | Adversarial AttackSentence | CodeCode Available | 1 |
| Characterising the Creative Process in Humans and Large Language Models | May 1, 2024 | SentenceSentence Embeddings | CodeCode Available | 1 |
| 3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset | Apr 29, 2024 | Machine TranslationMultimodal Machine Translation | CodeCode Available | 1 |
| DPO Meets PPO: Reinforced Token Optimization for RLHF | Apr 29, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition | Apr 27, 2024 | AttributePedestrian Attribute Recognition | CodeCode Available | 1 |
| Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback | Apr 22, 2024 | AttributeHallucination | CodeCode Available | 1 |
| LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation | Apr 22, 2024 | HallucinationRAG | CodeCode Available | 1 |
| MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions | Apr 21, 2024 | Moment RetrievalSentence | CodeCode Available | 1 |
| REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity Linking | Apr 19, 2024 | Benchmarkingcoreference-resolution | CodeCode Available | 1 |
| Simple Techniques for Enhancing Sentence Embeddings in Generative Language Models | Apr 5, 2024 | Prompt EngineeringSentence | CodeCode Available | 1 |
| From News to Summaries: Building a Hungarian Corpus for Extractive and Abstractive Summarization | Apr 4, 2024 | Abstractive Text SummarizationExtractive Summarization | CodeCode Available | 1 |
| Multi-Granularity Guided Fusion-in-Decoder | Apr 3, 2024 | DecoderMulti-Task Learning | CodeCode Available | 1 |
| SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity | Apr 1, 2024 | SentenceSentence Embedding | CodeCode Available | 1 |
| Opera Graeca Adnotata: Building a 34M+ Token Multilayer Corpus for Ancient Greek | Mar 31, 2024 | LemmatizationSentence | CodeCode Available | 1 |
| U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation | Mar 29, 2024 | AttributeDisentanglement | CodeCode Available | 1 |
| SemEval-2024 Task 1: Semantic Textual Relatedness for African and Asian Languages | Mar 27, 2024 | Semantic SimilaritySemantic Textual Similarity | CodeCode Available | 1 |
| KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning | Mar 26, 2024 | Contrastive LearningKnowledge Distillation | CodeCode Available | 1 |
| Attribute First, then Generate: Locally-attributable Grounded Text Generation | Mar 25, 2024 | AttributeDocument Summarization | CodeCode Available | 1 |
| You Only Read Once: Constituency-Oriented Relational Graph Convolutional Network for Multi-Aspect Multi-Sentiment Classification | Mar 24, 2024 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 |
| Lexicon-Level Contrastive Visual-Grounding Improves Language Modeling | Mar 21, 2024 | Grounded language learningLanguage Acquisition | CodeCode Available | 1 |
| Reinforcement Learning with Token-level Feedback for Controllable Text Generation | Mar 18, 2024 | Attributereinforcement-learning | CodeCode Available | 1 |
| Hyper-CL: Conditioning Sentence Representations with Hypernetworks | Mar 14, 2024 | Computational EfficiencyContrastive Learning | CodeCode Available | 1 |
| CODE-ACCORD: A Corpus of Building Regulatory Data for Rule Generation towards Automatic Compliance Checking | Mar 4, 2024 | Relation ExtractionSentence | CodeCode Available | 1 |
| ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context | Mar 4, 2024 | In-Context LearningSentence | CodeCode Available | 1 |
| DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models | Mar 1, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| Meta-Task Prompting Elicits Embeddings from Large Language Models | Feb 28, 2024 | Semantic Textual SimilaritySentence | CodeCode Available | 1 |
| Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy | Feb 25, 2024 | HallucinationSentence | CodeCode Available | 1 |
| Seeing is Believing: Mitigating Hallucination in Large Vision-Language Models via CLIP-Guided Decoding | Feb 23, 2024 | HallucinationObject | CodeCode Available | 1 |
| TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization | Feb 20, 2024 | HallucinationNews Summarization | CodeCode Available | 1 |
| SiLLM: Large Language Models for Simultaneous Machine Translation | Feb 20, 2024 | Machine TranslationSentence | CodeCode Available | 1 |
| Machine-Generated Text Localization | Feb 19, 2024 | Binary ClassificationMisinformation | CodeCode Available | 1 |
| Contrastive Instruction Tuning | Feb 17, 2024 | Sentence | CodeCode Available | 1 |
| LongHeads: Multi-Head Attention is Secretly a Long Context Processor | Feb 16, 2024 | Sentence | CodeCode Available | 1 |
| Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking | Feb 14, 2024 | BenchmarkingLanguage Modelling | CodeCode Available | 1 |
| Pixel Sentence Representation Learning | Feb 13, 2024 | Natural Language InferenceRepresentation Learning | CodeCode Available | 1 |
| SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages | Feb 13, 2024 | Semantic SimilaritySemantic Textual Similarity | CodeCode Available | 1 |
| Alirector: Alignment-Enhanced Chinese Grammatical Error Corrector | Feb 7, 2024 | DecoderGrammatical Error Correction | CodeCode Available | 1 |
| TransLLaMa: LLM-based Simultaneous Translation System | Feb 7, 2024 | DecoderMachine Translation | CodeCode Available | 1 |