| Historical German Text Normalization Using Type- and Token-Based Language Modeling | Sep 4, 2024 | Decoder, Language Modeling | Unverified | 0 |
| CLUE: Concept-Level Uncertainty Estimation for Large Language Models | Sep 4, 2024 | Hallucination, Sentence | Unverified | 0 |
| You Only Use Reactive Attention Slice For Long Context Retrieval | Sep 3, 2024 | RAG, Retrieval | Code Available | 0 |
| Less is more: concatenating videos for Sign Language Translation from a small set of signs | Sep 3, 2024 | Sentence, Sign Language Translation | Code Available | 0 |
| Entity-Aware Biaffine Attention Model for Improved Constituent Parsing with Reduced Entity Violations | Sep 1, 2024 | Constituency Parsing, Sentence | Unverified | 0 |
| Statistics of punctuation in experimental literature -- the remarkable case of "Finnegans Wake" by James Joyce | Aug 31, 2024 | Sentence, Survival Analysis | Unverified | 0 |
| ConCSE: Unified Contrastive Learning and Augmentation for Code-Switched Embeddings | Aug 28, 2024 | Contrastive Learning, Natural Language Inference | Code Available | 0 |
| Empowering Sign Language Communication: Integrating Sentiment and Semantics for Facial Expression Synthesis | Aug 27, 2024 | Facial expression generation, Sentence | Code Available | 0 |
| Probing Causality Manipulation of Large Language Models | Aug 26, 2024 | In-Context Learning, RAG | Unverified | 0 |
| FLEURS-ASL: Including American Sign Language in Massively Multilingual Multitask Evaluation | Aug 24, 2024 | Machine Translation, Sentence | Unverified | 0 |
| Towards Estimating Personal Values in Song Lyrics | Aug 22, 2024 | Sentence | Unverified | 0 |
| High-Quality Data Augmentation for Low-Resource NMT: Combining a Translation Memory, a GAN Generator, and Filtering | Aug 22, 2024 | Data Augmentation, Generative Adversarial Network | Unverified | 0 |
| MCDubber: Multimodal Context-Aware Expressive Video Dubbing | Aug 21, 2024 | Sentence | Code Available | 0 |
| Towards Inducing Document-Level Abilities in Standard Multilingual Neural Machine Translation Models | Aug 21, 2024 | Decoder, Machine Translation | Unverified | 0 |
| Practical token pruning for foundation models in few-shot conversational virtual assistant systems | Aug 21, 2024 | Classification, Contrastive Learning | Unverified | 0 |
| SEMDR: A Semantic-Aware Dual Encoder Model for Legal Judgment Prediction with Legal Clue Tracing | Aug 19, 2024 | Representation Learning, Sentence | Unverified | 0 |
| GLIMMER: Incorporating Graph and Lexical Features in Unsupervised Multi-Document Summarization | Aug 19, 2024 | Document Summarization, Informativeness | Code Available | 0 |
| Fostering Natural Conversation in Large Language Models with NICO: a Natural Interactive COnversation dataset | Aug 18, 2024 | Sentence | Unverified | 0 |
| Scaling up Multimodal Pre-training for Sign Language Understanding | Aug 16, 2024 | Gloss-free Sign Language Translation, Sentence | Unverified | 0 |
| Nl2Hltl2Plan: Scaling Up Natural Language Understanding for Multi-Robots Through Hierarchical Temporal Logic Task Representation | Aug 15, 2024 | Natural Language Understanding, Robot Task Planning | Unverified | 0 |
| MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU | Aug 15, 2024 | Domain Classification, Intent Detection | Code Available | 0 |
| Extracting Sentence Embeddings from Pretrained Transformer Models | Aug 15, 2024 | Clustering, Retrieval-augmented Generation | Unverified | 0 |
| Sign Language Translation with Sentence Embedding Supervision | Aug 14, 2024 | Gloss-free Sign Language Translation, Sentence | Code Available | 0 |
| From Brazilian Portuguese to European Portuguese | Aug 14, 2024 | Sentence, Translation | Unverified | 0 |
| Introducing the NewsPaLM MBR and QE Dataset: LLM-Generated High-Quality Parallel Data Outperforms Traditional Web-Crawled Data | Aug 13, 2024 | Machine Translation, NMT | Unverified | 0 |