| Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark | May 24, 2023 | Discourse Parsing, Information Retrieval | Code Available | 0 |
| Self-Evolution Learning for Discriminative Language Model Pretraining | May 24, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Structural Ambiguity and its Disambiguation in Language Model Based Parsers: the Case of Dutch Clause Relativization | May 24, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| C-STS: Conditional Semantic Textual Similarity | May 24, 2023 | Information Retrieval, Language Model Evaluation | Code Available | 1 |
| Estimating class separability of text embeddings with persistent homology | May 24, 2023 | Language Modelling, Multi Class Text Classification | Unverified | 0 |
| Pento-DIARef: A Diagnostic Dataset for Learning the Incremental Algorithm for Referring Expression Generation from Examples | May 24, 2023 | Diagnostic, Referring Expression | Code Available | 0 |
| COMET-M: Reasoning about Multiple Events in Complex Sentences | May 24, 2023 | coreference-resolution, Coreference Resolution | Code Available | 0 |
| BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation | May 23, 2023 | Contrastive Learning, Machine Translation | Code Available | 1 |
| Are Large Language Models Robust Coreference Resolvers? | May 23, 2023 | coreference-resolution, Coreference Resolution | Code Available | 0 |
| Cascaded Beam Search: Plug-and-Play Terminology-Forcing For Neural Machine Translation | May 23, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Dancing Between Success and Failure: Edit-level Simplification Evaluation using SALSA | May 23, 2023 | Sentence, Text Simplification | Unverified | 0 |
| Is a Prestigious Job the same as a Prestigious Country? A Case Study on Multilingual Sentence Embeddings and European Countries | May 23, 2023 | Sentence, Sentence Embeddings | Unverified | 0 |
| Advancing Precise Outline-Conditioned Text Generation with Task Duality and Explicit Outline Control | May 23, 2023 | Sentence, Text Generation | Unverified | 0 |
| ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment | May 23, 2023 | Benchmarking, Cross-Lingual Transfer | Code Available | 1 |
| USB: A Unified Summarization Benchmark Across Tasks and Domains | May 23, 2023 | Abstractive Text Summarization, Articles | Code Available | 0 |
| Challenges in Context-Aware Neural Machine Translation | May 23, 2023 | Machine Translation, Sentence | Code Available | 0 |
| Causal Intervention for Abstractive Related Work Generation | May 23, 2023 | Sentence | Unverified | 0 |
| mPMR: A Multilingual Pre-trained Machine Reader at Scale | May 23, 2023 | Classification, Machine Reading Comprehension | Code Available | 0 |
| Validating Multimedia Content Moderation Software via Semantic Fusion | May 23, 2023 | Sentence, software testing | Unverified | 0 |
| AxomiyaBERTa: A Phonologically-aware Transformer Model for Assamese | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Evaluating Factual Consistency of Summaries with Large Language Models | May 23, 2023 | Binary Classification, Sentence | Code Available | 0 |
| IdEALS: Idiomatic Expressions for Advancement of Language Skills | May 23, 2023 | Grammatical Error Correction, Sentence | Unverified | 0 |
| Text Is All You Need: Learning Language Representations for Sequential Recommendation | May 23, 2023 | All, Representation Learning | Code Available | 1 |
| Linear Cross-Lingual Mapping of Sentence Embeddings | May 23, 2023 | Sentence, Sentence Embeddings | Unverified | 0 |
| TaDSE: Template-aware Dialogue Sentence Embeddings | May 23, 2023 | Contrastive Learning, intent-classification | Unverified | 0 |