| Croatian Film Review Dataset (Cro-FiReDa): A Sentiment Annotated Dataset of Film Reviews | May 14, 2023 | Sentence | Unverified | 0 |
| ParaLS: Lexical Substitution via Pretrained Paraphraser | May 14, 2023 | Sentence | Code Available | 0 |
| CroSentiNews 2.0: A Sentence-Level News Sentiment Corpus | May 14, 2023 | Sentence | Unverified | 0 |
| PESTS: Persian_English Cross Lingual Corpus for Semantic Textual Similarity | May 13, 2023 | Machine Translation, Semantic Similarity | Unverified | 0 |
| A Simple and Plug-and-play Method for Unsupervised Sentence Representation Enhancement | May 13, 2023 | Retrieval, Sentence | Unverified | 0 |
| Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization | May 12, 2023 | Machine Translation, NMT | Code Available | 1 |
| Instance Smoothed Contrastive Learning for Unsupervised Sentence Embedding | May 12, 2023 | Contrastive Learning, Semantic Similarity | Code Available | 0 |
| Active Retrieval Augmented Generation | May 11, 2023 | Retrieval, Retrieval-augmented Generation | Code Available | 2 |
| A General-Purpose Multilingual Document Encoder | May 11, 2023 | Cross-Lingual Transfer, Document Classification | Code Available | 0 |
| Subword Segmental Machine Translation: Unifying Segmentation and Target Sentence Generation | May 11, 2023 | Machine Translation, Sentence | Code Available | 0 |
| LACoS-BLOOM: Low-rank Adaptation with Contrastive objective on 8 bits Siamese-BLOOM | May 10, 2023 | GPU, Language Modeling | Unverified | 0 |
| WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia | May 10, 2023 | Automated Essay Scoring, Sentence | Code Available | 0 |
| Say What You Mean! Large Language Models Speak Too Positively about Negative Commonsense Knowledge | May 10, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Adapter-TST: A Parameter Efficient Method for Multiple-Attribute Text Style Transfer | May 10, 2023 | Attribute, Language Modeling | Unverified | 0 |
| Context-Aware Document Simplification | May 10, 2023 | Sentence, Text Simplification | Code Available | 0 |
| PAI at SemEval-2023 Task 2: A Universal System for Named Entity Recognition with External Entity Information | May 10, 2023 | named-entity-recognition, Named Entity Recognition | Code Available | 0 |
| DeepTextMark: A Deep Learning-Driven Text Watermarking Approach for Identifying Large Language Model Generated Text | May 9, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Structured Sentiment Analysis as Transition-based Dependency Parsing | May 9, 2023 | Dependency Parsing, Sentence | Unverified | 0 |
| Estimating related words computationally using language model from the Mahabharata - an Indian epic | May 9, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Multilevel Sentence Embeddings for Personality Prediction | May 9, 2023 | Prediction, Sentence | Code Available | 0 |
| Attack Named Entity Recognition by Entity Boundary Interference | May 9, 2023 | named-entity-recognition, Named Entity Recognition | Unverified | 0 |
| Multi-Teacher Knowledge Distillation For Text Image Machine Translation | May 9, 2023 | Decoder, Knowledge Distillation | Code Available | 0 |
| Alleviating Over-smoothing for Unsupervised Sentence Representation | May 9, 2023 | Contrastive Learning, Semantic Textual Similarity | Code Available | 1 |
| ANALOGICAL -- A Novel Benchmark for Long Text Analogy Evaluation in Large Language Models | May 8, 2023 | Negation, Sentence | Unverified | 0 |
| Multi-Temporal Lip-Audio Memory for Visual Speech Recognition | May 8, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |