| LinkBERT: Pretraining Language Models with Document Links | Mar 29, 2022 | Document ClassificationLanguage Modeling | CodeCode Available | 2 |
| SuperGLEBer: German Language Understanding Evaluation Benchmark | Jun 20, 2024 | Document ClassificationNatural Language Understanding | CodeCode Available | 1 |
| From News to Summaries: Building a Hungarian Corpus for Extractive and Abstractive Summarization | Apr 4, 2024 | Abstractive Text SummarizationExtractive Summarization | CodeCode Available | 1 |
| Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings | Jan 28, 2024 | Contrastive LearningDescriptive | CodeCode Available | 1 |
| Gloss Attention for Gloss-free Sign Language Translation | Jul 14, 2023 | Gloss-free Sign Language TranslationLanguage Modeling | CodeCode Available | 1 |
| What Do Self-Supervised Speech Models Know About Words? | Jun 30, 2023 | SentenceSentence Similarity | CodeCode Available | 1 |
| Contrastive Learning of Sentence Embeddings from Scratch | May 24, 2023 | Contrastive LearningNatural Language Inference | CodeCode Available | 1 |
| C-STS: Conditional Semantic Textual Similarity | May 24, 2023 | Information RetrievalLanguage Model Evaluation | CodeCode Available | 1 |
| On Isotropy, Contextualization and Learning Dynamics of Contrastive-based Sentence Representation Learning | Dec 18, 2022 | Contrastive LearningRepresentation Learning | CodeCode Available | 1 |
| Subspace Representations for Soft Set Operations and Sentence Similarities | Oct 24, 2022 | RetrievalSemantic Textual Similarity | CodeCode Available | 1 |
| Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity | Aug 29, 2022 | SentenceSentence Embedding | CodeCode Available | 1 |
| SynWMD: Syntax-aware Word Mover's Distance for Sentence Similarity Evaluation | Jun 20, 2022 | Semantic SimilaritySemantic Textual Similarity | CodeCode Available | 1 |
| SBERT studies Meaning Representations: Decomposing Sentence Embeddings into Explainable Semantic Features | Jun 14, 2022 | Abstract Meaning RepresentationNegation | CodeCode Available | 1 |
| Extracting Latent Steering Vectors from Pretrained Language Models | May 10, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Improving Event Representation via Simultaneous Weakly Supervised Contrastive Learning and Clustering | Mar 15, 2022 | ClusteringContrastive Learning | CodeCode Available | 1 |
| Just Rank: Rethinking Evaluation with Word and Sentence Similarities | Mar 5, 2022 | BenchmarkingSemantic Similarity | CodeCode Available | 1 |
| Toward Interpretable Semantic Textual Similarity via Optimal Transport-based Contrastive Sentence Learning | Feb 26, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations | Sep 27, 2021 | Contrastive LearningLanguage Modelling | CodeCode Available | 1 |
| LexSubCon: Integrating Knowledge from Lexical Resources into Contextual Embeddings for Lexical Substitution | Jul 11, 2021 | SentenceSentence Similarity | CodeCode Available | 1 |
| Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERT | Jul 9, 2021 | BenchmarkingDocument Classification | CodeCode Available | 1 |
| BioELECTRA:Pretrained Biomedical text Encoder using Discriminators | Jun 11, 2021 | ArticlesLanguage Modeling | CodeCode Available | 1 |
| Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders | Apr 16, 2021 | Contrastive LearningCross-Lingual Semantic Textual Similarity | CodeCode Available | 1 |
| Match-Ignition: Plugging PageRank into Transformer for Long-form Text Matching | Jan 16, 2021 | Community Question AnsweringForm | CodeCode Available | 1 |
| Example-Driven Intent Prediction with Observers | Oct 17, 2020 | Attributeintent-classification | CodeCode Available | 1 |
| Parallel Sentence Mining by Constrained Decoding | Jul 1, 2020 | Cross-Lingual Bitext MiningMachine Translation | CodeCode Available | 1 |
| Deep Learning Enabled Semantic Communication Systems | Jun 18, 2020 | Deep LearningSemantic Communication | CodeCode Available | 1 |
| Contextualized Embeddings based Transformer Encoder for Sentence Similarity Modeling in Answer Selection Task | May 1, 2020 | Answer SelectionSentence | CodeCode Available | 1 |
| A Divide-and-Conquer Approach to the Summarization of Long Documents | Apr 13, 2020 | ArticlesDocument Summarization | CodeCode Available | 1 |
| On the Effect of Dropping Layers of Pre-trained Transformer Models | Apr 8, 2020 | Knowledge DistillationSentence | CodeCode Available | 1 |
| SentEval: An Evaluation Toolkit for Universal Sentence Representations | Mar 14, 2018 | General ClassificationMulti-class Classification | CodeCode Available | 1 |
| Sentence Ordering and Coherence Modeling using Recurrent Neural Networks | Nov 8, 2016 | ArticlesSentence | CodeCode Available | 1 |
| Neural Paraphrase Generation with Stacked Residual LSTM Networks | Oct 10, 2016 | Deep LearningParaphrase Generation | CodeCode Available | 1 |
| EL4NER: Ensemble Learning for Named Entity Recognition via Multiple Small-Parameter Large Language Models | May 29, 2025 | Ensemble LearningIn-Context Learning | —Unverified | 0 |
| Can Vision-Language Models Understand and Interpret Dynamic Gestures from Pedestrians? Pilot Datasets and Exploration Towards Instructive Nonverbal Commands for Cooperative Autonomous Vehicles | Apr 15, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Coarse-to-Fine Semantic Communication Systems for Text Transmission | Apr 2, 2025 | Semantic CommunicationSentence | —Unverified | 0 |
| How does a Multilingual LM Handle Multiple Languages? | Feb 6, 2025 | Multilingual NLPMultilingual Word Embeddings | —Unverified | 0 |
| Can linguists better understand DNA? | Dec 10, 2024 | ClassificationSentence | CodeCode Available | 0 |
| 3D Spatial Understanding in MLLMs: Disambiguation and Evaluation | Dec 9, 2024 | 3D dense captioning3D visual grounding | —Unverified | 0 |
| A Novel Word Pair-based Gaussian Sentence Similarity Algorithm For Bengali Extractive Text Summarization | Nov 26, 2024 | ArticlesExtractive Summarization | CodeCode Available | 0 |
| Toeing the Party Line: Election Manifestos as a Key to Understand Political Discourse on Twitter | Oct 21, 2024 | Political evalutationSemantic Textual Similarity | CodeCode Available | 0 |
| Towards Quantifying The Privacy Of Redacted Text | Oct 10, 2024 | DiversitySentence | —Unverified | 0 |
| No Dataset Needed for Downstream Knowledge Benchmarking: Response Dispersion Inversely Correlates with Accuracy on Domain-specific QA | Aug 24, 2024 | BenchmarkingChatbot | —Unverified | 0 |
| Enhancing Semantic Similarity Understanding in Arabic NLP with Nested Embedding Learning | Jul 30, 2024 | Natural Language InferenceSemantic Similarity | —Unverified | 0 |
| Word Embedding Dimension Reduction via Weakly-Supervised Feature Selection | Jul 17, 2024 | Dimensionality Reductionfeature selection | CodeCode Available | 0 |
| OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection | Jun 4, 2024 | HallucinationMachine Translation | CodeCode Available | 0 |
| MTEB-French: Resources for French Sentence Embedding Evaluation and Analysis | May 30, 2024 | SentenceSentence Embedding | —Unverified | 0 |
| Data Augmentation Techniques for Process Extraction from Scientific Publications | May 23, 2024 | Data AugmentationSentence | —Unverified | 0 |
| Evaluation of large language model performance on the Biomedical Language Understanding and Reasoning Benchmark | May 17, 2024 | Document ClassificationLanguage Modeling | —Unverified | 0 |
| Span-Aggregatable, Contextualized Word Embeddings for Effective Phrase Mining | May 12, 2024 | RetrievalSentence | —Unverified | 0 |
| Self-Critical Alternate Learning based Semantic Broadcast Communication | Dec 3, 2023 | Reinforcement Learning (RL)Semantic Communication | —Unverified | 0 |