SOTAVerified

Semantic Textual Similarity

Semantic textual similarity (STS) measures how similar two pieces of text are in meaning. This often takes the form of assigning a similarity score from 1 to 5. Related tasks include paraphrase identification and duplicate identification.
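To make the scoring idea concrete, here is a minimal sketch that assigns a 1-to-5 score to a pair of texts. It assumes a plain bag-of-words cosine similarity, which is only a lexical stand-in for the learned sentence representations used by the papers listed below; the function names and the linear mapping onto the 1-to-5 scale are illustrative assumptions, not taken from any listed paper.

```python
import math
import re
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Lowercase the text and count its word tokens."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_score(text_a: str, text_b: str) -> float:
    """Hypothetical mapping: cosine in [0, 1] stretched linearly onto [1, 5]."""
    cos = cosine_similarity(bag_of_words(text_a), bag_of_words(text_b))
    return 1.0 + 4.0 * cos


if __name__ == "__main__":
    print(similarity_score("A man is playing a guitar.", "A man plays the guitar."))
    print(similarity_score("A man is playing a guitar.", "The stock market fell."))
```

Because this baseline only counts shared words, it will score paraphrases with different vocabulary as dissimilar; that gap is what embedding-based STS models aim to close.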

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 601–650 of 2381 papers

| Title | Status | Hype |
|---|---|---|
| SAMScore: A Content Structural Similarity Metric for Image Translation Evaluation | Code | 1 |
| Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations | Code | 0 |
| C-STS: Conditional Semantic Textual Similarity | Code | 1 |
| FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models | Code | 1 |
| Modeling Empathic Similarity in Personal Narratives | | 0 |
| Sentence Representations via Gaussian Embedding | Code | 0 |
| SneakyPrompt: Jailbreaking Text-to-image Generative Models | Code | 1 |
| Interpretable Word Sense Representations via Definition Generation: The Case of Semantic Change Analysis | Code | 0 |
| Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs | Code | 1 |
| Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings | | 0 |
| Balancing Lexical and Semantic Quality in Abstractive Summarization | Code | 1 |
| Semantic Similarity Measure of Natural Language Text through Machine Learning and a Keyword-Aware Cross-Encoder-Ranking Summarizer -- A Case Study Using UCGIS GIS&T Body of Knowledge | | 0 |
| Clustering-Aware Negative Sampling for Unsupervised Sentence Representation | Code | 1 |
| Adapting Sentence Transformers for the Aviation Domain | | 0 |
| Similarity-weighted Construction of Contextualized Commonsense Knowledge Graphs for Knowledge-intense Argumentation Tasks | Code | 1 |
| PESTS: Persian_English Cross Lingual Corpus for Semantic Textual Similarity | | 0 |
| Instance Smoothed Contrastive Learning for Unsupervised Sentence Embedding | Code | 0 |
| SMATCH++: Standardized and Extended Evaluation of Semantic Graphs | Code | 1 |
| Benchmarking large language models for biomedical natural language processing applications and recommendations | Code | 1 |
| Alleviating Over-smoothing for Unsupervised Sentence Representation | Code | 1 |
| REINFOREST: Reinforcing Semantic Code Similarity for Cross-Lingual Code Search Models | Code | 0 |
| Context-Aware Semantic Similarity Measurement for Unsupervised Word Sense Disambiguation | Code | 1 |
| Unsupervised Dialogue Topic Segmentation with Topic-aware Utterance Representation | | 0 |
| Improving Contrastive Learning of Sentence Embeddings from AI Feedback | Code | 1 |
| Neural Keyphrase Generation: Analysis and Evaluation | | 0 |
| Deep Lifelong Cross-modal Hashing | | 0 |
| Low-resource Bilingual Dialect Lexicon Induction with Large Language Models | Code | 0 |
| Bridging Natural Language Processing and Psycholinguistics: computationally grounded semantic similarity datasets for Basque and Spanish | | 0 |
| D2CSE: Difference-aware Deep continuous prompts for Contrastive Sentence Embeddings | | 0 |
| Learning Geometry-aware Representations by Sketching | | 0 |
| A Clustering Framework for Unsupervised and Semi-supervised New Intent Discovery | | 0 |
| PCPNet: An Efficient and Semantic-Enhanced Transformer Network for Point Cloud Prediction | Code | 1 |
| Semantic Feature Verification in FLAN-T5 | | 0 |
| Are Large Language Models Ready for Healthcare? A Comparative Study on Clinical Language Understanding | Code | 1 |
| Static Fuzzy Bag-of-Words: a lightweight sentence embedding algorithm | | 0 |
| Efficient Audio Captioning Transformer with Patchout and Text Guidance | | 0 |
| SuperDisco: Super-Class Discovery Improves Visual Recognition for the Long-Tail | | 0 |
| Using Semantic Similarity and Text Embedding to Measure the Social Media Echo of Strategic Communications | | 0 |
| LEURN: Learning Explainable Univariate Rules with Neural Networks | | 0 |
| A Novel Patent Similarity Measurement Methodology: Semantic Distance and Technological Distance | Code | 0 |
| Micro-video Tagging via Jointly Modeling Social Influence and Tag Relation | Code | 0 |
| ESCL: Equivariant Self-Contrastive Learning for Sentence Representations | | 0 |
| INO at Factify 2: Structure Coherence based Multi-Modal Fact Verification | Code | 0 |
| Weighted Sampling for Masked Language Modeling | | 0 |
| AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models | | 0 |
| NapSS: Paragraph-level Medical Text Simplification via Narrative Prompting and Sentence-matching Summarization | Code | 0 |
| A Parametric Similarity Method: Comparative Experiments based on Semantically Annotated Large Datasets | | 0 |
| Analyzing the impact of climate change on critical infrastructure from the scientific literature: A weakly supervised NLP approach | | 0 |
| How to choose "Good" Samples for Text Data Augmentation | | 0 |
| TransFool: An Adversarial Attack against Neural Machine Translation Models | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SMARTRoBERTa | Dev Pearson Correlation | 92.8 | | Unverified |
| 2 | DeBERTa (large) | Accuracy | 92.5 | | Unverified |
| 3 | SMART-BERT | Dev Pearson Correlation | 90 | | Unverified |
| 4 | MT-DNN-SMART | Pearson Correlation | 0.93 | | Unverified |
| 5 | StructBERTRoBERTa ensemble | Pearson Correlation | 0.93 | | Unverified |
| 6 | Mnet-Sim | Pearson Correlation | 0.93 | | Unverified |
| 7 | XLNet (single model) | Pearson Correlation | 0.93 | | Unverified |
| 8 | ALBERT | Pearson Correlation | 0.93 | | Unverified |
| 9 | T5-11B | Pearson Correlation | 0.93 | | Unverified |
| 10 | RoBERTa | Pearson Correlation | 0.92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AnglE-UAE | Spearman Correlation | 84.54 | | Unverified |
| 2 | ST5-XXL | Spearman Correlation | 82.63 | | Unverified |
| 3 | ST5-Large | Spearman Correlation | 81.83 | | Unverified |
| 4 | ST5-XL | Spearman Correlation | 81.66 | | Unverified |
| 5 | ST5-Base | Spearman Correlation | 81.14 | | Unverified |
| 6 | MPNet-multilingual | Spearman Correlation | 80.73 | | Unverified |
| 7 | SGPT-5.8B-nli | Spearman Correlation | 80.53 | | Unverified |
| 8 | MPNet | Spearman Correlation | 80.28 | | Unverified |
| 9 | MiniLM-L12 | Spearman Correlation | 79.8 | | Unverified |
| 10 | SimCSE-BERT-sup | Spearman Correlation | 79.12 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MT-DNN-SMART | Accuracy | 93.7 | | Unverified |
| 2 | ALBERT | Accuracy | 93.4 | | Unverified |
| 3 | RoBERTa (ensemble) | Accuracy | 92.3 | | Unverified |
| 4 | BigBird | F1 | 91.5 | | Unverified |
| 5 | StructBERTRoBERTa ensemble | Accuracy | 91.5 | | Unverified |
| 6 | FLOATER-large | Accuracy | 91.4 | | Unverified |
| 7 | SMART | Accuracy | 91.3 | | Unverified |
| 8 | RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned) | Accuracy | 91 | | Unverified |
| 9 | RoBERTa-large 355M + Entailment as Few-shot Learner | F1 | 91 | | Unverified |
| 10 | SpanBERT | Accuracy | 90.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.82 | | Unverified |
| 2 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.82 | | Unverified |
| 3 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.82 | | Unverified |
| 4 | SimCSE-RoBERTalarge | Spearman Correlation | 0.82 | | Unverified |
| 5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.81 | | Unverified |
| 6 | SentenceBERT | Spearman Correlation | 0.75 | | Unverified |
| 7 | SRoBERTa-NLI-base | Spearman Correlation | 0.74 | | Unverified |
| 8 | SRoBERTa-NLI-large | Spearman Correlation | 0.74 | | Unverified |
| 9 | Dino (STS/̄🦕) | Spearman Correlation | 0.74 | | Unverified |
| 10 | SBERT-NLI-large | Spearman Correlation | 0.74 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AnglE-LLaMA-7B | Spearman Correlation | 0.91 | | Unverified |
| 2 | AnglE-LLaMA-7B-v2 | Spearman Correlation | 0.91 | | Unverified |
| 3 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.9 | | Unverified |
| 4 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.9 | | Unverified |
| 5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.89 | | Unverified |
| 6 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.89 | | Unverified |
| 7 | Trans-Encoder-BERT-large-bi (unsup.) | Spearman Correlation | 0.89 | | Unverified |
| 8 | Trans-Encoder-BERT-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified |
| 9 | Trans-Encoder-RoBERTa-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified |
| 10 | SimCSE-RoBERTa-large | Spearman Correlation | 0.87 | | Unverified |
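
Most of the benchmark entries above report Pearson or Spearman correlation between a model's predicted similarity scores and human-annotated gold scores. The sketch below shows how those two statistics are conventionally computed; it is written from the textbook definitions with illustrative function names, and it does not reproduce the evaluation script of any listed benchmark. Some entries appear on a 0-to-1 scale and others on a 0-to-100 scale, so multiply by 100 when comparing across them.

```python
import math


def pearson(x, y):
    """Pearson correlation: covariance divided by the product of spreads.

    Assumes both inputs have the same length and are not constant.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    gold = [1.0, 2.0, 3.0, 4.0, 5.0]        # hypothetical human scores
    predicted = [0.1, 0.2, 0.4, 0.8, 1.6]   # hypothetical model outputs
    print("Pearson: ", pearson(gold, predicted))
    print("Spearman:", spearman(gold, predicted))
```

Spearman depends only on the ordering of the predictions, so a model whose scores are monotonically but not linearly related to the gold scores can reach a perfect Spearman value while its Pearson value stays below 1.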