SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 501550 of 2381 papers

TitleStatusHype
Can Translation Memories afford not to use paraphrasing?0
AMRITA\_CEN@SemEval-2015: Paraphrase Detection for Twitter using Unsupervised Feature Learning with Recursive Autoencoders0
Contrastive Semantic Similarity Learning for Image Captioning Evaluation with Intrinsic Auto-encoder0
Contrastive Visual Semantic Pretraining Magnifies the Semantics of Natural Language Representations0
Contrastive Word Embedding Learning for Neural Machine Translation0
Attention-based Cross-Layer Domain Alignment for Unsupervised Domain Adaptation0
Canonicalizing Open Knowledge Bases with Multi-Layered Meta-Graph Neural Network0
ConvFiT: Conversational Fine-Tuning of Pretrained Language Models0
ArbEngVec : Arabic-English Cross-Lingual Word Embedding Model0
Convolutional neural networks for structured omics: OmicsCNN and the OmicsConv layer0
Can LLMs Replace Human Evaluators? An Empirical Study of LLM-as-a-Judge in Software Engineering0
Can GPT models Follow Human Summarization Guidelines? Evaluating ChatGPT and GPT-4 for Dialogue Summarization0
A Rank-Based Similarity Metric for Word Embeddings0
A Comprehensive Framework for Semantic Similarity Analysis of Human and AI-Generated Text Using Transformer Architectures and Ensemble Techniques0
A Joint Model for Answer Sentence Ranking and Answer Extraction0
A Quantitative Approach to Evaluating Open-Source EHR Systems for Indian Healthcare0
Calculating Semantic Similarity between Academic Articles using Topic Event and Ontology0
DeSpin: a prototype system for detecting spin in biomedical publications0
Detecting Backdoor Attacks via Similarity in Semantic Communication Systems0
Detecting Collocations Similarity via Logical-Linguistic Model0
BUT-TYPED: Using domain knowledge for computing typed similarity0
Bundle Optimization for Multi-aspect Embedding0
A Comparison of Vector-based Representations for Semantic Composition0
Building Static Embeddings from Contextual Ones: Is It Useful for Building Distributional Thesauri?0
Building Specialized Bilingual Lexicons Using Word Sense Disambiguation0
A Preliminary Evaluation of the Impact of Syntactic Structure in Semantic Textual Similarity and Semantic Relatedness Tasks0
3D Compositional Zero-shot Learning with DeCompositional Consensus0
Building Semantic Grams of Human Knowledge0
Building RadiologyNET: Unsupervised annotation of a large-scale multimodal medical database0
A practical method for occupational skills detection in Vietnamese job listings0
Building Lexical Vector Representations from Concept Definitions0
Building Interpretable and Reliable Open Information Retriever for New Domains Overnight0
Approximating Human-Like Few-shot Learning with GPT-based Compression0
AI-KU: Using Co-Occurrence Modeling for Semantic Similarity0
Detecting Language Impairments in Autism: A Computational Analysis of Semi-structured Conversations with Vector Semantics0
Building Concept Graphs from Monolingual Dictionary Entries0
Building a Synthetic Biomedical Research Article Citation Linkage Corpus0
Approche supervis\'ee de calcul de similarit\'e s\'emantique entre paires de phrases (Supervised approach to compute semantic similarity between sentence pairs)0
Building a Semantic Transparency Dataset of Chinese Nominal Compounds: A Practice of Crowdsourcing Methodology0
Building and Evaluating a Distributional Memory for Croatian0
Apport des ontologies pour le calcul de la similarité sémantique au sein d'un système de recommandation0
AI-based Approach for Safety Signals Detection from Social Networks: Application to the Levothyrox Scandal in 2017 on Doctissimo Forum0
Building a Dataset of Multilingual Cognates for the Romanian Lexicon0
BUCC 2017 Shared Task: a First Attempt Toward a Deep Learning Framework for Identifying Parallel Sentences in Comparable Corpora0
Applying Transfer Learning for Improving Domain-Specific Search Experience Using Query to Question Similarity0
BUAP: Lexical and Semantic Similarity for Cross-lingual Textual Entailment0
BUAP: Evaluating Features for Multilingual and Cross-Level Semantic Textual Similarity0
Applying Multi-Sense Embeddings for German Verbs to Determine Semantic Relatedness and to Detect Non-Literal Language0
A Comparison of Smoothing Techniques for Bilingual Lexicon Extraction from Comparable Corpora0
2-Tier SimCSE: Elevating BERT for Robust Sentence Embeddings0
Show:102550
← PrevPage 11 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8T5-11BPearson Correlation0.93Unverified
9ALBERTPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified