SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 15511600 of 2381 papers

TitleStatusHype
A Rank-Based Similarity Metric for Word Embeddings0
Semantic Structure-based Unsupervised Deep HashingCode0
KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus0
Analyzing Citation-Distance Networks for Evaluating Publication Impact0
A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora0
Creating Large-Scale Multilingual Cognate TablesCode0
Metaphor Suggestions based on a Semantic Metaphor Repository0
SemR-11: A Multi-Lingual Gold-Standard for Semantic Similarity and Relatedness for Eleven Languages0
Indra: A Word Embedding and Semantic Relatedness ServerCode0
Retrofitting Word Representations for Unsupervised Sense Aware Word Similarities0
A Survey on Automatically-Constructed WordNets and their Evaluation: Lexical and Word Embedding-based Approaches0
A Large Resource of Patterns for Verbal Paraphrases0
Acquiring Verb Classes Through Bottom-Up Semantic Verb Clustering0
Knowing the Author by the Company His Words Keep0
Lexical and Semantic Features for Cross-lingual Text Reuse Classification: an Experiment in English and Latin Paraphrases0
Towards a Gold Standard Corpus for Variable Detection and Linking in Social Science Publications0
ETPC - A Paraphrase Identification Corpus Annotated with Extended Paraphrase Typology and NegationCode0
Fine-grained Semantic Textual Similarity for Serbian0
Urdu Word EmbeddingsCode0
Contextualized Usage-Based Material Selection0
FrNewsLink : a corpus linking TV Broadcast News Segments and Press Articles0
Social Image Tags as a Source of Word Embeddings: A Task-oriented Evaluation0
A Multilingual Wikified Data Set of Educational Material0
Automatic Thesaurus Construction for Modern Hebrew0
OPA2Vec: combining formal and informal content of biomedical ontologies to improve similarity-based predictionCode0
An Unsupervised Word Sense Disambiguation System for Under-Resourced LanguagesCode0
Learning Semantic Textual Similarity from ConversationsCode0
Direct Network Transfer: Transfer Learning of Sentence Embeddings for Semantic Similarity0
Similarity between Learning Outcomes from Course Objectives using Semantic Analysis, Blooms taxonomy and Corpus statistics0
Introducing two Vietnamese Datasets for Evaluating Semantic Models of (Dis-)Similarity and Relatedness0
Training a Ranking Function for Open-Domain Question Answering0
Viewpoint-aware Video Summarization0
Incorporating Word Embeddings into Open Directory Project based Large-scale Classification0
DOCK: Detecting Objects by transferring Common-sense Knowledge0
Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task LearningCode0
Identifying Semantic Divergences in Parallel Text without AnnotationsCode0
Universal Sentence EncoderCode1
Neural Network Architecture for Credibility Assessment of Textual Claims0
Equation Embeddings0
Near-lossless Binarization of Word EmbeddingsCode0
RUSSE: The First Workshop on Russian Semantic Similarity0
Enhanced Word Representations for Bridging Anaphora Resolution0
Beyond Context: Exploring Semantic Similarity for Tiny Face Detection0
Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial NetworkCode0
Ad Hoc Table Retrieval using Semantic SimilarityCode0
Calculating the similarity between words and sentences using a lexical database and corpus statisticsCode0
An Attention-Based Word-Level Interaction Model: Relation Detection for Knowledge Base Question Answering0
Size vs. Structure in Training Corpora for Word Embedding Models: Araneum Russicum Maximum and Russian National CorpusCode0
A Resource-Light Method for Cross-Lingual Semantic Textual SimilarityCode0
Comparison of Paragram and GloVe Results for Similarity Benchmarks0
Show:102550
← PrevPage 32 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8ALBERTPearson Correlation0.93Unverified
9T5-11BPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified