SOTAVerified

Semantic Textual Similarity

Semantic textual similarity (STS) is the task of determining how similar two pieces of text are in meaning. This can take the form of assigning a score from 1 to 5. Related tasks include paraphrase identification and duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations
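
As a rough illustration of scoring a sentence pair on the 1 to 5 scale described above, the sketch below uses a bag-of-words cosine similarity and rescales it linearly. This is a toy baseline under stated assumptions, not the method of any paper listed on this page; the function names `tokenize`, `cosine_similarity`, and `similarity_score` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(text_a, text_b):
    """Cosine similarity between bag-of-words count vectors, in [0, 1]."""
    counts_a = Counter(tokenize(text_a))
    counts_b = Counter(tokenize(text_b))
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[w] * counts_b[w] for w in shared)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        # At least one text has no tokens: treat the pair as dissimilar.
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_score(text_a, text_b, low=1.0, high=5.0):
    """Linearly rescale cosine similarity onto a [low, high] rating scale.

    The linear mapping is an assumption made for illustration only.
    """
    return low + (high - low) * cosine_similarity(text_a, text_b)


if __name__ == "__main__":
    print(similarity_score("A man is playing a guitar.", "A man plays the guitar."))
```

Word overlap ignores synonyms and word order, which is why the models in the benchmark tables rely on learned sentence representations instead.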

Papers

Showing 1201–1250 of 2381 papers

| Title | Status | Hype |
|---|---|---|
| Effective Transfer Learning for Identifying Similar Questions: Matching User Questions to COVID-19 FAQs | | 0 |
| Measuring prominence of scientific work in online news as a proxy for impact | | 0 |
| Big Bird: Transformers for Longer Sequences | Code | 1 |
| Hard negative examples are hard, but useful | Code | 1 |
| Check_square at CheckThat! 2020: Claim Detection in Social Media via Fusion of Transformer and Syntactic Features | Code | 0 |
| Mono vs Multilingual Transformer-based Models: a Comparison across Several Language Tasks | Code | 0 |
| Logic Constrained Pointer Networks for Interpretable Textual Similarity | Code | 0 |
| CORD19STS: COVID-19 Semantic Textual Similarity Dataset | | 0 |
| Unsupervised Paraphrasing via Deep Reinforcement Learning | | 0 |
| Language-agnostic BERT Sentence Embedding | Code | 1 |
| Unsupervised Semantic Hashing with Pairwise Reconstruction | Code | 0 |
| DeSpin: a prototype system for detecting spin in biomedical publications | | 0 |
| Evaluating the Utility of Model Configurations and Data Augmentation on Clinical Semantic Textual Similarity | | 0 |
| Estimating Mutual Information Between Dense Word Embeddings | | 0 |
| Text Classification with Negative Supervision | | 0 |
| tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection | Code | 1 |
| A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity Rewards | Code | 1 |
| Class-Similarity Based Label Smoothing for Confidence Calibration | | 0 |
| Exploiting Non-Taxonomic Relations for Measuring Semantic Similarity and Relatedness in WordNet | | 0 |
| SqueezeBERT: What can computer vision teach NLP about efficient neural networks? | Code | 0 |
| Canonicalizing Open Knowledge Bases with Multi-Layered Meta-Graph Neural Network | | 0 |
| MixMOOD: A systematic approach to class distribution mismatch in semi-supervised learning using deep dataset dissimilarity measures | Code | 0 |
| DeBERTa: Decoding-enhanced BERT with Disentangled Attention | Code | 2 |
| Approche supervisée de calcul de similarité sémantique entre paires de phrases (Supervised approach to compute semantic similarity between sentence pairs) | | 0 |
| Shoestring: Graph-Based Semi-Supervised Classification With Severely Limited Labeled Data | | 0 |
| Multi-Modality Cross Attention Network for Image and Sentence Matching | | 0 |
| Automatic Generation of Topic Labels | Code | 1 |
| Boosting Few-Shot Learning With Adaptive Margin Loss | | 0 |
| Learning Tversky Similarity | | 0 |
| SueNes: A Weakly Supervised Approach to Evaluating Single-Document Summarization via Negative Sampling | Code | 0 |
| Learning to hash with semantic similarity metrics and empirical KL divergence | | 0 |
| SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization | Code | 1 |
| Autoencoding Pixies: Amortised Variational Inference with Graph Convolutions for Functional Distributional Semantics | Code | 0 |
| Neural CRF Model for Sentence Alignment in Text Simplification | Code | 1 |
| Semi-supervised lung nodule retrieval | | 0 |
| Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction | Code | 1 |
| On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation | Code | 1 |
| Synthesizer: Rethinking Self-Attention in Transformer Models | Code | 1 |
| Figure Me Out: A Gold Standard Dataset for Metaphor Interpretation | | 0 |
| MSD-1030: A Well-built Multi-Sense Evaluation Dataset for Sense Representation Models | | 0 |
| SAPPHIRE: Simple Aligner for Phrasal Paraphrase with Hierarchical Representation | | 0 |
| Spatial Multi-Arrangement for Clustering and Multi-way Similarity Dataset Construction | | 0 |
| Towards a Gold Standard for Evaluating Danish Word Embeddings | | 0 |
| Representing Verbs with Visual Argument Vectors | | 0 |
| Word Embedding Evaluation in Downstream Tasks and Semantic Analogies | | 0 |
| A French Corpus for Semantic Similarity | | 0 |
| Urban Dictionary Embeddings for Slang NLP Applications | | 0 |
| Building Semantic Grams of Human Knowledge | | 0 |
| Multilingual Corpus Creation for Multilingual Semantic Similarity Task | | 0 |
| Extrapolating Binder Style Word Embeddings to New Words | | 0 |
Page 25 of 48

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SMARTRoBERTa | Dev Pearson Correlation | 92.8 | | Unverified |
| 2 | DeBERTa (large) | Accuracy | 92.5 | | Unverified |
| 3 | SMART-BERT | Dev Pearson Correlation | 90 | | Unverified |
| 4 | MT-DNN-SMART | Pearson Correlation | 0.93 | | Unverified |
| 5 | StructBERTRoBERTa ensemble | Pearson Correlation | 0.93 | | Unverified |
| 6 | Mnet-Sim | Pearson Correlation | 0.93 | | Unverified |
| 7 | XLNet (single model) | Pearson Correlation | 0.93 | | Unverified |
| 8 | ALBERT | Pearson Correlation | 0.93 | | Unverified |
| 9 | T5-11B | Pearson Correlation | 0.93 | | Unverified |
| 10 | RoBERTa | Pearson Correlation | 0.92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AnglE-UAE | Spearman Correlation | 84.54 | | Unverified |
| 2 | ST5-XXL | Spearman Correlation | 82.63 | | Unverified |
| 3 | ST5-Large | Spearman Correlation | 81.83 | | Unverified |
| 4 | ST5-XL | Spearman Correlation | 81.66 | | Unverified |
| 5 | ST5-Base | Spearman Correlation | 81.14 | | Unverified |
| 6 | MPNet-multilingual | Spearman Correlation | 80.73 | | Unverified |
| 7 | SGPT-5.8B-nli | Spearman Correlation | 80.53 | | Unverified |
| 8 | MPNet | Spearman Correlation | 80.28 | | Unverified |
| 9 | MiniLM-L12 | Spearman Correlation | 79.8 | | Unverified |
| 10 | SimCSE-BERT-sup | Spearman Correlation | 79.12 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MT-DNN-SMART | Accuracy | 93.7 | | Unverified |
| 2 | ALBERT | Accuracy | 93.4 | | Unverified |
| 3 | RoBERTa (ensemble) | Accuracy | 92.3 | | Unverified |
| 4 | BigBird | F1 | 91.5 | | Unverified |
| 5 | StructBERTRoBERTa ensemble | Accuracy | 91.5 | | Unverified |
| 6 | FLOATER-large | Accuracy | 91.4 | | Unverified |
| 7 | SMART | Accuracy | 91.3 | | Unverified |
| 8 | RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned) | Accuracy | 91 | | Unverified |
| 9 | RoBERTa-large 355M + Entailment as Few-shot Learner | F1 | 91 | | Unverified |
| 10 | SpanBERT | Accuracy | 90.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.82 | | Unverified |
| 2 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.82 | | Unverified |
| 3 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.82 | | Unverified |
| 4 | SimCSE-RoBERTalarge | Spearman Correlation | 0.82 | | Unverified |
| 5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.81 | | Unverified |
| 6 | SentenceBERT | Spearman Correlation | 0.75 | | Unverified |
| 7 | SRoBERTa-NLI-base | Spearman Correlation | 0.74 | | Unverified |
| 8 | SRoBERTa-NLI-large | Spearman Correlation | 0.74 | | Unverified |
| 9 | Dino (STS/̄🦕) | Spearman Correlation | 0.74 | | Unverified |
| 10 | SBERT-NLI-large | Spearman Correlation | 0.74 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AnglE-LLaMA-7B | Spearman Correlation | 0.91 | | Unverified |
| 2 | AnglE-LLaMA-7B-v2 | Spearman Correlation | 0.91 | | Unverified |
| 3 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.9 | | Unverified |
| 4 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.9 | | Unverified |
| 5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.9 | | Unverified |
| 6 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.89 | | Unverified |
| 7 | Trans-Encoder-BERT-large-bi (unsup.) | Spearman Correlation | 0.89 | | Unverified |
| 8 | Trans-Encoder-BERT-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified |
| 9 | Trans-Encoder-RoBERTa-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified |
| 10 | SimCSE-RoBERTa-large | Spearman Correlation | 0.87 | | Unverified |
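
Most rows above report Pearson or Spearman correlation between a model's predicted similarity scores and human-annotated gold scores. The sketch below shows one way these two metrics could be computed with only the standard library; the helper names `pearson`, `average_ranks`, and `spearman` are hypothetical, and the actual evaluation scripts behind the listed results may differ.

```python
import math


def pearson(xs, ys):
    """Pearson correlation: covariance divided by the product of spreads."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks in ascending order; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


if __name__ == "__main__":
    gold = [1.0, 2.0, 3.0, 4.0, 5.0]          # made-up gold scores
    predicted = [0.1, 0.3, 0.2, 0.8, 0.9]     # made-up model scores
    print(pearson(gold, predicted), spearman(gold, predicted))
```

Pearson measures linear agreement with the gold scores, while Spearman only checks that pairs are ranked in the same order. The tables appear to mix two scales: some values are raw correlations (for example 0.93) and others look like correlations multiplied by 100 (for example 92.8).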