SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 251300 of 2381 papers

TitleStatusHype
Why Not Simply Translate? A First Swedish Evaluation Benchmark for Semantic SimilarityCode1
Linked Credibility Reviews for Explainable Misinformation DetectionCode1
Paraphrase Generation as Zero-Shot Multilingual Translation: Disentangling Semantic Similarity from Lexical and Syntactic DiversityCode1
Big Bird: Transformers for Longer SequencesCode1
Hard negative examples are hard, but usefulCode1
Language-agnostic BERT Sentence EmbeddingCode1
tBERT: Topic Models and BERT Joining Forces for Semantic Similarity DetectionCode1
A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity RewardsCode1
Automatic Generation of Topic LabelsCode1
SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document SummarizationCode1
Neural CRF Model for Sentence Alignment in Text SimplificationCode1
Discrete Optimization for Unsupervised Sentence Summarization with Word-Level ExtractionCode1
On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation EvaluationCode1
Synthesizer: Rethinking Self-Attention in Transformer ModelsCode1
Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCOCode1
Word Rotator's DistanceCode1
Fast and Accurate Deep Bidirectional Language Representations for Unsupervised LearningCode1
Attentive Normalization for Conditional Image GenerationCode1
KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language UnderstandingCode1
Text-Guided Neural Image InpaintingCode1
Evaluating Multimodal Representations on Visual Semantic Textual SimilarityCode1
Learning to Encode Position for Transformer with Continuous Dynamical ModelCode1
Semantic Pyramid for Image GenerationCode1
Generalized Product Quantization Network for Semi-supervised Image RetrievalCode1
Learning by Semantic Similarity Makes Abstractive Summarization BetterCode1
SBERT-WK: A Sentence Embedding Method by Dissecting BERT-based Word ModelsCode1
Symmetrical Synthesis for Deep Metric LearningCode1
SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized OptimizationCode1
Q8BERT: Quantized 8Bit BERTCode1
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighterCode1
Sentence-BERT: Sentence Embeddings using Siamese BERT-NetworksCode1
RoBERTa: A Robustly Optimized BERT Pretraining ApproachCode1
XLNet: Generalized Autoregressive Pretraining for Language UnderstandingCode1
Deep Metric Learning by Online Soft Mining and Class-Aware AttentionCode1
MedSTS: A Resource for Clinical Semantic Textual SimilarityCode1
Improving Language Understanding by Generative Pre-TrainingCode1
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question AnsweringCode1
Universal Sentence EncoderCode1
SemEval-2017 Task 1: Semantic Textual Similarity - Multilingual and Cross-lingual Focused EvaluationCode1
Supervised Learning of Universal Sentence Representations from Natural Language Inference DataCode1
No Fuss Distance Metric Learning using ProxiesCode1
Label Noise Reduction in Entity Typing by Heterogeneous Partial-Label EmbeddingCode1
Semantic Similarity Based on Corpus Statistics and Lexical TaxonomyCode1
SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts0
SARA: Selective and Adaptive Retrieval-augmented Generation with Context Compression0
FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution DetectionCode0
LineRetriever: Planning-Aware Observation Reduction for Web Agents0
DALR: Dual-level Alignment Learning for Multimodal Sentence Representation Learning0
Enhancing Automatic Term Extraction with Large Language Models via Syntactic Retrieval0
Intrinsic vs. Extrinsic Evaluation of Czech Sentence Embeddings: Semantic Relevance Doesn't Help with MT Evaluation0
Show:102550
← PrevPage 6 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8ALBERTPearson Correlation0.93Unverified
9T5-11BPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified