SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 251300 of 2381 papers

TitleStatusHype
Why Not Simply Translate? A First Swedish Evaluation Benchmark for Semantic SimilarityCode1
Word Rotator's DistanceCode1
CgAT: Center-Guided Adversarial Training for Deep Hashing-Based RetrievalCode1
Attentive Normalization for Conditional Image GenerationCode1
Clustering-Aware Negative Sampling for Unsupervised Sentence RepresentationCode1
AugCSE: Contrastive Sentence Embedding with Diverse AugmentationsCode1
Audio-Visual Class-Incremental LearningCode1
Calibrating Higher-Order Statistics for Few-Shot Class-Incremental Learning with Pre-trained Vision TransformersCode1
A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity RewardsCode1
CDF-RAG: Causal Dynamic Feedback for Adaptive Retrieval-Augmented GenerationCode1
Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMsCode1
Charformer: Fast Character Transformers via Gradient-based Subword TokenizationCode1
Attributable Visual Similarity LearningCode1
Compositional Evaluation on Japanese Textual Entailment and SimilarityCode1
ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation TransferCode1
Context-Aware Semantic Similarity Measurement for Unsupervised Word Sense DisambiguationCode1
Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT ModelsCode1
Towards Better Understanding of User Satisfaction in Open-Domain Conversational SearchCode1
Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCOCode1
Cross-lingual Text Classification with Heterogeneous Graph Neural NetworkCode1
CALM : A Multi-task Benchmark for Comprehensive Assessment of Language Model BiasCode1
CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security ResearchCode1
AutoKG: Efficient Automated Knowledge Graph Generation for Language ModelsCode1
Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based LearningCode1
Deep Representational Re-tuning using Contrastive TensionCode1
DeepSim: Semantic similarity metrics for learned image registrationCode1
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation LearningCode1
Automated radiology report generation using conditioned transformersCode1
DiffSim: Taming Diffusion Models for Evaluating Visual SimilarityCode1
Debiased Contrastive Learning of Unsupervised Sentence RepresentationsCode1
DistilCSE: Effective Knowledge Distillation For Contrastive Sentence EmbeddingsCode1
Discrete Optimization for Unsupervised Sentence Summarization with Word-Level ExtractionCode1
DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image SegmentationCode1
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual GroundingCode1
Automatic Generation of Topic LabelsCode1
Distributional Formal SemanticsCode1
Improving Language Understanding by Generative Pre-TrainingCode1
EASE: Entity-Aware Contrastive Learning of Sentence EmbeddingCode1
Efficient Mask Correction for Click-Based Interactive Image SegmentationCode1
ELITE: Embedding-Less retrieval with Iterative Text ExplorationCode1
An Efficient Self-Supervised Cross-View Training For Sentence EmbeddingCode1
Entailment as Few-Shot LearnerCode1
On the Sentence Embeddings from Pre-trained Language ModelsCode1
Attention Discriminant Sampling for Point Clouds0
A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora0
Attention-based Cross-Layer Domain Alignment for Unsupervised Domain Adaptation0
Attention-aware semantic relevance predicting Chinese sentence reading0
A Multi-level Alignment Training Scheme for Video-and-Language Grounding0
A Deep Decomposable Model for Disentangling Syntax and Semantics in Sentence Representation0
A Thesaurus for Biblical Hebrew0
Show:102550
← PrevPage 6 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8ALBERTPearson Correlation0.93Unverified
9T5-11BPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified