SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 101150 of 2381 papers

TitleStatusHype
Frequency-driven Imperceptible Adversarial Attack on Semantic SimilarityCode1
Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner OraclesCode1
Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMsCode1
Distributional Formal SemanticsCode1
Graph-based Semantical Extractive Text AnalysisCode1
Charformer: Fast Character Transformers via Gradient-based Subword TokenizationCode1
Class-relation Knowledge Distillation for Novel Class DiscoveryCode1
CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security ResearchCode1
Clustering-Aware Negative Sampling for Unsupervised Sentence RepresentationCode1
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighterCode1
Alleviating Over-smoothing for Unsupervised Sentence RepresentationCode1
Improving Contrastive Learning of Sentence Embeddings from AI FeedbackCode1
Improving word mover's distance by leveraging self-attention matrixCode1
Explicitly Integrating Judgment Prediction with Legal Document Retrieval: A Law-Guided Generative ApproachCode1
Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language ModelsCode1
InstructERC: Reforming Emotion Recognition in Conversation with Multi-task Retrieval-Augmented Large Language ModelsCode1
ComStreamClust: a communicative multi-agent approach to text clustering in streaming dataCode1
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-TuningCode1
ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation TransferCode1
Context-Aware Semantic Similarity Measurement for Unsupervised Word Sense DisambiguationCode1
KGE-CL: Contrastive Learning of Tensor Decomposition Based Knowledge Graph EmbeddingsCode1
AMR-DA: Data Augmentation by Abstract Meaning RepresentationCode1
Label-Guided In-Context Learning for Named Entity RecognitionCode1
Context Compression for Auto-regressive Transformers with Sentinel TokensCode1
Language-agnostic BERT Sentence EmbeddingCode1
ContraCLM: Contrastive Learning For Causal Language ModelCode1
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual GroundingCode1
Do Vision and Language Encoders Represent the World Similarly?Code1
EASE: Entity-Aware Contrastive Learning of Sentence EmbeddingCode1
DiffSim: Taming Diffusion Models for Evaluating Visual SimilarityCode1
Describing Sets of Images with Textual-PCACode1
DIP: Dual Incongruity Perceiving Network for Sarcasm DetectionCode1
Are Large Language Models Ready for Healthcare? A Comparative Study on Clinical Language UnderstandingCode1
DeepSim: Semantic similarity metrics for learned image registrationCode1
Demystifying and Extracting Fault-indicating Information from Logs for Failure DiagnosisCode1
Deep Metric Learning by Online Soft Mining and Class-Aware AttentionCode1
Deep Fusion Transformer Network with Weighted Vector-Wise Keypoints Voting for Robust 6D Object Pose EstimationCode1
Deep Representational Re-tuning using Contrastive TensionCode1
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation LearningCode1
DialogueCSE: Dialogue-based Contrastive Learning of Sentence EmbeddingsCode1
A Semantic-based Method for Unsupervised Commonsense Question AnsweringCode1
ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive SummarizationCode1
Attributable Visual Similarity LearningCode1
AutoGCL: Automated Graph Contrastive Learning via Learnable View GeneratorsCode1
An Efficient Self-Supervised Cross-View Training For Sentence EmbeddingCode1
A Simple Long-Tailed Recognition Baseline via Vision-Language ModelCode1
AstroCLIP: A Cross-Modal Foundation Model for GalaxiesCode1
A Statistical Framework for Low-bitwidth Training of Deep Neural NetworksCode1
R&R: Metric-guided Adversarial Sentence GenerationCode1
DistilCSE: Effective Knowledge Distillation For Contrastive Sentence EmbeddingsCode1
Show:102550
← PrevPage 3 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8ALBERTPearson Correlation0.93Unverified
9T5-11BPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified