SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 551600 of 2381 papers

TitleStatusHype
Comparing scalable strategies for generating numerical perspectives0
Fountain -- an intelligent contextual assistant combining knowledge representation and language models for manufacturing risk identification0
VacancySBERT: the approach for representation of titles and skills for semantic similarity search in the recruitment domainCode0
Scaling Sentence Embeddings with Large Language ModelsCode1
User-Controlled Knowledge Fusion in Large Language Models: Balancing Creativity and Hallucination0
Building RadiologyNET: Unsupervised annotation of a large-scale multimodal medical database0
Transformer-based Joint Source Channel Coding for Textual Semantic Communication0
Improving Semantic Similarity Measure Within a Recommender System Based-on RDF Graphs0
Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models0
Identical and Fraternal Twins: Fine-Grained Semantic Contrastive Learning of Sentence Representations0
Enhancing conversational quality in language learning chatbots: An evaluation of GPT4 for ASR error correction0
Longitudinal Data and a Semantic Similarity Reward for Chest X-Ray Report GenerationCode1
Improving Text Semantic Similarity Modeling through a 3D Siamese Network0
Class-relation Knowledge Distillation for Novel Class DiscoveryCode1
Regression-free Blind Image Quality Assessment with Content-Distortion Consistency0
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual GroundingCode1
Large-Scale Evaluation of Topic Models and Dimensionality Reduction Methods for 2D Text SpatializationCode0
AspectCSE: Sentence Embeddings for Aspect-based Semantic Textual Similarity Using Contrastive Learning and Structured Knowledge0
Improving Trace Link Recommendation by Using Non-Isotropic Distances and Combinations0
Composition-contrastive Learning for Sentence EmbeddingsCode0
Histopathology Whole Slide Image Analysis with Heterogeneous Graph Representation LearningCode1
Novel Categories Discovery Via Constraints on Empirical Prediction StatisticsCode0
Efficient Domain Adaptation of Sentence Embeddings Using Adapters0
MWPRanker: An Expression Similarity Based Math Word Problem Retriever0
Automatic Design of Semantic Similarity Ensembles Using Grammatical EvolutionCode0
Transfer learning for semantic similarity measures based on symbolic regressionCode0
A Massive Scale Semantic Similarity Dataset of Historical English0
DialoGPS: Dialogue Path Sampling in Continuous Semantic Space for Data Augmentation in Multi-Turn Conversations0
Large Language Models as Annotators: Enhancing Generalization of NLP Models at Minimal Cost0
Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner OraclesCode1
SeFNet: Bridging Tabular Datasets with Semantic Feature NetsCode0
A Relaxed Optimization Approach for Adversarial Attacks against Neural Machine Translation Models0
Unbalanced Optimal Transport for Unbalanced Word AlignmentCode1
Supervised Knowledge May Hurt Novel Class Discovery PerformanceCode0
Augmenting Reddit Posts to Determine Wellness Dimensions impacting Mental HealthCode0
LyricSIM: A novel Dataset and Benchmark for Similarity Detection in Spanish Song LyricSCode0
Vocabulary-free Image ClassificationCode1
Estimating Semantic Similarity between In-Domain and Out-of-Domain SamplesCode0
Boosting the Performance of Transformer Architectures for Semantic Textual Similarity0
Exploring Anisotropy and Outliers in Multilingual Language Models for Cross-Lingual Semantic Sentence SimilarityCode0
RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment0
Real-World Image Variation by Aligning Diffusion Inversion ChainCode1
Datasets for Portuguese Legal Semantic Textual Similarity: Comparing weak supervision and an annotation process approachesCode0
Whitening-based Contrastive Learning of Sentence EmbeddingsCode1
Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision MakingCode0
Evaluating Open-Domain Dialogues in Latent Space with Next Sentence Prediction and Mutual InformationCode0
ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-TranslationCode0
AlignScore: Evaluating Factual Consistency with a Unified Alignment FunctionCode4
RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation0
RankCSE: Unsupervised Sentence Representations Learning via Learning to RankCode1
Show:102550
← PrevPage 12 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8ALBERTPearson Correlation0.93Unverified
9T5-11BPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified