SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 10011050 of 2381 papers

TitleStatusHype
Homa at SemEval-2025 Task 5: Aligning Librarian Records with OntoAligner for Subject Tagging0
Homograph Disambiguation Through Selective Diacritic Restoration0
A web-based tool to Analyze Semantic Similarity Networks0
Horizon Scans can be accelerated using novel information retrieval and artificial intelligence tools0
How does a Multilingual LM Handle Multiple Languages?0
How do Humans and Language Models Reason About Creativity? A Comparative Analysis0
How to choose "Good" Samples for Text Data Augmentation0
How to Evaluate Semantic Communications for Images with ViTScore Metric?0
How to Learn in a Noisy World? Self-Correcting the Real-World Data Noise on Machine Translation0
A Neurosymbolic Framework for Bias Correction in Convolutional Neural Networks0
How Vital is the Jurisprudential Relevance: Law Article Intervened Legal Case Retrieval and Matching0
Contrastive Word Embedding Learning for Neural Machine Translation0
HsH: Estimating Semantic Similarity of Words and Short Phrases with Frequency Normalized Distance Measures0
HSI: A Holistic Style Injector for Arbitrary Style Transfer0
A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation – through the Lens of Semantic Similarity Rating0
HulTech: A General Purpose System for Cross-Level Semantic Similarity based on Anchor Web Counts0
Human Variability vs. Machine Consistency: A Linguistic Analysis of Texts Generated by Humans and Large Language Models0
ConvFiT: Conversational Fine-Tuning of Pretrained Language Models0
DEMO: A Statistical Perspective for Efficient Image-Text Matching0
A weakly supervised adaptive triplet loss for deep metric learning0
Convolutional neural networks for structured omics: OmicsCNN and the OmicsConv layer0
HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels0
DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment0
A Walk-Based Semantically Enriched Tree Kernel Over Distributed Word Representations0
A Neural Network Approach to Selectional Preference Acquisition0
A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation -- through the Lens of Semantic Similarity Rating0
DeepTrax: Embedding Graphs of Financial Transactions0
A Vector Space for Distributional Semantics for Entailment0
Deep Semantic Ranking Based Hashing for Multi-Label Image Retrieval0
DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model0
Avaliando a similaridade sem\^antica entre frases curtas atrav\'es de uma abordagem h\' (A hybrid approach to measure Semantic Textual Similarity between short sentences in Brazilian Portuguese)[In Portuguese]0
An enhanced method to compute the similarity between concepts of ontology0
Improving Trace Link Recommendation by Using Non-Isotropic Distances and Combinations0
Improving Verb Metaphor Detection by Propagating Abstractness to Words, Phrases and Individual Senses0
DeepPurple: Lexical, String and Affective Feature Fusion for Sentence-Level Semantic Similarity Estimation0
DeepPurple: Estimating Sentence Semantic Similarity using N-gram Regression Models and Web Snippets0
AutoTestForge: A Multidimensional Automated Testing Framework for Natural Language Processing Models0
An Empirical study of Unsupervised Neural Machine Translation: analyzing NMT output, model's behavior and sentences' contribution0
Automating Transfer Credit Assessment in Student Mobility -- A Natural Language Processing-based Approach0
Deep Lifelong Cross-modal Hashing0
Automating the Compilation of Potential Core-Outcomes for Clinical Trials0
Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records0
Deep Learning of Binary and Gradient Judgements for Semantic Paraphrase0
Adversarial Training with Contrastive Learning in NLP0
Improving Text Normalization via Unsupervised Model and Discriminative Reranking0
Automatic Visual Theme Discovery from Joint Image and Text Corpora0
An Efficient Approach to Learning Chinese Judgment Document Similarity Based on Knowledge Summarization0
Deep Contrastive Multi-view Clustering under Semantic Feature Guidance0
Automatic Thesaurus Construction for Modern Hebrew0
Improving Semantic Similarity Measure Within a Recommender System Based-on RDF Graphs0
Show:102550
← PrevPage 21 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8T5-11BPearson Correlation0.93Unverified
9ALBERTPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified