SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of text are. This can take the form of assigning a score from 1 to 5. Related tasks include paraphrase identification and duplicate identification.
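As an illustration of the scoring idea described above, here is a minimal sketch that rates a sentence pair with bag-of-words cosine similarity and maps the result onto a 1 to 5 scale. The tokenizer, the function names, and the linear mapping are assumptions made for this example; the systems listed on this page rely on learned models rather than raw word overlap.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and keep runs of letters and digits as tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a: str, b: str) -> float:
    """Cosine similarity between bag-of-words count vectors, in [0, 1]."""
    va, vb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    norm = norm_a * norm_b
    # An empty sentence has no direction, so treat it as dissimilar.
    return dot / norm if norm else 0.0


def to_score(similarity: float, low: float = 1.0, high: float = 5.0) -> float:
    """Linearly map a similarity in [0, 1] onto the [low, high] rating scale.

    The linear mapping is a hypothetical choice for this sketch.
    """
    return low + (high - low) * similarity


if __name__ == "__main__":
    pair = ("A man is playing a guitar.", "A man plays the guitar.")
    print(round(to_score(cosine_similarity(*pair)), 2))
```

Word overlap misses paraphrases that share few tokens, which is one reason the benchmarks on this page are dominated by embedding-based models.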

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 1501–1550 of 2381 papers

| Title | Status | Hype |
|---|---|---|
| IRIT: Textual Similarity Combining Conceptual Similarity with an N-Gram Comparison Method | | 0 |
| ISCAS_NLP at SemEval-2016 Task 1: Sentence Similarity Based on Support Vector Regression using Multiple Features | | 0 |
| Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text | | 0 |
| Is Cosine-Similarity of Embeddings Really About Similarity? | | 0 |
| Isolating authorship from content with semantic embeddings and contrastive learning | | 0 |
| Is this a Child, a Girl or a Car? Exploring the Contribution of Distributional Similarity to Learning Referential Word Meanings | | 0 |
| Is Twitter A Better Corpus for Measuring Sentiment Similarity? | | 0 |
| Iterative Relevance Feedback for Answer Passage Retrieval with Passage-level Semantic Match | | 0 |
| ITNLP-AiKF at SemEval-2016 Task 3 a quesiton answering system using community QA repository | | 0 |
| ITNLP-AiKF at SemEval-2017 Task 1: Rich Features Based SVR for Semantic Textual Similarity Computing | | 0 |
| ITNLP-ARC at SemEval-2018 Task 12: Argument Reasoning Comprehension with Attention | | 0 |
| It's About Time: Incorporating Temporality in Retrieval Augmented Language Models | | 0 |
| iUBC at SemEval-2016 Task 2: RNNs and LSTMs for interpretable STS | | 0 |
| JailbreakHunter: A Visual Analytics Approach for Jailbreak Prompts Discovery from Large-Scale Human-LLM Conversational Datasets | | 0 |
| Jailbreaking the Text-to-Video Generative Models | | 0 |
| JAMES: Normalizing Job Titles with Multi-Aspect Graph Embeddings and Reasoning | | 0 |
| janardhan: Semantic Textual Similarity using Universal Networking Language graph matching | | 0 |
| jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images | | 0 |
| Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models | | 0 |
| Joint Learning of Distributed Representations for Images and Texts | | 0 |
| JU_CSE_NLP: Multi-grade Classification of Semantic Similarity between Text Pairs | | 0 |
| JU-Evora: A Graph Based Cross-Level Semantic Similarity Analysis using Discourse Information | | 0 |
| JUNITMZ at SemEval-2016 Task 1: Identifying Semantic Similarity Using Levenshtein Ratio | | 0 |
| Just an Update on PMING Distance for Web-based Semantic Similarity in Artificial Intelligence and Data Mining | | 0 |
| Just Rewrite It Again: A Post-Processing Method for Enhanced Semantic Similarity and Privacy Preservation of Differentially Private Rewritten Text | | 0 |
| KEViN: A Knowledge Enhanced Validity and Novelty Classifier for Arguments | | 0 |
| KGV: Integrating Large Language Models with Knowledge Graphs for Cyber Threat Intelligence Credibility Assessment | | 0 |
| KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus | | 0 |
| KLUE-CORE: A regression model of semantic textual similarity | | 0 |
| KnCe2013-CORE:Semantic Text Similarity by use of Knowledge Bases | | 0 |
| KNNs of Semantic Encodings for Rating Prediction | | 0 |
| Knowing the Author by the Company His Words Keep | | 0 |
| Knowledge-aware Alert Aggregation in Large-scale Cloud Systems: a Hybrid Approach | | 0 |
| Enhancing Unsupervised Sentence Embeddings via Knowledge-Driven Data Augmentation and Gaussian-Decayed Contrastive Learning | | 0 |
| Knowledge Base Unification via Sense Embeddings and Disambiguation | | 0 |
| Knowledge Graph Construction and Its Application in Automatic Radiology Report Generation from Radiologist's Dictation | | 0 |
| Knowledge Graph Fusion for Language Model Fine-tuning | | 0 |
| Knowledge Tagging System on Math Questions via LLMs with Flexible Demonstration Retriever | | 0 |
| Know When To Stop: A Study of Semantic Drift in Text Generation | | 0 |
| KVShare: An LLM Service System with Efficient and Effective Multi-Tenant KV Cache Reuse | | 0 |
| L2F/INESC-ID at SemEval-2017 Tasks 1 and 2: Lexical and semantic features in word and textual similarity | | 0 |
| Label-anticipated Event Disentanglement for Audio-Visual Video Parsing | | 0 |
| Language-agnostic, automated assessment of listeners' speech recall using large language models | | 0 |
| Language-Independent Tokenisation Rivals Language-Specific Tokenisation for Word Similarity Prediction | | 0 |
| Language-Informed Transfer Learning for Embodied Household Activities | | 0 |
| Language Models Explain Word Reading Times Better Than Empirical Predictability | | 0 |
| Language Specific Knowledge: Do Models Know Better in X than in English? | | 0 |
| Language Transfer Learning for Supervised Lexical Substitution | | 0 |
| LanguaShrink: Reducing Token Overhead with Psycholinguistics | | 0 |
| Large Language Model Augmented Exercise Retrieval for Personalized Language Learning | | 0 |
Page 31 of 48

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SMARTRoBERTa | Dev Pearson Correlation | 92.8 | | Unverified |
| 2 | DeBERTa (large) | Accuracy | 92.5 | | Unverified |
| 3 | SMART-BERT | Dev Pearson Correlation | 90 | | Unverified |
| 4 | MT-DNN-SMART | Pearson Correlation | 0.93 | | Unverified |
| 5 | StructBERTRoBERTa ensemble | Pearson Correlation | 0.93 | | Unverified |
| 6 | Mnet-Sim | Pearson Correlation | 0.93 | | Unverified |
| 7 | XLNet (single model) | Pearson Correlation | 0.93 | | Unverified |
| 8 | ALBERT | Pearson Correlation | 0.93 | | Unverified |
| 9 | T5-11B | Pearson Correlation | 0.93 | | Unverified |
| 10 | RoBERTa | Pearson Correlation | 0.92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AnglE-UAE | Spearman Correlation | 84.54 | | Unverified |
| 2 | ST5-XXL | Spearman Correlation | 82.63 | | Unverified |
| 3 | ST5-Large | Spearman Correlation | 81.83 | | Unverified |
| 4 | ST5-XL | Spearman Correlation | 81.66 | | Unverified |
| 5 | ST5-Base | Spearman Correlation | 81.14 | | Unverified |
| 6 | MPNet-multilingual | Spearman Correlation | 80.73 | | Unverified |
| 7 | SGPT-5.8B-nli | Spearman Correlation | 80.53 | | Unverified |
| 8 | MPNet | Spearman Correlation | 80.28 | | Unverified |
| 9 | MiniLM-L12 | Spearman Correlation | 79.8 | | Unverified |
| 10 | SimCSE-BERT-sup | Spearman Correlation | 79.12 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MT-DNN-SMART | Accuracy | 93.7 | | Unverified |
| 2 | ALBERT | Accuracy | 93.4 | | Unverified |
| 3 | RoBERTa (ensemble) | Accuracy | 92.3 | | Unverified |
| 4 | BigBird | F1 | 91.5 | | Unverified |
| 5 | StructBERTRoBERTa ensemble | Accuracy | 91.5 | | Unverified |
| 6 | FLOATER-large | Accuracy | 91.4 | | Unverified |
| 7 | SMART | Accuracy | 91.3 | | Unverified |
| 8 | RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned) | Accuracy | 91 | | Unverified |
| 9 | RoBERTa-large 355M + Entailment as Few-shot Learner | F1 | 91 | | Unverified |
| 10 | SpanBERT | Accuracy | 90.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.82 | | Unverified |
| 2 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.82 | | Unverified |
| 3 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.82 | | Unverified |
| 4 | SimCSE-RoBERTalarge | Spearman Correlation | 0.82 | | Unverified |
| 5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.81 | | Unverified |
| 6 | SentenceBERT | Spearman Correlation | 0.75 | | Unverified |
| 7 | SRoBERTa-NLI-base | Spearman Correlation | 0.74 | | Unverified |
| 8 | SRoBERTa-NLI-large | Spearman Correlation | 0.74 | | Unverified |
| 9 | Dino (STS/̄🦕) | Spearman Correlation | 0.74 | | Unverified |
| 10 | SBERT-NLI-large | Spearman Correlation | 0.74 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AnglE-LLaMA-7B | Spearman Correlation | 0.91 | | Unverified |
| 2 | AnglE-LLaMA-7B-v2 | Spearman Correlation | 0.91 | | Unverified |
| 3 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.9 | | Unverified |
| 4 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.9 | | Unverified |
| 5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.9 | | Unverified |
| 6 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.89 | | Unverified |
| 7 | Trans-Encoder-BERT-large-bi (unsup.) | Spearman Correlation | 0.89 | | Unverified |
| 8 | Trans-Encoder-BERT-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified |
| 9 | Trans-Encoder-RoBERTa-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified |
| 10 | SimCSE-RoBERTa-large | Spearman Correlation | 0.87 | | Unverified |
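
Most of the leaderboards above rank models by Pearson or Spearman correlation between predicted and gold similarity scores. The sketch below shows how the two differ, using only the standard library; the function names and the toy data are assumptions for illustration and do not come from any listed paper. Note that the tables report some values on a 0 to 100 scale and others on a 0 to 1 scale, whereas these functions return values in [-1, 1].

```python
import math


def pearson(x: list[float], y: list[float]) -> float:
    """Pearson correlation: strength of the linear relationship."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


def ranks(values: list[float]) -> list[float]:
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x: list[float], y: list[float]) -> float:
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


if __name__ == "__main__":
    gold = [1.0, 2.0, 3.0, 4.0, 5.0]       # hypothetical human ratings
    predicted = [1.0, 4.0, 9.0, 16.0, 25.0]  # monotonic but non-linear
    print(pearson(gold, predicted), spearman(gold, predicted))
```

Spearman rewards any prediction that preserves the ordering of the gold scores, while Pearson also penalizes departures from a linear relationship, so the two metrics are not directly comparable across tables.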