SOTAVerified

Semantic Textual Similarity

Semantic textual similarity (STS) deals with determining how similar two pieces of text are. This can take the form of assigning a score from 1 to 5. Related tasks include paraphrase identification and duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations
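To make the task definition concrete, here is a minimal sketch of scoring a sentence pair on a 1 to 5 scale. It uses a purely lexical baseline (cosine similarity over bag-of-words counts), which is not one of the methods listed on this page. The function names and the linear mapping from cosine similarity to the 1 to 5 range are assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and keep runs of letters and digits as tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a: str, b: str) -> float:
    """Cosine similarity between bag-of-words count vectors, in [0, 1]."""
    va, vb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    sq_a = sum(c * c for c in va.values())
    sq_b = sum(c * c for c in vb.values())
    if sq_a == 0 or sq_b == 0:
        return 0.0  # an empty text shares nothing with anything
    return dot / math.sqrt(sq_a * sq_b)


def similarity_score(a: str, b: str, low: float = 1.0, high: float = 5.0) -> float:
    """Map cosine similarity linearly onto [low, high] (assumed mapping)."""
    return low + (high - low) * cosine_similarity(a, b)


if __name__ == "__main__":
    pairs = [
        ("A man is playing a guitar.", "A man is playing a guitar."),
        ("A man is playing a guitar.", "A man plays the guitar."),
        ("A man is playing a guitar.", "The stock market fell today."),
    ]
    for left, right in pairs:
        print(f"{similarity_score(left, right):.2f}  {left!r} vs {right!r}")
```

A baseline like this only rewards shared words, so it cannot recognise paraphrases that use different vocabulary. That gap is what the embedding-based models in the listings below try to close.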

Papers

Showing 601-650 of 2381 papers

| Title | Status | Hype |
|---|---|---|
| FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT | Code | 0 |
| FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language based on Textually Represented Environments | Code | 0 |
| Supervised Online Hashing via Hadamard Codebook Learning | Code | 0 |
| FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection | Code | 0 |
| Fake News Detection After LLM Laundering: Measurement and Explanation | Code | 0 |
| Exploring Anisotropy and Outliers in Multilingual Language Models for Cross-Lingual Semantic Sentence Similarity | Code | 0 |
| Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning | Code | 0 |
| Analyzing how BERT performs entity matching | Code | 0 |
| Autoencoding Pixies: Amortised Variational Inference with Graph Convolutions for Functional Distributional Semantics | Code | 0 |
| Exploiting Twitter as Source of Large Corpora of Weakly Similar Pairs for Semantic Sentence Embeddings | Code | 0 |
| Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity | Code | 0 |
| Finnish resources for evaluating language model semantics | Code | 0 |
| JCSE: Contrastive Learning of Japanese Sentence Embeddings and Its Applications | Code | 0 |
| Jmp8 at SemEval-2017 Task 2: A simple and general distributional approach to estimate word similarity | Code | 0 |
| Eval-GCSC: A New Metric for Evaluating ChatGPT's Performance in Chinese Spelling Correction | Code | 0 |
| ETPC - A Paraphrase Identification Corpus Annotated with Extended Paraphrase Typology and Negation | Code | 0 |
| Auto-Encoding Dictionary Definitions into Consistent Word Embeddings | Code | 0 |
| Estimating Semantic Similarity between In-Domain and Out-of-Domain Samples | Code | 0 |
| Evaluating Open-Domain Dialogues in Latent Space with Next Sentence Prediction and Mutual Information | Code | 0 |
| EquivPruner: Boosting Efficiency and Quality in LLM-Based Search via Action Pruning | Code | 0 |
| Cross-Lingual Cross-Platform Rumor Verification Pivoting on Multimedia Content | Code | 0 |
| TinyBERT: Distilling BERT for Natural Language Understanding | Code | 0 |
| Entity-enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding | Code | 0 |
| ERNIE: Enhanced Language Representation with Informative Entities | Code | 0 |
| Emu: Enhancing Multilingual Sentence Embeddings with Semantic Specialization | Code | 0 |
| EMBEDDIA at SemEval-2022 Task 8: Investigating Sentence, Image, and Knowledge Graph Representations for Multilingual News Article Similarity | Code | 0 |
| Better Summarization Evaluation with Word Embeddings for ROUGE | Code | 0 |
| Embeddings Evaluation Using a Novel Measure of Semantic Similarity | Code | 0 |
| SueNes: A Weakly Supervised Approach to Evaluating Single-Document Summarization via Negative Sampling | Code | 0 |
| EL Embeddings: Geometric construction of models for the Description Logic EL ++ | Code | 0 |
| Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal Reasoning | Code | 0 |
| Learning semantic sentence representations from visually grounded language without lexical knowledge | Code | 0 |
| Annotating and analyzing the interactions between meaning relations | Code | 0 |
| Learning Semantic Textual Similarity via Topic-informed Discrete Latent Variables | Code | 0 |
| Ad Hoc Table Retrieval using Semantic Similarity | Code | 0 |
| Creating Large-Scale Multilingual Cognate Tables | Code | 0 |
| Learning to Remove: Towards Isotropic Pre-trained BERT Embedding | Code | 0 |
| Leveraging the Powerful Attention of a Pre-trained Diffusion Model for Exemplar-based Image Colorization | Code | 0 |
| Efficient Heuristics Generation for Solving Combinatorial Optimization Problems Using Large Language Models | Code | 0 |
| Explaining Text Similarity in Transformer Models | Code | 0 |
| Augmenting Reddit Posts to Determine Wellness Dimensions impacting Mental Health | Code | 0 |
| Counter-fitting Word Vectors to Linguistic Constraints | Code | 0 |
| A character-based steganography using masked language modeling | Code | 0 |
| Distilling the Knowledge of Romanian BERTs Using Multiple Teachers | Code | 0 |
| Distilling Word Meaning in Context from Pre-trained Language Models | Code | 0 |
| Correlations between Word Vector Sets | Code | 0 |
| Correlation Coefficients and Semantic Textual Similarity | Code | 0 |
| Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks | Code | 0 |
| A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence Representations | Code | 0 |
| Correcting Contradictions | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SMARTRoBERTa | Dev Pearson Correlation | 92.8 | | Unverified |
| 2 | DeBERTa (large) | Accuracy | 92.5 | | Unverified |
| 3 | SMART-BERT | Dev Pearson Correlation | 90 | | Unverified |
| 4 | MT-DNN-SMART | Pearson Correlation | 0.93 | | Unverified |
| 5 | StructBERTRoBERTa ensemble | Pearson Correlation | 0.93 | | Unverified |
| 6 | Mnet-Sim | Pearson Correlation | 0.93 | | Unverified |
| 7 | XLNet (single model) | Pearson Correlation | 0.93 | | Unverified |
| 8 | ALBERT | Pearson Correlation | 0.93 | | Unverified |
| 9 | T5-11B | Pearson Correlation | 0.93 | | Unverified |
| 10 | RoBERTa | Pearson Correlation | 0.92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AnglE-UAE | Spearman Correlation | 84.54 | | Unverified |
| 2 | ST5-XXL | Spearman Correlation | 82.63 | | Unverified |
| 3 | ST5-Large | Spearman Correlation | 81.83 | | Unverified |
| 4 | ST5-XL | Spearman Correlation | 81.66 | | Unverified |
| 5 | ST5-Base | Spearman Correlation | 81.14 | | Unverified |
| 6 | MPNet-multilingual | Spearman Correlation | 80.73 | | Unverified |
| 7 | SGPT-5.8B-nli | Spearman Correlation | 80.53 | | Unverified |
| 8 | MPNet | Spearman Correlation | 80.28 | | Unverified |
| 9 | MiniLM-L12 | Spearman Correlation | 79.8 | | Unverified |
| 10 | SimCSE-BERT-sup | Spearman Correlation | 79.12 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MT-DNN-SMART | Accuracy | 93.7 | | Unverified |
| 2 | ALBERT | Accuracy | 93.4 | | Unverified |
| 3 | RoBERTa (ensemble) | Accuracy | 92.3 | | Unverified |
| 4 | BigBird | F1 | 91.5 | | Unverified |
| 5 | StructBERTRoBERTa ensemble | Accuracy | 91.5 | | Unverified |
| 6 | FLOATER-large | Accuracy | 91.4 | | Unverified |
| 7 | SMART | Accuracy | 91.3 | | Unverified |
| 8 | RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned) | Accuracy | 91 | | Unverified |
| 9 | RoBERTa-large 355M + Entailment as Few-shot Learner | F1 | 91 | | Unverified |
| 10 | SpanBERT | Accuracy | 90.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.82 | | Unverified |
| 2 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.82 | | Unverified |
| 3 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.82 | | Unverified |
| 4 | SimCSE-RoBERTalarge | Spearman Correlation | 0.82 | | Unverified |
| 5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.81 | | Unverified |
| 6 | SentenceBERT | Spearman Correlation | 0.75 | | Unverified |
| 7 | SRoBERTa-NLI-base | Spearman Correlation | 0.74 | | Unverified |
| 8 | SRoBERTa-NLI-large | Spearman Correlation | 0.74 | | Unverified |
| 9 | Dino (STS/̄🦕) | Spearman Correlation | 0.74 | | Unverified |
| 10 | SBERT-NLI-large | Spearman Correlation | 0.74 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AnglE-LLaMA-7B | Spearman Correlation | 0.91 | | Unverified |
| 2 | AnglE-LLaMA-7B-v2 | Spearman Correlation | 0.91 | | Unverified |
| 3 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.9 | | Unverified |
| 4 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.9 | | Unverified |
| 5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.9 | | Unverified |
| 6 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.89 | | Unverified |
| 7 | Trans-Encoder-BERT-large-bi (unsup.) | Spearman Correlation | 0.89 | | Unverified |
| 8 | Trans-Encoder-BERT-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified |
| 9 | Trans-Encoder-RoBERTa-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified |
| 10 | SimCSE-RoBERTa-large | Spearman Correlation | 0.87 | | Unverified |
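
Most of the benchmark rows above report a Pearson or Spearman correlation between model scores and human similarity judgments. As a rough illustration of what those two metrics measure, the sketch below computes both in plain Python. The function names and the example scores are hypothetical and are not taken from any benchmark on this page.

```python
import math


def pearson(x: list[float], y: list[float]) -> float:
    """Pearson correlation: strength of the linear relationship between x and y."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values: list[float]) -> list[float]:
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # average of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x: list[float], y: list[float]) -> float:
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    # Hypothetical human ratings and model scores for five sentence pairs.
    gold = [1.0, 2.0, 3.0, 4.0, 5.0]
    predicted = [0.10, 0.12, 0.20, 0.60, 0.95]
    print("Pearson: ", round(pearson(gold, predicted), 4))
    print("Spearman:", round(spearman(gold, predicted), 4))
```

In this example the predictions are in the right order but not linearly related to the ratings, so Spearman is higher than Pearson. The claimed values above also appear to mix two conventions: some rows give the coefficient on its raw scale (for example 0.93) and others appear to give it multiplied by 100 (for example 92.8).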