SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 20012050 of 2381 papers

TitleStatusHype
Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision MakingCode0
Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement LearningCode0
Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving SafetyCode0
Compressing Sentence Representation for Semantic Retrieval via Homomorphic Projective DistillationCode0
Syntactically Robust Training on Partially-Observed Data for Open Information ExtractionCode0
Identifying Cognate Sets Across Dictionaries of Related LanguagesCode0
An Information Minimization Based Contrastive Learning Model for Unsupervised Sentence Embeddings LearningCode0
Composition-contrastive Learning for Sentence EmbeddingsCode0
Modelling Sentence Pairs with Tree-structured Attentive EncoderCode0
Identifying Semantic Divergences in Parallel Text without AnnotationsCode0
Anchor-Aware Similarity Cohesion in Target Frames Enables Predicting Temporal Moment Boundaries in 2DCode0
Using Sentence Embeddings and Semantic Similarity for Seeking Consensus when Assessing Trustworthy AICode0
CompiLIG at SemEval-2017 Task 1: Cross-Language Plagiarism Detection Methods for Semantic Textual SimilarityCode0
Rematch: Robust and Efficient Matching of Local Knowledge Graphs to Improve Structural and Semantic SimilarityCode0
Mono vs Multilingual Transformer-based Models: a Comparison across Several Language TasksCode0
Representation learning for very short texts using weighted word embedding aggregationCode0
MoralStrength: Exploiting a Moral Lexicon and Embedding Similarity for Moral Foundations PredictionCode0
IISCNLP at SemEval-2016 Task 2: Interpretable STS with ILP based Multiple Chunk AlignerCode0
Better Summarization Evaluation with Word Embeddings for ROUGECode0
More Than Meets The Eye: Semi-supervised Learning Under Non-IID DataCode0
BattRAE: Bidimensional Attention-Based Recursive Autoencoders for Learning Bilingual Phrase EmbeddingsCode0
Repurposing Language Models into Embedding Models: Finding the Compute-Optimal RecipeCode0
MSnet: A BERT-based Network for Gendered Pronoun ResolutionCode0
Efficient Heuristics Generation for Solving Combinatorial Optimization Problems Using Large Language ModelsCode0
Image Similarity using An Ensemble of Context-Sensitive ModelsCode0
MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal RetrievalCode0
ImpliRet: Benchmarking the Implicit Fact Retrieval ChallengeCode0
Improve Chinese Word Embeddings by Exploiting Internal StructureCode0
Avoiding Forgetting and Allowing Forward Transfer in Continual Learning via Sparse NetworksCode0
Improving Adversarial Robustness with Self-Paced Hard-Class Pair ReweightingCode0
Sentence Embeddings using Supervised Contrastive LearningCode0
WordNet EmbeddingsCode0
An Assessment of Experimental Protocols for Tracing Changes in Word Semantics Relative to Accuracy and ReliabilityCode0
SSD: Towards Better Text-Image Consistency Metric in Text-to-Image GenerationCode0
MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM HallucinationsCode0
HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on TextCode0
Towards Better Understanding of Contrastive Sentence Representation Learning: A Unified Paradigm for GradientCode0
A Generalized Method for Automated Multilingual Loanword DetectionCode0
Unsupervised Semantic Hashing with Pairwise ReconstructionCode0
Comparative Evaluation of Label-Agnostic Selection Bias in Multilingual Hate Speech DatasetsCode0
Comment Ranking Diversification in Forum DiscussionsCode0
Analyzing how BERT performs entity matchingCode0
Collective Human Opinions in Semantic Textual SimilarityCode0
Multilingual LLMs Inherently Reward In-Language Time-Sensitive Semantic Alignment for Low-Resource LanguagesCode0
Hierarchy-based Image Embeddings for Semantic Image RetrievalCode0
Sentence Representations via Gaussian EmbeddingCode0
Improving Lexical Embeddings with Semantic KnowledgeCode0
Retrofitting Multilingual Sentence Embeddings with Abstract Meaning RepresentationCode0
TechNet: Technology Semantic Network Based on Patent DataCode0
Automatic Morpheme Segmentation and Labeling in Universal Dependencies ResourcesCode0
Show:102550
← PrevPage 41 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8ALBERTPearson Correlation0.93Unverified
9T5-11BPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified