SOTAVerified

Semantic Textual Similarity

Semantic textual similarity (STS) is the task of determining how similar two pieces of text are in meaning. It is often framed as assigning a similarity score, for example from 1 to 5. Related tasks include paraphrase identification and duplicate identification.
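
As a rough illustration of the scoring setup described above, the sketch below computes a bag-of-words cosine similarity between two texts and rescales it to the 1-to-5 range. It is a toy baseline, not the method of any paper listed on this page. The function names (`tokenize`, `cosine_similarity`, `to_one_to_five`) and the linear rescaling are assumptions made for the example.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a: str, b: str) -> float:
    """Cosine similarity between bag-of-words count vectors, in [0, 1]."""
    va, vb = Counter(tokenize(a)), Counter(tokenize(b))
    # Only tokens present in both texts contribute to the dot product.
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    norm = norm_a * norm_b
    # An empty text has a zero vector; define its similarity as 0.
    return dot / norm if norm else 0.0


def to_one_to_five(sim: float) -> float:
    """Linearly rescale a similarity in [0, 1] to [1, 5] (assumed mapping)."""
    return 1.0 + 4.0 * sim


if __name__ == "__main__":
    s1 = "A man is playing a guitar."
    s2 = "A person plays the guitar."
    print(round(to_one_to_five(cosine_similarity(s1, s2)), 2))
```

Word overlap ignores synonyms and word order, which is one reason the models in the tables on this page rely on learned sentence representations instead.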

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 2201-2250 of 2381 papers

| Title | Status | Hype |
| --- | --- | --- |
| EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation Metrics | Code | 0 |
| FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT | Code | 0 |
| A Resource-Light Method for Cross-Lingual Semantic Textual Similarity | Code | 0 |
| Calculating the similarity between words and sentences using a lexical database and corpus statistics | Code | 0 |
| Self-Supervised Speech Representations are More Phonetic than Semantic | Code | 0 |
| ParaICL: Towards Parallel In-Context Learning | Code | 0 |
| Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning | Code | 0 |
| Are LLMs complicated ethical dilemma analyzers? | Code | 0 |
| A mathematical theory of semantic development in deep neural networks | Code | 0 |
| The Birth of Bias: A case study on the evolution of gender bias in an English language model | Code | 0 |
| Datasets for Portuguese Legal Semantic Textual Similarity: Comparing weak supervision and an annotation process approaches | Code | 0 |
| Soft Alignment Objectives for Robust Adaptation of Language Generation | Code | 0 |
| Semantic and sentiment analysis of selected Bhagavad Gita translations using BERT-based language framework | Code | 0 |
| CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs | Code | 0 |
| Transformers for Green Semantic Communication: Less Energy, More Semantics | Code | 0 |
| Bridging the Gap between Structural and Semantic Similarity in Diverse Planning | Code | 0 |
| Urban Traffic Accident Risk Prediction Revisited: Regionality, Proximity, Similarity and Sparsity | Code | 0 |
| Pcc-tuning: Breaking the Contrastive Learning Ceiling in Semantic Textual Similarity | Code | 0 |
| Cross-Lingual Cross-Platform Rumor Verification Pivoting on Multimedia Content | Code | 0 |
| Learning Representations Specialized in Spatial Knowledge: Leveraging Language and Vision | Code | 0 |
| Comparison of State-of-the-Art Deep Learning APIs for Image Multi-Label Classification using Semantic Metrics | Code | 0 |
| Creating Large-Scale Multilingual Cognate Tables | Code | 0 |
| Trend-Aware Fashion Recommendation with Visual Segmentation and Semantic Similarity | Code | 0 |
| Learning semantic sentence representations from visually grounded language without lexical knowledge | Code | 0 |
| Urdu Word Embeddings | Code | 0 |
| WSL: Sentence Similarity Using Semantic Distance Between Words | Code | 0 |
| Investigating the Frequency Distortion of Word Embeddings and Its Impact on Bias Metrics | Code | 0 |
| Learning Semantic Textual Similarity from Conversations | Code | 0 |
| Learning Semantic Textual Similarity via Topic-informed Discrete Latent Variables | Code | 0 |
| The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining | Code | 0 |
| Semantic flow in language networks | Code | 0 |
| Learning Text Similarity with Siamese Recurrent Networks | Code | 0 |
| Space Decomposition for Sentence Embedding | Code | 0 |
| SpanBERT: Improving Pre-training by Representing and Predicting Spans | Code | 0 |
| Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network | Code | 0 |
| TSCheater: Generating High-Quality Tibetan Adversarial Texts via Visual Similarity | Code | 0 |
| Learning to Distinguish Hypernyms and Co-Hyponyms | Code | 0 |
| Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss | Code | 0 |
| FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language based on Textually Represented Environments | Code | 0 |
| Ad Hoc Table Retrieval using Semantic Similarity | Code | 0 |
| Bridging LLM-Generated Code and Requirements: Reverse Generation technique and SBC Metric for Developer Insights | Code | 0 |
| Counter-fitting Word Vectors to Linguistic Constraints | Code | 0 |
| Fake News Detection After LLM Laundering: Measurement and Explanation | Code | 0 |
| Learning to Remove: Towards Isotropic Pre-trained BERT Embedding | Code | 0 |
| The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations | Code | 0 |
| Correlations between Word Vector Sets | Code | 0 |
| Are ELECTRA's Sentence Embeddings Beyond Repair? The Case of Semantic Textual Similarity | Code | 0 |
| VacancySBERT: the approach for representation of titles and skills for semantic similarity search in the recruitment domain | Code | 0 |
| Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations | Code | 0 |
| Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SMARTRoBERTa | Dev Pearson Correlation | 92.8 | | Unverified |
| 2 | DeBERTa (large) | Accuracy | 92.5 | | Unverified |
| 3 | SMART-BERT | Dev Pearson Correlation | 90 | | Unverified |
| 4 | MT-DNN-SMART | Pearson Correlation | 0.93 | | Unverified |
| 5 | StructBERTRoBERTa ensemble | Pearson Correlation | 0.93 | | Unverified |
| 6 | Mnet-Sim | Pearson Correlation | 0.93 | | Unverified |
| 7 | XLNet (single model) | Pearson Correlation | 0.93 | | Unverified |
| 8 | ALBERT | Pearson Correlation | 0.93 | | Unverified |
| 9 | T5-11B | Pearson Correlation | 0.93 | | Unverified |
| 10 | RoBERTa | Pearson Correlation | 0.92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | AnglE-UAE | Spearman Correlation | 84.54 | | Unverified |
| 2 | ST5-XXL | Spearman Correlation | 82.63 | | Unverified |
| 3 | ST5-Large | Spearman Correlation | 81.83 | | Unverified |
| 4 | ST5-XL | Spearman Correlation | 81.66 | | Unverified |
| 5 | ST5-Base | Spearman Correlation | 81.14 | | Unverified |
| 6 | MPNet-multilingual | Spearman Correlation | 80.73 | | Unverified |
| 7 | SGPT-5.8B-nli | Spearman Correlation | 80.53 | | Unverified |
| 8 | MPNet | Spearman Correlation | 80.28 | | Unverified |
| 9 | MiniLM-L12 | Spearman Correlation | 79.8 | | Unverified |
| 10 | SimCSE-BERT-sup | Spearman Correlation | 79.12 | | Unverified |
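
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MT-DNN-SMART | Accuracy | 93.7 | | Unverified |
| 2 | ALBERT | Accuracy | 93.4 | | Unverified |
| 3 | RoBERTa (ensemble) | Accuracy | 92.3 | | Unverified |
| 4 | BigBird | F1 | 91.5 | | Unverified |
| 5 | StructBERTRoBERTa ensemble | Accuracy | 91.5 | | Unverified |
| 6 | FLOATER-large | Accuracy | 91.4 | | Unverified |
| 7 | SMART | Accuracy | 91.3 | | Unverified |
| 8 | RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned) | Accuracy | 91 | | Unverified |
| 9 | RoBERTa-large 355M + Entailment as Few-shot Learner | F1 | 91 | | Unverified |
| 10 | SpanBERT | Accuracy | 90.9 | | Unverified |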

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.82 | | Unverified |
| 2 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.82 | | Unverified |
| 3 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.82 | | Unverified |
| 4 | SimCSE-RoBERTalarge | Spearman Correlation | 0.82 | | Unverified |
| 5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.81 | | Unverified |
| 6 | SentenceBERT | Spearman Correlation | 0.75 | | Unverified |
| 7 | SRoBERTa-NLI-base | Spearman Correlation | 0.74 | | Unverified |
| 8 | SRoBERTa-NLI-large | Spearman Correlation | 0.74 | | Unverified |
| 9 | Dino (STS/̄🦕) | Spearman Correlation | 0.74 | | Unverified |
| 10 | SBERT-NLI-large | Spearman Correlation | 0.74 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | AnglE-LLaMA-7B | Spearman Correlation | 0.91 | | Unverified |
| 2 | AnglE-LLaMA-7B-v2 | Spearman Correlation | 0.91 | | Unverified |
| 3 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.9 | | Unverified |
| 4 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.9 | | Unverified |
| 5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.9 | | Unverified |
| 6 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.89 | | Unverified |
| 7 | Trans-Encoder-BERT-large-bi (unsup.) | Spearman Correlation | 0.89 | | Unverified |
| 8 | Trans-Encoder-BERT-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified |
| 9 | Trans-Encoder-RoBERTa-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified |
| 10 | SimCSE-RoBERTa-large | Spearman Correlation | 0.87 | | Unverified |
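
Most of the results above are reported as Pearson or Spearman correlation between predicted and gold similarity scores. The sketch below shows how the two metrics differ, using pure-Python implementations. The helper names (`pearson`, `average_ranks`, `spearman`) and the sample scores are hypothetical, chosen only for illustration. In practice a library routine such as `scipy.stats.pearsonr` or `scipy.stats.spearmanr` would normally be used instead.

```python
import math


def pearson(x, y):
    """Pearson correlation: strength of the linear relationship."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def average_ranks(values):
    """1-based ranks, with tied values sharing the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    gold = [1.0, 2.0, 3.0, 4.0, 5.0]   # hypothetical human scores
    pred = [0.1, 0.2, 0.3, 0.4, 0.9]   # hypothetical model scores
    print("Pearson: ", round(pearson(gold, pred), 3))
    print("Spearman:", round(spearman(gold, pred), 3))
```

The sample predictions preserve the gold ordering but are not linear in it, so the Spearman value is higher than the Pearson value. Note also that the tables list correlations on two scales, as fractions (for example 0.93) and multiplied by 100 (for example 92.8), so values should be brought to a common scale before they are compared.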