SOTAVerified

Semantic Textual Similarity

Semantic textual similarity is the task of determining how similar two pieces of text are in meaning. This can take the form of assigning a score from 1 to 5. Related tasks include paraphrase identification and duplicate identification.
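As a rough illustration of what scoring a sentence pair can look like, the sketch below compares two texts with cosine similarity over word counts and maps the result onto the 1 to 5 scale mentioned above. The bag-of-words representation and the function names `cosine_similarity` and `to_sts_score` are assumptions made for this example; they are not the method of any paper listed on this page, and real systems typically use learned sentence embeddings instead.

```python
import math
from collections import Counter


def cosine_similarity(text_a: str, text_b: str) -> float:
    """Cosine similarity between bag-of-words count vectors, in [0, 1]."""
    counts_a = Counter(text_a.lower().split())
    counts_b = Counter(text_b.lower().split())
    # Counter returns 0 for missing tokens, so only shared tokens contribute.
    dot = sum(counts_a[token] * counts_b[token] for token in counts_a)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty text has no direction to compare
    return dot / (norm_a * norm_b)


def to_sts_score(similarity: float, low: float = 1.0, high: float = 5.0) -> float:
    """Linearly map a similarity in [0, 1] onto the [low, high] rating scale."""
    return low + (high - low) * similarity


if __name__ == "__main__":
    pair = ("A man is playing a guitar", "A man plays the guitar")
    print(round(to_sts_score(cosine_similarity(*pair)), 2))
```

Word overlap misses paraphrases that share few tokens, which is one reason the benchmarks on this page are dominated by embedding-based models.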

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 951-1000 of 2381 papers

Title | Status | Hype
Evaluation on Second Language Collocational Congruency with Computational Semantic Similarity | | 0
GiBERT: Introducing Linguistic Knowledge into BERT through a Lightweight Gated Injection Method | | 0
GiCCS: A German in-Context Conversational Similarity Benchmark | | 0
GKR: the Graphical Knowledge Representation for semantic parsing | | 0
Evaluation of taxonomic and neural embedding methods for calculating semantic similarity | | 0
Evaluation of Simple Distributional Compositional Operations on Longer Texts | | 0
CFILT-CORE: Semantic Textual Similarity using Universal Networking Language | | 0
Evaluation of Semantic Search and its Role in Retrieved-Augmented-Generation (RAG) for Arabic Language | | 0
Evaluation of BERT and ALBERT Sentence Embedding Performance on Downstream NLP Tasks | | 0
Evaluation Datasets for Cross-lingual Semantic Textual Similarity | | 0
Gold Standard Online Debates Summaries and First Experiments Towards Automatic Summarization of Online Debate Data | | 0
GPTSee: Enhancing Moment Retrieval and Highlight Detection via Description-Based Similarity Features | | 0
Graph-Augmented Cyclic Learning Framework for Similarity Estimation of Medical Clinical Notes | | 0
Evaluation by Association: A Systematic Study of Quantitative Word Association Evaluation | | 0
Center-wise Local Image Mixture For Contrastive Representation Learning | | 0
GrFormer: A Novel Transformer on Grassmann Manifold for Infrared and Visible Image Fusion | | 0
Grounding Action Descriptions in Videos | | 0
Grounding Semantics in Olfactory Perception | | 0
GSI-UPM at SemEval-2019 Task 5: Semantic Similarity and Word Embeddings for Multilingual Detection of Hate Speech Against Immigrants and Women on Twitter | | 0
Are We Truly Forgetting? A Critical Re-examination of Machine Unlearning Evaluation Protocols | | 0
A Large-Scale Multilingual Disambiguation of Glosses | | 0
A Contrastive Framework for Learning Sentence Representations from Pairwise and Triple-wise Perspective in Angular Space | | 0
Evaluating Topic Coherence Using Distributional Semantics | | 0
Evaluating the Utility of Model Configurations and Data Augmentation on Clinical Semantic Textual Similarity | | 0
Evaluating the Susceptibility of Pre-Trained Language Models via Handcrafted Adversarial Examples | | 0
Evaluating the Stability of Embedding-based Word Similarities | | 0
Hardness of Samples Need to be Quantified for a Reliable Evaluation System: Exploring Potential Opportunities with a New Task | | 0
CDTDS: Predicting Paraphrases in Twitter via Support Vector Regression | | 0
Harnessing label semantics to extract higher performance under noisy label for Company to Industry matching | | 0
HashAttention: Semantic Sparsity for Faster Inference | | 0
Hashtags are (not) judgemental: The untold story of Lok Sabha elections 2019 | | 0
HCCL at SemEval-2017 Task 2: Combining Multilingual Word Embeddings and Transliteration Model for Semantic Similarity | | 0
Evaluating the Effectiveness of Efficient Neural Architecture Search for Sentence-Pair Tasks | | 0
HD-RAG: Retrieval-Augmented Generation for Hybrid Documents Containing Text and Hierarchical Tables | | 0
Headerless, Quoteless, but not Hopeless? Using Pairwise Email Classification to Disentangle Email Threads | | 0
HENRY-CORE: Domain Adaptation and Stacking for Text Similarity | | 0
HHU at SemEval-2016 Task 1: Multiple Approaches to Measuring Semantic Textual Similarity | | 0
HHU at SemEval-2017 Task 2: Fast Hash-Based Embeddings for Semantic Word Similarity Assessment | | 0
Evaluating text coherence based on semantic similarity graph | | 0
CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation | | 0
Evaluating Tag Recommendations for E-Book Annotation Using a Semantic Similarity Metric | | 0
Evaluating semantic models with word-sentence relatedness | | 0
Are Multilingual Models the Best Choice for Moderately Under-resourced Languages? A Comprehensive Assessment for Catalan | | 0
Highlights of Semantics in Multi-objective Genetic Programming | | 0
A Large Resource of Patterns for Verbal Paraphrases | | 0
Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety | | 0
Causal Adversarial Perturbations for Individual Fairness and Robustness in Heterogeneous Data Spaces | | 0
Evaluating Multimodal Representations on Sentence Similarity: vSTS, Visual Semantic Textual Similarity Dataset | | 0
HLTC-HKUST: A Neural Network Paraphrase Classifier using Translation Metrics, Semantic Roles and Lexical Similarity Features | | 0
Are Manually Prepared Affective Lexicons Really Useful for Sentiment Analysis | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SMARTRoBERTa | Dev Pearson Correlation | 92.8 | | Unverified
2 | DeBERTa (large) | Accuracy | 92.5 | | Unverified
3 | SMART-BERT | Dev Pearson Correlation | 90 | | Unverified
4 | MT-DNN-SMART | Pearson Correlation | 0.93 | | Unverified
5 | StructBERTRoBERTa ensemble | Pearson Correlation | 0.93 | | Unverified
6 | Mnet-Sim | Pearson Correlation | 0.93 | | Unverified
7 | XLNet (single model) | Pearson Correlation | 0.93 | | Unverified
8 | T5-11B | Pearson Correlation | 0.93 | | Unverified
9 | ALBERT | Pearson Correlation | 0.93 | | Unverified
10 | RoBERTa | Pearson Correlation | 0.92 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AnglE-UAE | Spearman Correlation | 84.54 | | Unverified
2 | ST5-XXL | Spearman Correlation | 82.63 | | Unverified
3 | ST5-Large | Spearman Correlation | 81.83 | | Unverified
4 | ST5-XL | Spearman Correlation | 81.66 | | Unverified
5 | ST5-Base | Spearman Correlation | 81.14 | | Unverified
6 | MPNet-multilingual | Spearman Correlation | 80.73 | | Unverified
7 | SGPT-5.8B-nli | Spearman Correlation | 80.53 | | Unverified
8 | MPNet | Spearman Correlation | 80.28 | | Unverified
9 | MiniLM-L12 | Spearman Correlation | 79.8 | | Unverified
10 | SimCSE-BERT-sup | Spearman Correlation | 79.12 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MT-DNN-SMART | Accuracy | 93.7 | | Unverified
2 | ALBERT | Accuracy | 93.4 | | Unverified
3 | RoBERTa (ensemble) | Accuracy | 92.3 | | Unverified
4 | BigBird | F1 | 91.5 | | Unverified
5 | StructBERTRoBERTa ensemble | Accuracy | 91.5 | | Unverified
6 | FLOATER-large | Accuracy | 91.4 | | Unverified
7 | SMART | Accuracy | 91.3 | | Unverified
8 | RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned) | Accuracy | 91 | | Unverified
9 | RoBERTa-large 355M + Entailment as Few-shot Learner | F1 | 91 | | Unverified
10 | SpanBERT | Accuracy | 90.9 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.82 | | Unverified
2 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.82 | | Unverified
3 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.82 | | Unverified
4 | SimCSE-RoBERTalarge | Spearman Correlation | 0.82 | | Unverified
5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.81 | | Unverified
6 | SentenceBERT | Spearman Correlation | 0.75 | | Unverified
7 | SRoBERTa-NLI-base | Spearman Correlation | 0.74 | | Unverified
8 | SRoBERTa-NLI-large | Spearman Correlation | 0.74 | | Unverified
9 | Dino (STS/̄🦕) | Spearman Correlation | 0.74 | | Unverified
10 | SBERT-NLI-large | Spearman Correlation | 0.74 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AnglE-LLaMA-7B | Spearman Correlation | 0.91 | | Unverified
2 | AnglE-LLaMA-7B-v2 | Spearman Correlation | 0.91 | | Unverified
3 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.9 | | Unverified
4 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.9 | | Unverified
5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.9 | | Unverified
6 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.89 | | Unverified
7 | Trans-Encoder-BERT-large-bi (unsup.) | Spearman Correlation | 0.89 | | Unverified
8 | Trans-Encoder-BERT-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified
9 | Trans-Encoder-RoBERTa-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified
10 | SimCSE-RoBERTa-large | Spearman Correlation | 0.87 | | Unverified
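
Most of the results above are reported as Pearson or Spearman correlation between a model's predicted similarity scores and human ratings. The following is a minimal sketch of how those two metrics can be computed with the Python standard library alone. The function names `pearson`, `average_ranks`, and `spearman` are chosen for this example, and the tie handling (tied values share their mean rank) is an assumption about the usual convention, not something this page specifies.

```python
import math
from typing import List, Sequence


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation: covariance normalised by both spreads."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for position in range(start, end + 1):
            ranks[order[position]] = shared
        start = end + 1
    return ranks


def spearman(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(xs), average_ranks(ys))


if __name__ == "__main__":
    gold = [0.0, 1.5, 3.0, 4.5, 5.0]           # hypothetical human ratings
    predicted = [0.10, 0.20, 0.60, 0.70, 0.95]  # hypothetical model scores
    print(pearson(gold, predicted), spearman(gold, predicted))
```

Both metrics lie in [-1, 1]. Some rows above appear to list the value multiplied by 100 (for example 84.54) while others give the raw fraction (for example 0.91), so scores are only comparable within a single table.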