SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 401450 of 2381 papers

TitleStatusHype
Fine-tuning CLIP Text Encoders with Two-step Paraphrasing0
Improving Sentence Embeddings with Automatic Generation of Training Data Using Few-shot ExamplesCode0
Efficient data selection employing Semantic Similarity-based Graph Structures for model training0
The Impact of Word Splitting on the Semantic Content of Contextualized Word RepresentationsCode0
2D Matryoshka Sentence EmbeddingsCode4
Semantic Mirror Jailbreak: Genetic Algorithm Based Jailbreak Prompts Against Open-source LLMs0
On Defining Smart Cities using Transformer Neural Networks0
DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical DomainCode1
Are ELECTRA's Sentence Embeddings Beyond Repair? The Case of Semantic Textual SimilarityCode0
UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translationCode0
Semantic Textual Similarity Assessment in Chest X-ray Reports Using a Domain-Specific Cosine-Based MetricCode0
Reasoning before Comparison: LLM-Enhanced Semantic Similarity Metrics for Domain Specialized Text Analysis0
SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 LanguagesCode1
Pixel Sentence Representation LearningCode1
The Sound of Healthcare: Improving Medical Transcription ASR Accuracy with Large Language Models0
OrderBkd: Textual backdoor attack through repositioningCode0
Large Language Model Augmented Exercise Retrieval for Personalized Language Learning0
Multi-Lingual Malaysian Embedding: Leveraging Large Language Models for Semantic Representations0
Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT ModelsCode1
HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on TextCode0
In-Context Learning for Few-Shot Nested Named Entity Recognition0
Benchmarking Transferable Adversarial AttacksCode1
Enhancing End-to-End Multi-Task Dialogue Systems: A Study on Intrinsic Motivation Reinforcement Learning Algorithms for Improved Training and Adaptability0
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity0
Autoencoder-Based Domain Learning for Semantic Communication with Conceptual Spaces0
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation LearningCode1
It's About Time: Incorporating Temporality in Retrieval Augmented Language Models0
Contrastive Learning in Distilled ModelsCode0
Investigating Large Language Models for Financial Causality Detection in Multilingual Setup0
Connecting the Dots: Leveraging Spatio-Temporal Graph Neural Networks for Accurate Bangla Sign Language Recognition0
Using LLMs to discover emerging coded antisemitic hate-speech in extremist social media0
PhotoBot: Reference-Guided Interactive Photography via Natural Language0
Noise Contrastive Estimation-based Matching Framework for Low-Resource Security Attack Pattern RecognitionCode1
A character-based steganography using masked language modelingCode0
Image Similarity using An Ensemble of Context-Sensitive ModelsCode0
MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation0
HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image RetrievalCode1
Do Vision and Language Encoders Represent the World Similarly?Code1
Estimating Text Similarity based on Semantic Concept Embeddings0
Recognizing Similar Crises through the Application of Ontology-based Knowledge Mining0
Semantic Similarity Matching for Patent Documents Using Ensemble BERT-related Model and Novel Text Processing Method0
Unsupervised hard Negative Augmentation for contrastive learningCode0
PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical ImagingCode2
Saliency-Aware Regularized Graph Neural Network0
Are we describing the same sound? An analysis of word embedding spaces of expressive piano performanceCode0
AHAM: Adapt, Help, Ask, Model -- Harvesting LLMs for literature mining0
MMGPL: Multimodal Medical Data Analysis with Graph Prompt Learning0
LlaMaVAE: Guiding Large Language Model Generation via Continuous Latent Sentence Spaces0
Automated Assessment of Students' Code Comprehension using LLMs0
An Empirical study of Unsupervised Neural Machine Translation: analyzing NMT output, model's behavior and sentences' contribution0
Show:102550
← PrevPage 9 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8T5-11BPearson Correlation0.93Unverified
9ALBERTPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified