SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 18511900 of 2381 papers

TitleStatusHype
Problems With Evaluation of Word Embeddings Using Word Similarity Tasks0
Proceedings of the LexSem+Logics Workshop 20160
Processing and Normalizing Hashtags0
ProcSim: Proxy-based Confidence for Robust Similarity Learning0
Product Feature Mining: Semantic Clues versus Syntactic Constituents0
Promoting Semantics in Multi-objective Genetic Programming based on Decomposition0
PromptBERT: Improving BERT Sentence Embeddings with Prompts0
PromptExp: Multi-granularity Prompt Explanation of Large Language Models0
Prompting Large Language Model for Machine Translation: A Case Study0
Prompt Obfuscation for Large Language Models0
Prompt-tuning for Clickbait Detection via Text Summarization0
PropNet: a White-Box and Human-Like Network for Sentence Representation0
ProtCLIP: Function-Informed Protein Multi-Modal Learning0
ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation0
ProtoGuard-guided PROPEL: Class-Aware Prototype Enhancement and Progressive Labeling for Incremental 3D Point Cloud Segmentation0
PurdueNLP at SemEval-2017 Task 1: Predicting Semantic Textual Similarity with Paraphrase and Event Embeddings0
Push for Quantization: Deep Fisher Hashing0
QA4PRF: A Question Answering based Framework for Pseudo Relevance Feedback0
Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT0
QLUT at SemEval-2017 Task 1: Semantic Textual Similarity Based on Word Embeddings0
QSTS: A Question-Sensitive Text Similarity Measure for Question Generation0
Quality Estimation for Machine Translation Using the Joint Method of Evaluation Criteria and Statistical Modeling0
Quantifying perturbation impacts for large language models0
QU-BIGIR at SemEval 2017 Task 3: Using Similarity Features for Arabic Community Question Answering Forums0
QurSim: A corpus for evaluation of relatedness in short texts0
Qwant Research @DEFT 2019: Document matching and information retrieval using clinical cases0
RAGulator: Lightweight Out-of-Context Detectors for Grounded Text Generation0
Raising the Bar on the Evaluation of Out-of-Distribution Detection0
RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation0
Random Positive-Only Projections: PPMI-Enabled Incremental Semantic Space Construction0
Random Walks and Neural Network Language Models on Knowledge Bases0
Rank-Consistency Deep Hashing for Scalable Multi-Label Image Search0
R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training0
Reasoning before Comparison: LLM-Enhanced Semantic Similarity Metrics for Domain Specialized Text Analysis0
ReCellTy: Domain-specific knowledge graph retrieval-augmented LLMs workflow for single-cell annotation0
Recognizing Arguing Subjectivity and Argument Tags0
Recognizing Humour using Word Associations and Humour Anchor Extraction0
Recognizing Similar Crises through the Application of Ontology-based Knowledge Mining0
Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics0
Reconstructing Big Semantic Similarity Networks0
Recurrent Continuous Translation Models0
Reddit Temporal N-gram Corpus and its Applications on Paraphrase and Semantic Similarity in Social Media using a Topic-based Latent Semantic Analysis0
Reducing Sentiment Bias in Language Models via Counterfactual Evaluation0
Referential Translation Machines for Predicting Translation Quality and Related Statistics0
Referential Translation Machines for Predicting Translation Quality0
Referential Translation Machines for Quality Estimation0
Refiner: Data Refining against Gradient Leakage Attacks in Federated Learning0
Regression-free Blind Image Quality Assessment with Content-Distortion Consistency0
Reinforce Attack: Adversarial Attack against BERT with Reinforcement Learning0
ReinforceBug: A Framework to Generate Adversarial Textual Examples0
Show:102550
← PrevPage 38 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8ALBERTPearson Correlation0.93Unverified
9T5-11BPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified