SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of text are. This can take the form of assigning a score from 1 to 5. Related tasks include paraphrase identification and duplicate identification.
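As a rough illustration of the task, the sketch below scores a sentence pair with a bag-of-words cosine similarity, maps it onto the 1-to-5 scale mentioned above, and compares predictions with reference ratings using Pearson correlation. This is a toy baseline for illustration only, not the method of any model listed on this page; the function names and the linear mapping to the 1-to-5 scale are assumptions made for this example.

```python
import math
from collections import Counter


def cosine_similarity(text_a: str, text_b: str) -> float:
    """Cosine similarity between whitespace-tokenized word-count vectors."""
    counts_a = Counter(text_a.lower().split())
    counts_b = Counter(text_b.lower().split())
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[w] * counts_b[w] for w in shared)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as dissimilar to everything
    return dot / (norm_a * norm_b)


def to_1_5_scale(similarity: float) -> float:
    """Linearly map a similarity in [0, 1] onto a 1-to-5 score (an assumed mapping)."""
    return 1.0 + 4.0 * similarity


def pearson(xs: list[float], ys: list[float]) -> float:
    """Pearson correlation between predicted and reference scores.

    Assumes equal-length lists in which neither list is constant.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    score = to_1_5_scale(cosine_similarity("a man is playing a guitar",
                                           "a man plays the guitar"))
    print(f"predicted similarity score: {score:.2f}")
```

Word-overlap baselines like this miss paraphrases that share few words, which is one reason the systems in the benchmark tables rely on learned sentence representations instead.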

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 2151-2200 of 2381 papers

Title | Status | Hype
SimMatch: Semi-supervised Learning with Similarity Matching | Code | 0
DIBERT: Dependency Injected Bidirectional Encoder Representations from Transformers | Code | 0
Gating Mechanisms for Combining Character and Word-level Word Representations: An Empirical Study | Code | 0
Description and Evaluation of Semantic Similarity Measures Approaches | Code | 0
KNN-Defense: Defense against 3D Adversarial Point Clouds using Nearest-Neighbor Search | Code | 0
One Size Fits All for Semantic Shifts: Adaptive Prompt Tuning for Continual Learning | Code | 0
Def2Vec: Extensible Word Embeddings from Dictionary Definitions | Code | 0
Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling | Code | 0
On Learning Text Style Transfer with Direct Rewards | Code | 0
Text-in-Context: Token-Level Error Detection for Table-to-Text Generation | Code | 0
reCSE: Portable Reshaping Features for Sentence Embedding in Self-supervised Contrastive Learning | Code | 0
SDA: Simple Discrete Augmentation for Contrastive Sentence Representation Learning | Code | 0
Single-View Graph Contrastive Learning with Soft Neighborhood Awareness | Code | 0
Deep Metric Learning Beyond Binary Supervision | Code | 0
Agile Effort Estimation: Have We Solved the Problem Yet? Insights From A Replication Study | Code | 0
From Unimodal to Multimodal: Scaling up Projectors to Align Modalities | Code | 0
Text Representation Distillation via Information Bottleneck Principle | Code | 0
Decoupling Semantic Similarity from Spatial Alignment for Neural Networks | Code | 0
SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries | Code | 0
From Stance to Concern: Adaptation of Propositional Analysis to New Tasks and Domains | Code | 0
FlowRetrieval: Flow-Guided Data Retrieval for Few-Shot Imitation Learning | Code | 0
Deconstruct to Reconstruct a Configurable Evaluation Metric for Open-Domain Dialogue Systems | Code | 0
Finnish resources for evaluating language model semantics | Code | 0
Language-agnostic Representation from Multilingual Sentence Encoders for Cross-lingual Similarity Estimation | Code | 0
Import2vec - Learning Embeddings for Software Libraries | Code | 0
Ontology-based Semantic Similarity Measures for Clustering Medical Concepts in Drug Safety | Code | 0
Second-Order NLP Adversarial Examples | Code | 0
OPA2Vec: combining formal and informal content of biomedical ontologies to improve similarity-based prediction | Code | 0
20min-XD: A Comparable Corpus of Swiss News Articles | Code | 0
Size vs. Structure in Training Corpora for Word Embedding Models: Araneum Russicum Maximum and Russian National Corpus | Code | 0
A Semantics-Based Measure of Emoji Similarity | Code | 0
SLPL SHROOM at SemEval2024 Task 06: A comprehensive study on models ability to detect hallucination | Code | 0
TFW2V: An Enhanced Document Similarity Method for the Morphologically Rich Finnish Language | Code | 0
OrderBkd: Textual backdoor attack through repositioning | Code | 0
FFCI: A Framework for Interpretable Automatic Evaluation of Summarization | Code | 0
SMARAGD: Learning SMatch for Accurate and Rapid Approximate Graph Distance | Code | 0
Large-Scale Evaluation of Topic Models and Dimensionality Reduction Methods for 2D Text Spatialization | Code | 0
Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing | Code | 0
SeFNet: Bridging Tabular Datasets with Semantic Feature Nets | Code | 0
A Semantic Relevance Based Neural Network for Text Summarization and Text Simplification | Code | 0
Capturing Semantic Similarity for Entity Linking with Convolutional Neural Networks | Code | 0
Selective Text Augmentation with Word Roles for Low-Resource Text Classification | Code | 0
LDIR: Low-Dimensional Dense and Interpretable Text Embeddings with Relative Representations | Code | 0
De-Conflated Semantic Representations | Code | 0
Are you tough enough? Framework for Robustness Validation of Machine Comprehension Systems | Code | 0
Are we describing the same sound? An analysis of word embedding spaces of expressive piano performance | Code | 0
ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-Translation | Code | 0
Few-shot Hybrid Domain Adaptation of Image Generators | Code | 0
Learning Composition Models for Phrase Embeddings | Code | 0
Self-Judge: Selective Instruction Following with Alignment Self-Evaluation | Code | 0
Page 44 of 48

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SMARTRoBERTa | Dev Pearson Correlation | 92.8 | | Unverified
2 | DeBERTa (large) | Accuracy | 92.5 | | Unverified
3 | SMART-BERT | Dev Pearson Correlation | 90 | | Unverified
4 | MT-DNN-SMART | Pearson Correlation | 0.93 | | Unverified
5 | StructBERTRoBERTa ensemble | Pearson Correlation | 0.93 | | Unverified
6 | Mnet-Sim | Pearson Correlation | 0.93 | | Unverified
7 | XLNet (single model) | Pearson Correlation | 0.93 | | Unverified
8 | ALBERT | Pearson Correlation | 0.93 | | Unverified
9 | T5-11B | Pearson Correlation | 0.93 | | Unverified
10 | RoBERTa | Pearson Correlation | 0.92 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AnglE-UAE | Spearman Correlation | 84.54 | | Unverified
2 | ST5-XXL | Spearman Correlation | 82.63 | | Unverified
3 | ST5-Large | Spearman Correlation | 81.83 | | Unverified
4 | ST5-XL | Spearman Correlation | 81.66 | | Unverified
5 | ST5-Base | Spearman Correlation | 81.14 | | Unverified
6 | MPNet-multilingual | Spearman Correlation | 80.73 | | Unverified
7 | SGPT-5.8B-nli | Spearman Correlation | 80.53 | | Unverified
8 | MPNet | Spearman Correlation | 80.28 | | Unverified
9 | MiniLM-L12 | Spearman Correlation | 79.8 | | Unverified
10 | SimCSE-BERT-sup | Spearman Correlation | 79.12 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MT-DNN-SMART | Accuracy | 93.7 | | Unverified
2 | ALBERT | Accuracy | 93.4 | | Unverified
3 | RoBERTa (ensemble) | Accuracy | 92.3 | | Unverified
4 | BigBird | F1 | 91.5 | | Unverified
5 | StructBERTRoBERTa ensemble | Accuracy | 91.5 | | Unverified
6 | FLOATER-large | Accuracy | 91.4 | | Unverified
7 | SMART | Accuracy | 91.3 | | Unverified
8 | RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned) | Accuracy | 91 | | Unverified
9 | RoBERTa-large 355M + Entailment as Few-shot Learner | F1 | 91 | | Unverified
10 | SpanBERT | Accuracy | 90.9 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.82 | | Unverified
2 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.82 | | Unverified
3 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.82 | | Unverified
4 | SimCSE-RoBERTalarge | Spearman Correlation | 0.82 | | Unverified
5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.81 | | Unverified
6 | SentenceBERT | Spearman Correlation | 0.75 | | Unverified
7 | SRoBERTa-NLI-base | Spearman Correlation | 0.74 | | Unverified
8 | SRoBERTa-NLI-large | Spearman Correlation | 0.74 | | Unverified
9 | Dino (STS/̄🦕) | Spearman Correlation | 0.74 | | Unverified
10 | SBERT-NLI-large | Spearman Correlation | 0.74 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AnglE-LLaMA-7B | Spearman Correlation | 0.91 | | Unverified
2 | AnglE-LLaMA-7B-v2 | Spearman Correlation | 0.91 | | Unverified
3 | PromptEOL+CSE+LLaMA-30B | Spearman Correlation | 0.9 | | Unverified
4 | PromptEOL+CSE+OPT-13B | Spearman Correlation | 0.9 | | Unverified
5 | PromptEOL+CSE+OPT-2.7B | Spearman Correlation | 0.9 | | Unverified
6 | PromCSE-RoBERTa-large (0.355B) | Spearman Correlation | 0.89 | | Unverified
7 | Trans-Encoder-BERT-large-bi (unsup.) | Spearman Correlation | 0.89 | | Unverified
8 | Trans-Encoder-BERT-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified
9 | Trans-Encoder-RoBERTa-large-cross (unsup.) | Spearman Correlation | 0.88 | | Unverified
10 | SimCSE-RoBERTa-large | Spearman Correlation | 0.87 | | Unverified