SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 401450 of 2381 papers

TitleStatusHype
Language-agnostic, automated assessment of listeners' speech recall using large language models0
Statistical Mechanics of Semantic Compression0
TempRetriever: Fusion-based Temporal Dense Passage Retrieval for Time-Sensitive Questions0
Towards Label-Only Membership Inference Attack against Pre-trained Large Language Models0
EnDive: A Cross-Dialect Benchmark for Fairness and Performance in Large Language Models0
How Vital is the Jurisprudential Relevance: Law Article Intervened Legal Case Retrieval and Matching0
ATEB: Evaluating and Improving Advanced NLP Tasks for Text Embedding Models0
CLIMB-3D: Continual Learning for Imbalanced 3D Instance SegmentationCode0
Uncertainty Quantification of Large Language Models through Multi-Dimensional Responses0
Constructing a Norm for Children's Scientific Drawing: Distribution Features Based on Semantic Similarity of Large Language Models0
A Meta-Evaluation of Style and Attribute Transfer Metrics0
Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic SimilarityCode0
DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model0
Evolutionary Algorithms Approach For Search Based On Semantic Document Similarity0
Event Segmentation Applications in Large Language Model Enabled Automated Recall Assessments0
Breaking the Clusters: Uniformity-Optimization for Text-Based Sequential RecommendationCode0
HopRAG: Multi-Hop Reasoning for Logic-Aware Retrieval-Augmented Generation0
FaMTEB: Massive Text Embedding Benchmark in Persian Language0
Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models0
PropNet: a White-Box and Human-Like Network for Sentence Representation0
Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples0
PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval0
Bridging LLM-Generated Code and Requirements: Reverse Generation technique and SBC Metric for Developer InsightsCode0
Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal ReasoningCode0
Can LLMs Replace Human Evaluators? An Empirical Study of LLM-as-a-Judge in Software Engineering0
Enhancing Health Information Retrieval with RAG by Prioritizing Topical Relevance and Factual Accuracy0
Detecting Backdoor Attacks via Similarity in Semantic Communication Systems0
How does a Multilingual LM Handle Multiple Languages?0
How do Humans and Language Models Reason About Creativity? A Comparative Analysis0
HSI: A Holistic Style Injector for Arbitrary Style Transfer0
Reverse Probing: Evaluating Knowledge Transfer via Finetuned Task Embeddings for Coreference Resolution0
Transformer Semantic Genetic Programming for Symbolic Regression0
Fake News Detection After LLM Laundering: Measurement and ExplanationCode0
Feedback-Aware Monte Carlo Tree Search for Efficient Information Seeking in Goal-Oriented Conversations0
Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics0
Causal Graphs Meet Thoughts: Enhancing Complex Reasoning in Graph-Augmented LLMsCode0
A Comprehensive Framework for Semantic Similarity Analysis of Human and AI-Generated Text Using Transformer Architectures and Ensemble Techniques0
2-Tier SimCSE: Elevating BERT for Robust Sentence Embeddings0
Sequence Spreading-Based Semantic Communication Under High RF Interference0
LegalGuardian: A Privacy-Preserving Framework for Secure Integration of Large Language Models in Legal Practice0
Evaluating GenAI for Simplifying Texts for Education: Improving Accuracy and Consistency for Enhanced Readability0
Consistency of Responses and Continuations Generated by Large Language Models on Social Media0
Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI0
Universal Features Guided Zero-Shot Category-Level Object Pose Estimation0
GeAR: Generation Augmented Retrieval0
Harnessing Frozen Unimodal Encoders for Flexible Multimodal AlignmentCode0
Anchor-Aware Similarity Cohesion in Target Frames Enables Predicting Temporal Moment Boundaries in 2DCode0
ProtCLIP: Function-Informed Protein Multi-Modal Learning0
Multiple References with Meaningful Variations Improve Literary Machine Translation0
HashAttention: Semantic Sparsity for Faster Inference0
Show:102550
← PrevPage 9 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8T5-11BPearson Correlation0.93Unverified
9ALBERTPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified