SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 451500 of 2381 papers

TitleStatusHype
Def2Vec: Extensible Word Embeddings from Dictionary DefinitionsCode0
Explicitly Integrating Judgment Prediction with Legal Document Retrieval: A Law-Guided Generative ApproachCode1
Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language ModelsCode1
FedSSA: Semantic Similarity-based Aggregation for Efficient Model-Heterogeneous Personalized Federated LearningCode1
TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model ReasoningCode1
Encoding Surgical Videos as Latent Spatiotemporal Graphs for Object and Anatomy-Driven ReasoningCode1
Mining Gaze for Contrastive Learning toward Computer-Assisted DiagnosisCode1
Sim-GPT: Text Similarity via GPT Annotated DataCode0
Few-Shot Class-Incremental Learning via Training-Free Prototype CalibrationCode1
Self-Critical Alternate Learning based Semantic Broadcast Communication0
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token EmbeddingsCode0
Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement LearningCode0
A Distribution-Based Threshold for Determining Sentence Similarity0
Large Language Models as Topological Structure Enhancers for Text-Attributed Graphs0
AutoKG: Efficient Automated Knowledge Graph Generation for Language ModelsCode1
IEKM: A Model Incorporating External Keyword Matrices0
Do Smaller Language Models Answer Contextualised Questions Through Memorisation Or Generalisation?0
Portuguese FAQ for Financial Services0
One Size Fits All for Semantic Shifts: Adaptive Prompt Tuning for Continual LearningCode0
Beyond Images: An Integrative Multi-modal Approach to Chest X-Ray Report Generation0
Eval-GCSC: A New Metric for Evaluating ChatGPT's Performance in Chinese Spelling CorrectionCode0
BeLLM: Backward Dependency Enhanced Large Language Model for Sentence EmbeddingsCode2
Active Mining Sample Pair Semantics for Image-text Matching0
Text Representation Distillation via Information Bottleneck PrincipleCode0
Large-scale study of human memory for meaningful narratives0
Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic RepresentationsCode1
Sparse Contrastive Learning of Sentence Embeddings0
Unveiling Safety Vulnerabilities of Large Language Models0
An Efficient Self-Supervised Cross-View Training For Sentence EmbeddingCode1
Divide & Conquer for Entailment-aware Multi-hop Evidence Retrieval0
Relation Extraction Model Based on Semantic Enhancement Mechanism0
Contextualizing the Limits of Model & Evaluation Dataset Curation on Semantic Similarity Classification Tasks0
ProcSim: Proxy-based Confidence for Robust Similarity Learning0
TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in RainCode1
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and PredictionCode2
Few-shot Hybrid Domain Adaptation of Image GeneratorsCode0
Accelerating LLaMA Inference by Enabling Intermediate Layer Decoding via Instruction Tuning with LITE0
Translating away Translationese without Parallel Data0
The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model PretrainingCode0
Can GPT models Follow Human Summarization Guidelines? Evaluating ChatGPT and GPT-4 for Dialogue Summarization0
Topology-aware Debiased Self-supervised Graph Learning for RecommendationCode0
Meaning Representations from Trajectories in Autoregressive ModelsCode1
Chain-of-Factors Paper-Reviewer MatchingCode0
PaRaDe: Passage Ranking using Demonstrations with Large Language Models0
Prompt-based Grouping Transformer for Nucleus Detection and ClassificationCode0
Visual Grounding Helps Learn Word Meanings in Low-Data RegimesCode1
Investigating semantic subspaces of Transformer sentence embeddings through linear structural probingCode0
Improving Long Document Topic Segmentation Models With Enhanced Coherence ModelingCode0
Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation0
Noise Contrastive Estimation-based Matching Framework for Low-resource Security Attack Pattern Recognition0
Show:102550
← PrevPage 10 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8ALBERTPearson Correlation0.93Unverified
9T5-11BPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified