SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 351400 of 2381 papers

TitleStatusHype
Hypernym Mercury: Token Optimization Through Semantic Field Constriction And Reconstruction From Hypernyms. A New Text Compression Method0
Jailbreaking the Text-to-Video Generative Models0
Estimating Quality in Therapeutic Conversations: A Multi-Dimensional Natural Language Processing Framework0
Sparse Attention Remapping with Clustering for Efficient LLM Decoding on PIM0
Stealthy LLM-Driven Data Poisoning Attacks Against Embedding-Based Retrieval-Augmented Recommender Systems0
R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training0
Homa at SemEval-2025 Task 5: Aligning Librarian Records with OntoAligner for Subject Tagging0
Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction0
20min-XD: A Comparable Corpus of Swiss News ArticlesCode0
ReCellTy: Domain-specific knowledge graph retrieval-augmented LLMs workflow for single-cell annotation0
Stay Hungry, Stay Foolish: On the Extended Reading Articles Generation with LLMs0
Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization0
Exploring Language Patterns of Prompts in Text-to-Image Generation and Their Impact on Visual Diversity0
Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving SafetyCode0
Semantic Similarity-Informed Bayesian Borrowing for Quantitative Signal Detection of Adverse Events0
Self-Controlled Dynamic Expansion Model for Continual Learning0
HD-RAG: Retrieval-Augmented Generation for Hybrid Documents Containing Text and Hierarchical Tables0
Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions0
Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety0
Balancing Complexity and Informativeness in LLM-Based Clustering: Finding the Goldilocks Zone0
ProtoGuard-guided PROPEL: Class-Aware Prototype Enhancement and Progressive Labeling for Incremental 3D Point Cloud Segmentation0
Horizon Scans can be accelerated using novel information retrieval and artificial intelligence tools0
SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching0
Context-Aware Human Behavior Prediction Using Multimodal Large Language Models: Challenges and Insights0
Beyond Detection: Designing AI-Resilient Assessments with Automated Feedback Tool to Foster Critical Thinking0
Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base0
A Quantitative Approach to Evaluating Open-Source EHR Systems for Indian Healthcare0
HyperFree: A Channel-adaptive and Tuning-free Foundation Model for Hyperspectral Remote Sensing Imagery0
Ontology-based Semantic Similarity Measures for Clustering Medical Concepts in Drug SafetyCode0
BeLightRec: A lightweight recommender system enhanced with BERT0
CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation0
SeLIP: Similarity Enhanced Contrastive Language Image Pretraining for Multi-modal Head MRI0
Unleashing the power of text for credit default prediction: Comparing human-written and generative AI-refined texts0
Vision Transformer Based Semantic Communications for Next Generation Wireless Networks0
CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement0
KVShare: An LLM Service System with Efficient and Effective Multi-Tenant KV Cache Reuse0
A General Close-loop Predictive Coding Framework for Auditory Working Memory0
TLAC: Two-stage LMM Augmented CLIP for Zero-Shot ClassificationCode0
Measuring Similarity in Causal Graphs: A Framework for Semantic and Structural Analysis0
Domain Adaptation for Japanese Sentence Embeddings with Contrastive Learning based on Synthetic Sentence GenerationCode0
PromptMap: An Alternative Interaction Style for AI-Based Image GenerationCode0
Asymmetric Visual Semantic Embedding Framework for Efficient Vision-Language AlignmentCode0
Are We Truly Forgetting? A Critical Re-examination of Machine Unlearning Evaluation Protocols0
MIGA: Mutual Information-Guided Attack on Denoising Models for Semantic Manipulation0
AuthorMist: Evading AI Text Detectors with Reinforcement Learning0
SEED: Towards More Accurate Semantic Evaluation for Visual Brain Decoding0
Improving RAG Retrieval via Propositional Content Extraction: a Speech Act Theory Approach0
AutoTestForge: A Multidimensional Automated Testing Framework for Natural Language Processing Models0
Token-Level Privacy in Large Language Models0
SEOE: A Scalable and Reliable Semantic Evaluation Framework for Open Domain Event Detection0
Show:102550
← PrevPage 8 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8T5-11BPearson Correlation0.93Unverified
9ALBERTPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified