SOTAVerified

Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 51100 of 2381 papers

TitleStatusHype
CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security ResearchCode1
Emotional RAG: Enhancing Role-Playing Agents through Emotional RetrievalCode1
Demystifying and Extracting Fault-indicating Information from Logs for Failure DiagnosisCode1
DataSculpt: Crafting Data Landscapes for Long-Context LLMs through Multi-Objective PartitioningCode1
Distinguish Confusion in Legal Judgment Prediction via Revised Relation KnowledgeCode1
Unsupervised Episode Detection for Large-Scale News EventsCode1
DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous DrivingCode1
Towards Bridging the Cross-modal Semantic Gap for Multi-modal RecommendationCode1
One Prompt is not Enough: Automated Construction of a Mixture-of-Expert PromptsCode1
3D-AVS: LiDAR-based 3D Auto-Vocabulary SegmentationCode1
Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report GenerationCode1
Calibrating Higher-Order Statistics for Few-Shot Class-Incremental Learning with Pre-trained Vision TransformersCode1
Retrieval-Augmented Open-Vocabulary Object DetectionCode1
SemEval-2024 Task 1: Semantic Textual Relatedness for African and Asian LanguagesCode1
KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive LearningCode1
Learning to Rematch Mismatched Pairs for Robust Cross-Modal RetrievalCode1
Meta-Task Prompting Elicits Embeddings from Large Language ModelsCode1
NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long DocumentsCode1
DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical DomainCode1
Pixel Sentence Representation LearningCode1
SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 LanguagesCode1
Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT ModelsCode1
Benchmarking Transferable Adversarial AttacksCode1
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation LearningCode1
Noise Contrastive Estimation-based Matching Framework for Low-Resource Security Attack Pattern RecognitionCode1
HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image RetrievalCode1
Do Vision and Language Encoders Represent the World Similarly?Code1
Explicitly Integrating Judgment Prediction with Legal Document Retrieval: A Law-Guided Generative ApproachCode1
Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language ModelsCode1
TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model ReasoningCode1
FedSSA: Semantic Similarity-based Aggregation for Efficient Model-Heterogeneous Personalized Federated LearningCode1
Mining Gaze for Contrastive Learning toward Computer-Assisted DiagnosisCode1
Encoding Surgical Videos as Latent Spatiotemporal Graphs for Object and Anatomy-Driven ReasoningCode1
Few-Shot Class-Incremental Learning via Training-Free Prototype CalibrationCode1
AutoKG: Efficient Automated Knowledge Graph Generation for Language ModelsCode1
Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic RepresentationsCode1
An Efficient Self-Supervised Cross-View Training For Sentence EmbeddingCode1
TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in RainCode1
Meaning Representations from Trajectories in Autoregressive ModelsCode1
Visual Grounding Helps Learn Word Meanings in Low-Data RegimesCode1
Context Compression for Auto-regressive Transformers with Sentinel TokensCode1
AstroCLIP: A Cross-Modal Foundation Model for GalaxiesCode1
Sieve: Multimodal Dataset Pruning Using Image Captioning ModelsCode1
InstructERC: Reforming Emotion Recognition in Conversation with Multi-task Retrieval-Augmented Large Language ModelsCode1
Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive LearningCode1
LinkTransformer: A Unified Package for Record Linkage with Transformer Language ModelsCode1
CALM : A Multi-task Benchmark for Comprehensive Assessment of Language Model BiasCode1
Sentence Embedding Models for Ancient Greek Using Multilingual Knowledge DistillationCode1
Audio-Visual Class-Incremental LearningCode1
Deep Fusion Transformer Network with Weighted Vector-Wise Keypoints Voting for Robust 6D Object Pose EstimationCode1
Show:102550
← PrevPage 2 of 48Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SMARTRoBERTaDev Pearson Correlation92.8Unverified
2DeBERTa (large)Accuracy92.5Unverified
3SMART-BERTDev Pearson Correlation90Unverified
4MT-DNN-SMARTPearson Correlation0.93Unverified
5StructBERTRoBERTa ensemblePearson Correlation0.93Unverified
6Mnet-SimPearson Correlation0.93Unverified
7XLNet (single model)Pearson Correlation0.93Unverified
8T5-11BPearson Correlation0.93Unverified
9ALBERTPearson Correlation0.93Unverified
10RoBERTaPearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-UAESpearman Correlation84.54Unverified
2ST5-XXLSpearman Correlation82.63Unverified
3ST5-LargeSpearman Correlation81.83Unverified
4ST5-XLSpearman Correlation81.66Unverified
5ST5-BaseSpearman Correlation81.14Unverified
6MPNet-multilingualSpearman Correlation80.73Unverified
7SGPT-5.8B-nliSpearman Correlation80.53Unverified
8MPNetSpearman Correlation80.28Unverified
9MiniLM-L12Spearman Correlation79.8Unverified
10SimCSE-BERT-supSpearman Correlation79.12Unverified
#ModelMetricClaimedVerifiedStatus
1MT-DNN-SMARTAccuracy93.7Unverified
2ALBERTAccuracy93.4Unverified
3RoBERTa (ensemble)Accuracy92.3Unverified
4BigBirdF191.5Unverified
5StructBERTRoBERTa ensembleAccuracy91.5Unverified
6FLOATER-largeAccuracy91.4Unverified
7SMARTAccuracy91.3Unverified
8RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)Accuracy91Unverified
9RoBERTa-large 355M + Entailment as Few-shot LearnerF191Unverified
10SpanBERTAccuracy90.9Unverified
#ModelMetricClaimedVerifiedStatus
1PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.82Unverified
2PromptEOL+CSE+LLaMA-30BSpearman Correlation0.82Unverified
3PromptEOL+CSE+OPT-13BSpearman Correlation0.82Unverified
4SimCSE-RoBERTalargeSpearman Correlation0.82Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.81Unverified
6SentenceBERTSpearman Correlation0.75Unverified
7SRoBERTa-NLI-baseSpearman Correlation0.74Unverified
8SRoBERTa-NLI-largeSpearman Correlation0.74Unverified
9Dino (STS/̄🦕)Spearman Correlation0.74Unverified
10SBERT-NLI-largeSpearman Correlation0.74Unverified
#ModelMetricClaimedVerifiedStatus
1AnglE-LLaMA-7BSpearman Correlation0.91Unverified
2AnglE-LLaMA-7B-v2Spearman Correlation0.91Unverified
3PromptEOL+CSE+LLaMA-30BSpearman Correlation0.9Unverified
4PromptEOL+CSE+OPT-13BSpearman Correlation0.9Unverified
5PromptEOL+CSE+OPT-2.7BSpearman Correlation0.9Unverified
6PromCSE-RoBERTa-large (0.355B)Spearman Correlation0.89Unverified
7Trans-Encoder-BERT-large-bi (unsup.)Spearman Correlation0.89Unverified
8Trans-Encoder-BERT-large-cross (unsup.)Spearman Correlation0.88Unverified
9Trans-Encoder-RoBERTa-large-cross (unsup.)Spearman Correlation0.88Unverified
10SimCSE-RoBERTa-largeSpearman Correlation0.87Unverified