SOTAVerified

Semantic Similarity

The main objective of Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based approaches and corpus-based, distributional methods.

Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
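
As a rough illustration of the corpus-based, distributional approach described above, the sketch below compares words by the cosine similarity of their context-count vectors. The context words and counts are invented for illustration only (they do not come from any real corpus or from the source paper), and cosine similarity is just one common choice of measure, not necessarily the one any listed paper uses.

```python
import math

# Hypothetical context words and co-occurrence counts, made up for this sketch.
# Each vector records how often a target word appears near each context word.
CONTEXTS = ["drive", "road", "passenger", "purr", "pet"]
VECTORS = {
    "car": [9, 8, 4, 0, 0],
    "bus": [7, 9, 8, 0, 0],
    "cat": [0, 1, 0, 8, 9],
}


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        # A word with no observed contexts has no defined direction; treat as 0.
        return 0.0
    return dot / (norm_u * norm_v)


car_bus = cosine_similarity(VECTORS["car"], VECTORS["bus"])
car_cat = cosine_similarity(VECTORS["car"], VECTORS["cat"])

print(f"car vs bus: {car_bus:.3f}")
print(f"car vs cat: {car_cat:.3f}")
```

With these toy counts, “car” scores much closer to “bus” than to “cat”, matching the example in the description. A knowledge-based approach would instead derive the score from a hand-built resource such as a lexical taxonomy.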

Papers

Showing 1451-1500 of 1564 papers

Title | Status | Hype
UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation | Code | 0
LyricSIM: A novel Dataset and Benchmark for Similarity Detection in Spanish Song LyricS | Code | 0
Textual analysis of artificial intelligence manuscripts reveals features associated with peer review outcome | Code | 0
Urban Traffic Accident Risk Prediction Revisited: Regionality, Proximity, Similarity and Sparsity | Code | 0
Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Code | 0
Investigating the Frequency Distortion of Word Embeddings and Its Impact on Bias Metrics | Code | 0
Concept-Level Explainability for Auditing & Steering LLM Responses | Code | 0
Making Fast Graph-based Algorithms with Graph Metric Embeddings | Code | 0
Rematch: Robust and Efficient Matching of Local Knowledge Graphs to Improve Structural and Semantic Similarity | Code | 0
Representation learning for very short texts using weighted word embedding aggregation | Code | 0
Augmenting Reddit Posts to Determine Wellness Dimensions impacting Mental Health | Code | 0
Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe | Code | 0
FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT | Code | 0
FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language based on Textually Represented Environments | Code | 0
The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining | Code | 0
MathLingBudapest: Concept Networks for Semantic Similarity | Code | 0
Augmenting Neural Response Generation with Context-Aware Topical Attention | Code | 0
Short Text Hashing Improved by Integrating Multi-Granularity Topics and Tags | Code | 0
Fake News Detection After LLM Laundering: Measurement and Explanation | Code | 0
Supervised Online Hashing via Hadamard Codebook Learning | Code | 0
The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations | Code | 0
CompiLIG at SemEval-2017 Task 1: Cross-Language Plagiarism Detection Methods for Semantic Textual Similarity | Code | 0
Comparative Evaluation of Label-Agnostic Selection Bias in Multilingual Hate Speech Datasets | Code | 0
Adversarial Self-Attention for Language Understanding | Code | 0
Comment Ranking Diversification in Forum Discussions | Code | 0
Audio Caption in a Car Setting with a Sentence-Level Loss | Code | 0
Measuring Semantic Similarity of Words Using Concept Networks | Code | 0
Retrofitting Multilingual Sentence Embeddings with Abstract Meaning Representation | Code | 0
FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection | Code | 0
Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity | Code | 0
User-in-the-loop Adaptive Intent Detection for Instructable Digital Assistant | Code | 0
Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning | Code | 0
Retro-li: Small-Scale Retrieval Augmented Generation Supporting Noisy Similarity Searches and Domain Shift Generalization | Code | 0
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval | Code | 0
Asymmetric Visual Semantic Embedding Framework for Efficient Vision-Language Alignment | Code | 0
Exploring Anisotropy and Outliers in Multilingual Language Models for Cross-Lingual Semantic Sentence Similarity | Code | 0
Meta-Context Transformers for Domain-Specific Response Generation | Code | 0
Revisiting Cosine Similarity via Normalized ICA-transformed Embeddings | Code | 0
Assessing Wordnets with WordNet Embeddings | Code | 0
Word Similarity Datasets for Thai: Construction and Evaluation | Code | 0
Robust Privacy Amidst Innovation with Large Language Models Through a Critical Assessment of the Risks | Code | 0
The Undesirable Dependence on Frequency of Gender Bias Metrics Based on Word Embeddings | Code | 0
Cognition-aware Cognate Detection | Code | 0
CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation | Code | 0
Micro-video Tagging via Jointly Modeling Social Influence and Tag Relation | Code | 0
A Semantics-Based Measure of Emoji Similarity | Code | 0
SimMatch: Semi-supervised Learning with Similarity Matching | Code | 0
Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022 | Code | 0
Explaining Text Similarity in Transformer Models | Code | 0
Single-View Graph Contrastive Learning with Soft Neighborhood Awareness | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 93.38 | - | Unverified
2 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 91.51 | - | Unverified
3 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 90.69 | - | Unverified
4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.16 | - | Unverified
5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.12 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.75 | - | Unverified
2 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | - | Unverified
3 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | - | Unverified
4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 86.8 | - | Unverified
5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 84.21 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Doc2VecC | MSE | 0.31 | - | Unverified
2 | LSTM (Tai et al., 2015) | MSE | 0.28 | - | Unverified
3 | Bidirectional LSTM (Tai et al., 2015) | MSE | 0.27 | - | Unverified
4 | combine-skip (Kiros et al., 2015) | MSE | 0.27 | - | Unverified
5 | Dependency Tree-LSTM (Tai et al., 2015) | MSE | 0.25 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BioLinkBERT (large) | Pearson Correlation | 0.94 | - | Unverified
2 | BioLinkBERT (base) | Pearson Correlation | 0.93 | - | Unverified
3 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.92 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MacBERT-large | Macro F1 | 85.6 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CharacterBERT (base, medical, ensemble) | Pearson Correlation | 85.62 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.85 | - | Unverified