SOTAVerified

Semantic Similarity

The main objective of Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based methods and corpus-based (distributional) methods.

Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
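As a rough illustration of the corpus-based (distributional) approach, the sketch below compares words by the cosine of the angle between their context vectors. The vectors and the context words are made up for this example and do not come from any real corpus or from the cited paper; they only show how “car” can score closer to “bus” than to “cat”.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # convention chosen here for all-zero vectors
    return dot / (norm_u * norm_v)


# Hypothetical co-occurrence counts with the context words
# ["road", "drive", "pet", "fur"]; the numbers are invented for illustration.
toy_vectors = {
    "car": [8, 9, 0, 0],
    "bus": [7, 6, 0, 0],
    "cat": [1, 0, 9, 8],
}

car_bus = cosine_similarity(toy_vectors["car"], toy_vectors["bus"])
car_cat = cosine_similarity(toy_vectors["car"], toy_vectors["cat"])
print(f"car~bus: {car_bus:.3f}  car~cat: {car_cat:.3f}")
```

With these toy counts, the car/bus score is much higher than the car/cat score. In practice the vectors would typically come from corpus statistics or a trained embedding model rather than being written by hand.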

Papers

Showing 1451-1475 of 1564 papers

| Title | Status | Hype |
|---|---|---|
| UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation | Code | 0 |
| LyricSIM: A novel Dataset and Benchmark for Similarity Detection in Spanish Song LyricS | Code | 0 |
| Textual analysis of artificial intelligence manuscripts reveals features associated with peer review outcome | Code | 0 |
| Urban Traffic Accident Risk Prediction Revisited: Regionality, Proximity, Similarity and Sparsity | Code | 0 |
| Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Code | 0 |
| Investigating the Frequency Distortion of Word Embeddings and Its Impact on Bias Metrics | Code | 0 |
| Concept-Level Explainability for Auditing & Steering LLM Responses | Code | 0 |
| Making Fast Graph-based Algorithms with Graph Metric Embeddings | Code | 0 |
| Rematch: Robust and Efficient Matching of Local Knowledge Graphs to Improve Structural and Semantic Similarity | Code | 0 |
| Representation learning for very short texts using weighted word embedding aggregation | Code | 0 |
| Augmenting Reddit Posts to Determine Wellness Dimensions impacting Mental Health | Code | 0 |
| Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe | Code | 0 |
| FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT | Code | 0 |
| FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language based on Textually Represented Environments | Code | 0 |
| The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining | Code | 0 |
| MathLingBudapest: Concept Networks for Semantic Similarity | Code | 0 |
| Augmenting Neural Response Generation with Context-Aware Topical Attention | Code | 0 |
| Short Text Hashing Improved by Integrating Multi-Granularity Topics and Tags | Code | 0 |
| Fake News Detection After LLM Laundering: Measurement and Explanation | Code | 0 |
| Supervised Online Hashing via Hadamard Codebook Learning | Code | 0 |
| The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations | Code | 0 |
| CompiLIG at SemEval-2017 Task 1: Cross-Language Plagiarism Detection Methods for Semantic Textual Similarity | Code | 0 |
| Comparative Evaluation of Label-Agnostic Selection Bias in Multilingual Hate Speech Datasets | Code | 0 |
| Adversarial Self-Attention for Language Understanding | Code | 0 |
| Comment Ranking Diversification in Forum Discussions | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 93.38 | | Unverified |
| 2 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 91.51 | | Unverified |
| 3 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 90.69 | | Unverified |
| 4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.16 | | Unverified |
| 5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.12 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.75 | | Unverified |
| 2 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | | Unverified |
| 3 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | | Unverified |
| 4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 86.8 | | Unverified |
| 5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 84.21 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Doc2VecC | MSE | 0.31 | | Unverified |
| 2 | LSTM (Tai et al., 2015) | MSE | 0.28 | | Unverified |
| 3 | Bidirectional LSTM (Tai et al., 2015) | MSE | 0.27 | | Unverified |
| 4 | combine-skip (Kiros et al., 2015) | MSE | 0.27 | | Unverified |
| 5 | Dependency Tree-LSTM (Tai et al., 2015) | MSE | 0.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BioLinkBERT (large) | Pearson Correlation | 0.94 | | Unverified |
| 2 | BioLinkBERT (base) | Pearson Correlation | 0.93 | | Unverified |
| 3 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MacBERT-large | Macro F1 | 85.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CharacterBERT (base, medical, ensemble) | Pearson Correlation | 85.62 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.85 | | Unverified |
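
For readers unfamiliar with two of the metrics reported above, the following is a minimal sketch of how Pearson correlation and mean squared error (MSE) can be computed between predicted and gold similarity scores. The score lists are invented for illustration and are not taken from any benchmark listed here; the function names are likewise hypothetical.

```python
import math


def pearson_correlation(xs, ys):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def mean_squared_error(gold, predicted):
    """Average of the squared differences between gold and predicted scores."""
    return sum((g - p) ** 2 for g, p in zip(gold, predicted)) / len(gold)


# Made-up similarity scores on a 0-4 scale, for illustration only.
gold_scores = [0.0, 1.0, 2.0, 3.0, 4.0]
predicted_scores = [0.5, 1.0, 2.5, 2.5, 4.5]

r = pearson_correlation(gold_scores, predicted_scores)
mse = mean_squared_error(gold_scores, predicted_scores)
print(f"Pearson r: {r:.3f}  MSE: {mse:.3f}")
```

Pearson correlation lies between -1 and 1 (higher is better), so a reported value such as 85.62 is presumably the same quantity scaled by 100. For MSE, lower values indicate predictions closer to the gold scores.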