SOTAVerified

Semantic Similarity

The main objective of Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based methods and corpus-based (distributional) methods.

Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
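
As a rough illustration of the corpus-based, distributional approach described above, the sketch below compares words by the cosine of the angle between their context-count vectors. The vectors in `TOY_VECTORS` and the context words they stand for are hypothetical values invented for this example, not data from any paper listed here; real systems would derive them from a corpus or a trained encoder.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical co-occurrence counts with the context words
# ("road", "drive", "passenger", "pet", "fur"). Invented for illustration only.
TOY_VECTORS = {
    "car": [8, 9, 4, 0, 0],
    "bus": [7, 6, 9, 0, 0],
    "cat": [1, 0, 0, 9, 8],
}


def similarity(word_a, word_b):
    """Look up both toy vectors and return their cosine similarity."""
    return cosine_similarity(TOY_VECTORS[word_a], TOY_VECTORS[word_b])


if __name__ == "__main__":
    print("car vs bus:", round(similarity("car", "bus"), 3))
    print("car vs cat:", round(similarity("car", "cat"), 3))
```

With these toy counts, "car" scores closer to "bus" than to "cat", matching the example in the description. A knowledge-based method would instead use a lexical resource such as a taxonomy, which this sketch does not cover.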

Papers

Showing 201-250 of 1564 papers

Title | Status | Hype
SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages | Code | 1
Balancing Lexical and Semantic Quality in Abstractive Summarization | Code | 1
Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report Generation | Code | 1
Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data | Code | 1
Label Noise Reduction in Entity Typing by Heterogeneous Partial-Label Embedding | Code | 1
Benchmarking Transferable Adversarial Attacks | Code | 1
Efficient Neural Ranking using Forward Indexes | Code | 1
Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders | Code | 1
FedSSA: Semantic Similarity-based Aggregation for Efficient Model-Heterogeneous Personalized Federated Learning | Code | 1
Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors | Code | 1
Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval | Code | 1
SneakyPrompt: Jailbreaking Text-to-image Generative Models | Code | 1
Few-Shot Image Classification Benchmarks are Too Far From Reality: Build Back Better with Semantic Task Sampling | Code | 1
SPICE: Semantic Pseudo-labeling for Image Clustering | Code | 1
Meaning Representations from Trajectories in Autoregressive Models | Code | 1
Frequency-driven Imperceptible Adversarial Attack on Semantic Similarity | Code | 1
Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles | Code | 1
Symmetrical Synthesis for Deep Metric Learning | Code | 1
One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling | Code | 1
Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models | Code | 1
tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection | Code | 1
Generating Natural Language Attacks in a Hard Label Black Box Setting | Code | 1
Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval | Code | 0
INO at Factify 2: Structure Coherence based Multi-Modal Fact Verification | Code | 0
Instance Smoothed Contrastive Learning for Unsupervised Sentence Embedding | Code | 0
Interpretable Word Sense Representations via Definition Generation: The Case of Semantic Change Analysis | Code | 0
Improving Semantic Relevance for Sequence-to-Sequence Learning of Chinese Social Media Text Summarization | Code | 0
ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge | Code | 0
Improving Adversarial Robustness with Self-Paced Hard-Class Pair Reweighting | Code | 0
Image Similarity using An Ensemble of Context-Sensitive Models | Code | 0
Hypercube-RAG: Hypercube-Based Retrieval-Augmented Generation for In-domain Scientific Question-Answering | Code | 0
Effective and Imperceptible Adversarial Textual Attack via Multi-objectivization | Code | 0
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings | Code | 0
How does BERT capture semantics? A closer look at polysemous words | Code | 0
An Unsupervised Word Sense Disambiguation System for Under-Resourced Languages | Code | 0
HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text | Code | 0
A Generalized Method for Automated Multilingual Loanword Detection | Code | 0
Bridging the Gap between Structural and Semantic Similarity in Diverse Planning | Code | 0
Historical Ink: Semantic Shift Detection for 19th Century Spanish | Code | 0
HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization | Code | 0
Identifying Cognate Sets Across Dictionaries of Related Languages | Code | 0
Identifying Semantic Divergences in Parallel Text without Annotations | Code | 0
Bridging LLM-Generated Code and Requirements: Reverse Generation technique and SBC Metric for Developer Insights | Code | 0
Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks | Code | 0
20min-XD: A Comparable Corpus of Swiss News Articles | Code | 0
Improving Long Document Topic Segmentation Models With Enhanced Coherence Modeling | Code | 0
Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity | Code | 0
Calculating the similarity between words and sentences using a lexical database and corpus statistics | Code | 0
Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations | Code | 0
Breaking the Clusters: Uniformity-Optimization for Text-Based Sequential Recommendation | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 93.38 | - | Unverified
2 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 91.51 | - | Unverified
3 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 90.69 | - | Unverified
4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.16 | - | Unverified
5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.12 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.75 | - | Unverified
2 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | - | Unverified
3 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | - | Unverified
4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 86.8 | - | Unverified
5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 84.21 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Doc2VecC | MSE | 0.31 | - | Unverified
2 | LSTM (Tai et al., 2015) | MSE | 0.28 | - | Unverified
3 | Bidirectional LSTM (Tai et al., 2015) | MSE | 0.27 | - | Unverified
4 | combine-skip (Kiros et al., 2015) | MSE | 0.27 | - | Unverified
5 | Dependency Tree-LSTM (Tai et al., 2015) | MSE | 0.25 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BioLinkBERT (large) | Pearson Correlation | 0.94 | - | Unverified
2 | BioLinkBERT (base) | Pearson Correlation | 0.93 | - | Unverified
3 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.92 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MacBERT-large | Macro F1 | 85.6 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CharacterBERT (base, medical, ensemble) | Pearson Correlation | 85.62 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.85 | - | Unverified
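
Two of the metrics reported above, Pearson Correlation and MSE, compare a model's predicted similarity scores against human-annotated gold scores. The sketch below shows how each could be computed from first principles. The function names and the score lists in the demo are hypothetical, chosen only for illustration; they do not reproduce any benchmark's data or official evaluation script.

```python
import math


def pearson_correlation(gold, predicted):
    """Pearson r between two equal-length score lists."""
    if len(gold) != len(predicted) or len(gold) < 2:
        raise ValueError("need two equal-length lists with at least 2 items")
    n = len(gold)
    mean_g = sum(gold) / n
    mean_p = sum(predicted) / n
    # Covariance and the two standard deviations, all without the 1/n factor
    # because it cancels in the ratio.
    cov = sum((g - mean_g) * (p - mean_p) for g, p in zip(gold, predicted))
    dev_g = math.sqrt(sum((g - mean_g) ** 2 for g in gold))
    dev_p = math.sqrt(sum((p - mean_p) ** 2 for p in predicted))
    if dev_g == 0 or dev_p == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / (dev_g * dev_p)


def mean_squared_error(gold, predicted):
    """Average squared difference between gold and predicted scores."""
    if len(gold) != len(predicted) or not gold:
        raise ValueError("need two non-empty, equal-length lists")
    return sum((g - p) ** 2 for g, p in zip(gold, predicted)) / len(gold)


if __name__ == "__main__":
    # Hypothetical similarity scores on a 0-5 scale, for illustration only.
    gold_scores = [4.5, 1.0, 3.2, 0.4, 2.8]
    model_scores = [4.1, 1.5, 3.0, 0.9, 2.2]
    print("Pearson r:", pearson_correlation(gold_scores, model_scores))
    print("MSE:", mean_squared_error(gold_scores, model_scores))
```

Higher is better for Pearson Correlation and lower is better for MSE. The listing reports some Pearson values as fractions (0.94) and one as 85.62; reading the latter as r × 100 is an assumption, since the page does not state the scale.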