SOTAVerified

Semantic Similarity

The main objective Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based approaches and corpus-based, distributional methods.

Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection

Papers

Showing 651700 of 1564 papers

TitleStatusHype
Supervised Knowledge May Hurt Novel Class Discovery PerformanceCode0
LyricSIM: A novel Dataset and Benchmark for Similarity Detection in Spanish Song LyricSCode0
Estimating Semantic Similarity between In-Domain and Out-of-Domain SamplesCode0
Exploring Anisotropy and Outliers in Multilingual Language Models for Cross-Lingual Semantic Sentence SimilarityCode0
RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment0
Datasets for Portuguese Legal Semantic Textual Similarity: Comparing weak supervision and an annotation process approachesCode0
Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision MakingCode0
Evaluating Open-Domain Dialogues in Latent Space with Next Sentence Prediction and Mutual InformationCode0
RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation0
ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-TranslationCode0
Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional OperationsCode0
Modeling Empathic Similarity in Personal Narratives0
Interpretable Word Sense Representations via Definition Generation: The Case of Semantic Change AnalysisCode0
Semantic Similarity Measure of Natural Language Text through Machine Learning and a Keyword-Aware Cross-Encoder-Ranking Summarizer -- A Case Study Using UCGIS GIS&T Body of Knowledge0
PESTS: Persian_English Cross Lingual Corpus for Semantic Textual Similarity0
Instance Smoothed Contrastive Learning for Unsupervised Sentence EmbeddingCode0
REINFOREST: Reinforcing Semantic Code Similarity for Cross-Lingual Code Search ModelsCode0
Unsupervised Dialogue Topic Segmentation with Topic-aware Utterance Representation0
Neural Keyphrase Generation: Analysis and Evaluation0
Deep Lifelong Cross-modal Hashing0
Bridging Natural Language Processing and Psycholinguistics: computationally grounded semantic similarity datasets for Basque and Spanish0
Low-resource Bilingual Dialect Lexicon Induction with Large Language ModelsCode0
Learning Geometry-aware Representations by Sketching0
A Clustering Framework for Unsupervised and Semi-supervised New Intent Discovery0
Semantic Feature Verification in FLAN-T50
Efficient Audio Captioning Transformer with Patchout and Text Guidance0
SuperDisco: Super-Class Discovery Improves Visual Recognition for the Long-Tail0
Using Semantic Similarity and Text Embedding to Measure the Social Media Echo of Strategic Communications0
LEURN: Learning Explainable Univariate Rules with Neural Networks0
A Novel Patent Similarity Measurement Methodology: Semantic Distance and Technological DistanceCode0
Micro-video Tagging via Jointly Modeling Social Influence and Tag RelationCode0
INO at Factify 2: Structure Coherence based Multi-Modal Fact VerificationCode0
AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models0
NapSS: Paragraph-level Medical Text Simplification via Narrative Prompting and Sentence-matching SummarizationCode0
A Parametric Similarity Method: Comparative Experiments based on Semantically Annotated Large Datasets0
Analyzing the impact of climate change on critical infrastructure from the scientific literature: A weakly supervised NLP approach0
How to choose "Good" Samples for Text Data Augmentation0
TransFool: An Adversarial Attack against Neural Machine Translation ModelsCode0
KNNs of Semantic Encodings for Rating Prediction0
uHelp: intelligent volunteer search for mutual help communities0
ClusterLog: Clustering Logs for Effective Log-based Anomaly Detection0
Syntactically Robust Training on Partially-Observed Data for Open Information ExtractionCode0
Prompting Large Language Model for Machine Translation: A Case Study0
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text RetrievalCode0
Language-Informed Transfer Learning for Embodied Household Activities0
Self-Supervised Image-to-Point Distillation via Semantically Tolerant Contrastive Loss0
Universal Multimodal Representation for Language Understanding0
The Undesirable Dependence on Frequency of Gender Bias Metrics Based on Word EmbeddingsCode0
Scene-Aware Label Graph Learning for Multi-Label Image Classification0
Text-Guided Unsupervised Latent Transformation for Multi-Attribute Image Manipulation0
Show:102550
← PrevPage 14 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F193.38Unverified
2SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F191.51Unverified
3SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F190.69Unverified
4BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F189.16Unverified
5BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F189.12Unverified
#ModelMetricClaimedVerifiedStatus
1BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F189.75Unverified
2SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F189.3Unverified
3SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F189.3Unverified
4BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F186.8Unverified
5BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F184.21Unverified
#ModelMetricClaimedVerifiedStatus
1Doc2VecCMSE0.31Unverified
2LSTM (Tai et al., 2015)MSE0.28Unverified
3Bidirectional LSTM (Tai et al., 2015)MSE0.27Unverified
4combine-skip (Kiros et al., 2015)MSE0.27Unverified
5Dependency Tree-LSTM (Tai et al., 2015)MSE0.25Unverified
#ModelMetricClaimedVerifiedStatus
1BioLinkBERT (large)Pearson Correlation0.94Unverified
2BioLinkBERT (base)Pearson Correlation0.93Unverified
3NCBI_BERT(base) (P+M)Pearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1MacBERT-largeMacro F185.6Unverified
#ModelMetricClaimedVerifiedStatus
1CharacterBERT (base, medical, ensemble)Pearson Correlation85.62Unverified
#ModelMetricClaimedVerifiedStatus
1NCBI_BERT(base) (P+M)Pearson Correlation0.85Unverified