SOTAVerified

Semantic Similarity

The main objective of Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based approaches and corpus-based, distributional methods.

Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
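As a rough illustration of the corpus-based, distributional approach mentioned above, the sketch below compares words by the cosine similarity of their context vectors. The vectors and context words are invented toy values chosen only to mirror the “car” / “bus” / “cat” example; they are not taken from any real corpus or from the cited paper.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # Convention for this sketch: a zero vector is similar to nothing.
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical co-occurrence counts (assumption, not real data) with the
# context words ["road", "drive", "passenger", "pet", "fur"].
toy_vectors = {
    "car": [8, 9, 4, 0, 0],
    "bus": [7, 6, 9, 0, 0],
    "cat": [1, 0, 0, 9, 8],
}

sim_car_bus = cosine_similarity(toy_vectors["car"], toy_vectors["bus"])
sim_car_cat = cosine_similarity(toy_vectors["car"], toy_vectors["cat"])

print(f"car vs bus: {sim_car_bus:.3f}")
print(f"car vs cat: {sim_car_cat:.3f}")
```

With these toy counts, “car” scores higher against “bus” than against “cat”, because the first two words share vehicle-related contexts. Knowledge-based approaches would instead derive the score from a curated resource such as a taxonomy, which this sketch does not cover.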

Papers

Showing 501-550 of 1564 papers

| Title | Status | Hype |
| --- | --- | --- |
| Automatic Design of Semantic Similarity Ensembles Using Grammatical Evolution | Code | 0 |
| Transfer learning for semantic similarity measures based on symbolic regression | Code | 0 |
| A Massive Scale Semantic Similarity Dataset of Historical English | | 0 |
| DialoGPS: Dialogue Path Sampling in Continuous Semantic Space for Data Augmentation in Multi-Turn Conversations | | 0 |
| Large Language Models as Annotators: Enhancing Generalization of NLP Models at Minimal Cost | | 0 |
| Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles | Code | 1 |
| SeFNet: Bridging Tabular Datasets with Semantic Feature Nets | Code | 0 |
| A Relaxed Optimization Approach for Adversarial Attacks against Neural Machine Translation Models | | 0 |
| Unbalanced Optimal Transport for Unbalanced Word Alignment | Code | 1 |
| Supervised Knowledge May Hurt Novel Class Discovery Performance | Code | 0 |
| Augmenting Reddit Posts to Determine Wellness Dimensions impacting Mental Health | Code | 0 |
| LyricSIM: A novel Dataset and Benchmark for Similarity Detection in Spanish Song LyricS | Code | 0 |
| Vocabulary-free Image Classification | Code | 1 |
| Estimating Semantic Similarity between In-Domain and Out-of-Domain Samples | Code | 0 |
| Exploring Anisotropy and Outliers in Multilingual Language Models for Cross-Lingual Semantic Sentence Similarity | Code | 0 |
| RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment | | 0 |
| Real-World Image Variation by Aligning Diffusion Inversion Chain | Code | 1 |
| Datasets for Portuguese Legal Semantic Textual Similarity: Comparing weak supervision and an annotation process approaches | Code | 0 |
| Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making | Code | 0 |
| Evaluating Open-Domain Dialogues in Latent Space with Next Sentence Prediction and Mutual Information | Code | 0 |
| RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation | | 0 |
| ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-Translation | Code | 0 |
| AlignScore: Evaluating Factual Consistency with a Unified Alignment Function | Code | 4 |
| Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations | Code | 0 |
| SAMScore: A Content Structural Similarity Metric for Image Translation Evaluation | Code | 1 |
| C-STS: Conditional Semantic Textual Similarity | Code | 1 |
| FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models | Code | 1 |
| Modeling Empathic Similarity in Personal Narratives | | 0 |
| SneakyPrompt: Jailbreaking Text-to-image Generative Models | Code | 1 |
| Interpretable Word Sense Representations via Definition Generation: The Case of Semantic Change Analysis | Code | 0 |
| Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs | Code | 1 |
| Balancing Lexical and Semantic Quality in Abstractive Summarization | Code | 1 |
| Semantic Similarity Measure of Natural Language Text through Machine Learning and a Keyword-Aware Cross-Encoder-Ranking Summarizer -- A Case Study Using UCGIS GIS&T Body of Knowledge | | 0 |
| Similarity-weighted Construction of Contextualized Commonsense Knowledge Graphs for Knowledge-intense Argumentation Tasks | Code | 1 |
| PESTS: Persian_English Cross Lingual Corpus for Semantic Textual Similarity | | 0 |
| Instance Smoothed Contrastive Learning for Unsupervised Sentence Embedding | Code | 0 |
| SMATCH++: Standardized and Extended Evaluation of Semantic Graphs | Code | 1 |
| Benchmarking large language models for biomedical natural language processing applications and recommendations | Code | 1 |
| REINFOREST: Reinforcing Semantic Code Similarity for Cross-Lingual Code Search Models | Code | 0 |
| Context-Aware Semantic Similarity Measurement for Unsupervised Word Sense Disambiguation | Code | 1 |
| Unsupervised Dialogue Topic Segmentation with Topic-aware Utterance Representation | | 0 |
| Neural Keyphrase Generation: Analysis and Evaluation | | 0 |
| Deep Lifelong Cross-modal Hashing | | 0 |
| Low-resource Bilingual Dialect Lexicon Induction with Large Language Models | Code | 0 |
| Bridging Natural Language Processing and Psycholinguistics: computationally grounded semantic similarity datasets for Basque and Spanish | | 0 |
| Learning Geometry-aware Representations by Sketching | | 0 |
| PCPNet: An Efficient and Semantic-Enhanced Transformer Network for Point Cloud Prediction | Code | 1 |
| A Clustering Framework for Unsupervised and Semi-supervised New Intent Discovery | | 0 |
| Semantic Feature Verification in FLAN-T5 | | 0 |
| Efficient Audio Captioning Transformer with Patchout and Text Guidance | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 93.38 | | Unverified |
| 2 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 91.51 | | Unverified |
| 3 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 90.69 | | Unverified |
| 4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.16 | | Unverified |
| 5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.12 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.75 | | Unverified |
| 2 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | | Unverified |
| 3 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | | Unverified |
| 4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 86.8 | | Unverified |
| 5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 84.21 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Doc2VecC | MSE | 0.31 | | Unverified |
| 2 | LSTM (Tai et al., 2015) | MSE | 0.28 | | Unverified |
| 3 | Bidirectional LSTM (Tai et al., 2015) | MSE | 0.27 | | Unverified |
| 4 | combine-skip (Kiros et al., 2015) | MSE | 0.27 | | Unverified |
| 5 | Dependency Tree-LSTM (Tai et al., 2015) | MSE | 0.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BioLinkBERT (large) | Pearson Correlation | 0.94 | | Unverified |
| 2 | BioLinkBERT (base) | Pearson Correlation | 0.93 | | Unverified |
| 3 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MacBERT-large | Macro F1 | 85.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CharacterBERT (base, medical, ensemble) | Pearson Correlation | 85.62 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.85 | | Unverified |