SOTAVerified

Semantic Similarity

The main objective Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based approaches and corpus-based, distributional methods.

Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection

Papers

Showing 15011550 of 1564 papers

TitleStatusHype
Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected VulnerabilityCode0
Evaluating Open-Domain Dialogues in Latent Space with Next Sentence Prediction and Mutual InformationCode0
MixMOOD: A systematic approach to class distribution mismatch in semi-supervised learning using deep dataset dissimilarity measuresCode0
Using Information Content to Evaluate Semantic Similarity in a TaxonomyCode0
Size vs. Structure in Training Corpora for Word Embedding Models: Araneum Russicum Maximum and Russian National CorpusCode0
Model Comparison for Semantic GroupingCode0
Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision MakingCode0
SLPL SHROOM at SemEval2024 Task 06: A comprehensive study on models ability to detect hallucinationCode0
SMARAGD: Learning SMatch for Accurate and Rapid Approximate Graph DistanceCode0
Chinese Word Sense Embedding with SememeWSD and Synonym SetCode0
Modelling Sentence Pairs with Tree-structured Attentive EncoderCode0
A Semantic Relevance Based Neural Network for Text Summarization and Text SimplificationCode0
Are you tough enough? Framework for Robustness Validation of Machine Comprehension SystemsCode0
MoralStrength: Exploiting a Moral Lexicon and Embedding Similarity for Moral Foundations PredictionCode0
More Than Meets The Eye: Semi-supervised Learning Under Non-IID DataCode0
Are we describing the same sound? An analysis of word embedding spaces of expressive piano performanceCode0
Saliency Suppressed, Semantics Surfaced: Visual Transformations in Neural Networks and the BrainCode0
MSnet: A BERT-based Network for Gendered Pronoun ResolutionCode0
A Resource-Light Method for Cross-Lingual Semantic Textual SimilarityCode0
SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt DenoisingCode0
MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM HallucinationsCode0
Causal Graphs Meet Thoughts: Enhancing Complex Reasoning in Graph-Augmented LLMsCode0
Think Globally, Embed Locally --- Locally Linear Meta-embedding of WordsCode0
Category-aware EEG image generation based on wavelet transform and contrast semantic lossCode0
Capturing Semantic Similarity for Entity Linking with Convolutional Neural NetworksCode0
Soft Alignment Objectives for Robust Adaptation of Language GenerationCode0
Scalable Attentive Sentence-Pair Modeling via Distilled Sentence EmbeddingCode0
Multilingual LLMs Inherently Reward In-Language Time-Sensitive Semantic Alignment for Low-Resource LanguagesCode0
A mathematical theory of semantic development in deep neural networksCode0
EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation MetricsCode0
Eval-GCSC: A New Metric for Evaluating ChatGPT's Performance in Chinese Spelling CorrectionCode0
Uncovering the Semantics of Wikipedia CategoriesCode0
What If: Generating Code to Answer Simulation QuestionsCode0
You can't pick your neighbors, or can you? When and how to rely on retrieval in the kNN-LMCode0
Estimating Semantic Similarity between In-Domain and Out-of-Domain SamplesCode0
Multimodal Visual Concept Learning with Weakly Supervised TechniquesCode0
Contract Discovery: Dataset and a Few-Shot Semantic Retrieval Challenge with Competitive BaselinesCode0
TLAC: Two-stage LMM Augmented CLIP for Zero-Shot ClassificationCode0
SEA: Sentence Encoder Assembly for Video Retrieval by Textual QueriesCode0
Second-Order NLP Adversarial ExamplesCode0
Import2vec - Learning Embeddings for Software LibrariesCode0
WordNet EmbeddingsCode0
Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity MatchingCode0
Are LLMs complicated ethical dilemma analyzers?Code0
A Large-Scale Sensitivity Analysis on Latent Embeddings and Dimensionality Reductions for Text SpatializationsCode0
Specialising Word Vectors for Lexical EntailmentCode0
Specialising Word Vectors for Lexical EntailmentCode0
NapSS: Paragraph-level Medical Text Simplification via Narrative Prompting and Sentence-matching SummarizationCode0
SeFNet: Bridging Tabular Datasets with Semantic Feature NetsCode0
Near-lossless Binarization of Word EmbeddingsCode0
Show:102550
← PrevPage 31 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F193.38Unverified
2SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F191.51Unverified
3SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F190.69Unverified
4BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F189.16Unverified
5BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F189.12Unverified
#ModelMetricClaimedVerifiedStatus
1BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F189.75Unverified
2SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F189.3Unverified
3SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F189.3Unverified
4BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F186.8Unverified
5BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F184.21Unverified
#ModelMetricClaimedVerifiedStatus
1Doc2VecCMSE0.31Unverified
2LSTM (Tai et al., 2015)MSE0.28Unverified
3Bidirectional LSTM (Tai et al., 2015)MSE0.27Unverified
4combine-skip (Kiros et al., 2015)MSE0.27Unverified
5Dependency Tree-LSTM (Tai et al., 2015)MSE0.25Unverified
#ModelMetricClaimedVerifiedStatus
1BioLinkBERT (large)Pearson Correlation0.94Unverified
2BioLinkBERT (base)Pearson Correlation0.93Unverified
3NCBI_BERT(base) (P+M)Pearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1MacBERT-largeMacro F185.6Unverified
#ModelMetricClaimedVerifiedStatus
1CharacterBERT (base, medical, ensemble)Pearson Correlation85.62Unverified
#ModelMetricClaimedVerifiedStatus
1NCBI_BERT(base) (P+M)Pearson Correlation0.85Unverified