SOTAVerified

Semantic Similarity

The main objective of Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based approaches and corpus-based, distributional methods.

Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
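As a rough illustration of the corpus-based, distributional approach, the sketch below compares word vectors with cosine similarity, one common choice of measure. The vectors are hypothetical 4-dimensional values invented for this example, not the output of any trained model, and the helper name `cosine_similarity` is likewise an assumption of this sketch.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


# Hypothetical toy "embeddings", made up for illustration only.
TOY_VECTORS = {
    "car": [0.9, 0.8, 0.1, 0.0],
    "bus": [0.8, 0.9, 0.2, 0.1],
    "cat": [0.1, 0.0, 0.9, 0.8],
}

sim_car_bus = cosine_similarity(TOY_VECTORS["car"], TOY_VECTORS["bus"])
sim_car_cat = cosine_similarity(TOY_VECTORS["car"], TOY_VECTORS["cat"])

print(f"car vs bus: {sim_car_bus:.3f}")
print(f"car vs cat: {sim_car_cat:.3f}")
```

With these toy values, "car" scores higher against "bus" than against "cat", matching the example in the description. Real systems would obtain the vectors from a model trained on a corpus.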

Papers

Showing 101-125 of 1564 papers

| Title | Status | Hype |
| --- | --- | --- |
| CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark | Code | 1 |
| IMPACT: A Generic Semantic Loss for Multimodal Medical Image Registration | Code | 1 |
| Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language Models | Code | 1 |
| CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters | Code | 1 |
| ComStreamClust: a communicative multi-agent approach to text clustering in streaming data | Code | 1 |
| Instance Similarity Learning for Unsupervised Feature Representation | Code | 1 |
| CODER: Knowledge infused cross-lingual medical term embedding for term normalization | Code | 1 |
| CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research | Code | 1 |
| IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory | Code | 1 |
| Just Rank: Rethinking Evaluation with Word and Sentence Similarities | Code | 1 |
| Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction | Code | 1 |
| COPNER: Contrastive Learning with Prompt Guiding for Few-shot Named Entity Recognition | Code | 1 |
| Context-Aware Semantic Similarity Measurement for Unsupervised Word Sense Disambiguation | Code | 1 |
| Benchmarking large language models for biomedical natural language processing applications and recommendations | Code | 1 |
| Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding | Code | 1 |
| Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval | Code | 1 |
| Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models | Code | 1 |
| Distributional Formal Semantics | Code | 1 |
| Describing Sets of Images with Textual-PCA | Code | 1 |
| Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO | Code | 1 |
| Cross-lingual Text Classification with Heterogeneous Graph Neural Network | Code | 1 |
| C-STS: Conditional Semantic Textual Similarity | Code | 1 |
| MarkBERT: Marking Word Boundaries Improves Chinese BERT | Code | 1 |
| DECAF: Deep Extreme Classification with Label Features | Code | 1 |
| Demystifying and Extracting Fault-indicating Information from Logs for Failure Diagnosis | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 93.38 | | Unverified |
| 2 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 91.51 | | Unverified |
| 3 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 90.69 | | Unverified |
| 4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.16 | | Unverified |
| 5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.12 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.75 | | Unverified |
| 2 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | | Unverified |
| 3 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | | Unverified |
| 4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 86.8 | | Unverified |
| 5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 84.21 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Doc2VecC | MSE | 0.31 | | Unverified |
| 2 | LSTM (Tai et al., 2015) | MSE | 0.28 | | Unverified |
| 3 | Bidirectional LSTM (Tai et al., 2015) | MSE | 0.27 | | Unverified |
| 4 | combine-skip (Kiros et al., 2015) | MSE | 0.27 | | Unverified |
| 5 | Dependency Tree-LSTM (Tai et al., 2015) | MSE | 0.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BioLinkBERT (large) | Pearson Correlation | 0.94 | | Unverified |
| 2 | BioLinkBERT (base) | Pearson Correlation | 0.93 | | Unverified |
| 3 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MacBERT-large | Macro F1 | 85.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CharacterBERT (base, medical, ensemble) | Pearson Correlation | 85.62 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.85 | | Unverified |
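
The tables above report F1, MSE, and Pearson correlation, but the page does not say how each benchmark computes them. The sketch below gives the standard textbook definitions in plain Python, assuming binary labels for F1 and real-valued similarity scores for the other two. The function names and the toy gold/predicted values are invented for illustration and do not come from any of the listed benchmarks.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0.0 or var_y == 0.0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / math.sqrt(var_x * var_y)


def mse(gold, pred):
    """Mean squared error between gold and predicted scores."""
    return sum((g - p) ** 2 for g, p in zip(gold, pred)) / len(gold)


def f1_binary(gold, pred):
    """F1 for binary labels, treating 1 as the positive class."""
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical similarity scores on a 0-5 scale, made up for this example.
gold_scores = [0.0, 1.0, 2.5, 4.0, 5.0]
pred_scores = [0.5, 1.0, 2.0, 4.5, 4.5]

print(f"Pearson: {pearson(gold_scores, pred_scores):.3f}")
print(f"MSE:     {mse(gold_scores, pred_scores):.3f}")
print(f"F1:      {f1_binary([1, 1, 0, 0], [1, 0, 1, 0]):.3f}")
```

Pearson correlation lies between -1 and 1, so a claimed value such as 85.62 appears to be reported on a percentage scale, while values such as 0.94 are on the raw scale. Lower is better for MSE, whereas higher is better for F1 and Pearson correlation.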