SOTAVerified

Semantic Similarity

The main objective of Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based approaches and corpus-based, distributional methods.

Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
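As a rough illustration of the corpus-based, distributional idea described above, the sketch below scores word pairs by the cosine similarity of their context vectors. The vectors, the context labels, and the helper names (`TOY_VECTORS`, `similarity`) are made up for this example and do not come from any real corpus or from the cited paper; real systems would use learned embeddings or large co-occurrence statistics.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


# Hypothetical co-occurrence counts (illustrative only, not measured data).
# Assumed context dimensions: ["drive", "road", "passenger", "pet", "fur"]
TOY_VECTORS = {
    "car": [8, 7, 3, 0, 0],
    "bus": [6, 8, 9, 0, 0],
    "cat": [0, 1, 0, 9, 7],
}


def similarity(word_a, word_b, vectors=TOY_VECTORS):
    """Score two words by the cosine similarity of their context vectors."""
    return cosine_similarity(vectors[word_a], vectors[word_b])


if __name__ == "__main__":
    print("car vs bus:", round(similarity("car", "bus"), 3))
    print("car vs cat:", round(similarity("car", "cat"), 3))
```

With these toy counts, "car" and "bus" share most of their contexts and so score higher than "car" and "cat", mirroring the example in the task description. A knowledge-based approach would instead derive the score from a resource such as a taxonomy, which this sketch does not attempt.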

Papers

Showing 1–50 of 1564 papers

| Title | Status | Hype |
|---|---|---|
| SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts | | 0 |
| SARA: Selective and Adaptive Retrieval-augmented Generation with Context Compression | | 0 |
| FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection | Code | 0 |
| LineRetriever: Planning-Aware Observation Reduction for Web Agents | | 0 |
| DALR: Dual-level Alignment Learning for Multimodal Sentence Representation Learning | | 0 |
| Enhancing Automatic Term Extraction with Large Language Models via Syntactic Retrieval | | 0 |
| Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models | | 0 |
| Intrinsic vs. Extrinsic Evaluation of Czech Sentence Embeddings: Semantic Relevance Doesn't Help with MT Evaluation | | 0 |
| PrivacyXray: Detecting Privacy Breaches in LLMs through Semantic Consistency and Probability Certainty | | 0 |
| Semantic similarity estimation for domain specific data using BERT and other techniques | | 0 |
| ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge | Code | 0 |
| InsertRank: LLMs can reason over BM25 scores to Improve Listwise Reranking | | 0 |
| Similarity = Value? Consultation Value Assessment and Alignment for Personalized Search | | 0 |
| GrFormer: A Novel Transformer on Grassmann Manifold for Infrared and Visible Image Fusion | | 0 |
| FindMeIfYouCan: Bringing Open Set metrics to near, far and farther Out-of-Distribution Object Detection | | 0 |
| Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models | Code | 1 |
| Hierarchical Scoring with 3D Gaussian Splatting for Instance Image-Goal Navigation | | 0 |
| Trend-Aware Fashion Recommendation with Visual Segmentation and Semantic Similarity | Code | 0 |
| Statistical Hypothesis Testing for Auditing Robustness in Language Models | | 0 |
| Conservative Bias in Large Language Models: Measuring Relation Predictions | | 0 |
| Denoising Programming Knowledge Tracing with a Code Graph-based Tuning Adaptor | | 0 |
| KNN-Defense: Defense against 3D Adversarial Point Clouds using Nearest-Neighbor Search | Code | 0 |
| Plugging Schema Graph into Multi-Table QA: A Human-Guided Framework for Reducing LLM Reliance | | 0 |
| MCP-Zero: Active Tool Discovery for Autonomous LLM Agents | | 0 |
| IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory | Code | 1 |
| VUDG: A Dataset for Video Understanding Domain Generalization | | 0 |
| Category-aware EEG image generation based on wavelet transform and contrast semantic loss | Code | 0 |
| GATE: General Arabic Text Embedding for Enhanced Semantic Textual Similarity with Matryoshka Representation Learning and Hybrid Loss Training | | 0 |
| PRISM: A Framework for Producing Interpretable Political Bias Embeddings with Political-Aware Cross-Encoder | Code | 0 |
| Label-Guided In-Context Learning for Named Entity Recognition | Code | 1 |
| Document Valuation in LLM Summaries: A Cluster Shapley Approach | | 0 |
| Improving Brain-to-Image Reconstruction via Fine-Grained Text Bridging | | 0 |
| LLMs as Better Recommenders with Natural Language Collaborative Signals: A Self-Assessing Retrieval Approach | | 0 |
| Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs | Code | 0 |
| The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants | Code | 1 |
| Hypercube-RAG: Hypercube-Based Retrieval-Augmented Generation for In-domain Scientific Question-Answering | Code | 0 |
| CrosGrpsABS: Cross-Attention over Syntactic and Semantic Graphs for Aspect-Based Sentiment Analysis in a Low-Resource Language | | 0 |
| Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation | | 0 |
| Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation | Code | 1 |
| LLMs Are Not Scorers: Rethinking MT Evaluation with Generation-Based Methods | Code | 0 |
| EquivPruner: Boosting Efficiency and Quality in LLM-Based Search via Action Pruning | Code | 0 |
| Omni TM-AE: A Scalable and Interpretable Embedding Model Using the Full Tsetlin Machine State Space | | 0 |
| Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected Vulnerability | Code | 0 |
| Automated Feedback Loops to Protect Text Simplification with Generative AI from Information Loss | | 0 |
| EcomScriptBench: A Multi-task Benchmark for E-commerce Script Planning via Step-wise Intention-Driven Product Association | | 0 |
| Language Specific Knowledge: Do Models Know Better in X than in English? | | 0 |
| InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition | Code | 2 |
| Leveraging the Powerful Attention of a Pre-trained Diffusion Model for Exemplar-based Image Colorization | Code | 0 |
| MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations | Code | 0 |
| R2MED: A Benchmark for Reasoning-Driven Medical Retrieval | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 93.38 | | Unverified |
| 2 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 91.51 | | Unverified |
| 3 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 90.69 | | Unverified |
| 4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.16 | | Unverified |
| 5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.12 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.75 | | Unverified |
| 2 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | | Unverified |
| 3 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | | Unverified |
| 4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 86.8 | | Unverified |
| 5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 84.21 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Doc2VecC | MSE | 0.31 | | Unverified |
| 2 | LSTM (Tai et al., 2015) | MSE | 0.28 | | Unverified |
| 3 | Bidirectional LSTM (Tai et al., 2015) | MSE | 0.27 | | Unverified |
| 4 | combine-skip (Kiros et al., 2015) | MSE | 0.27 | | Unverified |
| 5 | Dependency Tree-LSTM (Tai et al., 2015) | MSE | 0.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BioLinkBERT (large) | Pearson Correlation | 0.94 | | Unverified |
| 2 | BioLinkBERT (base) | Pearson Correlation | 0.93 | | Unverified |
| 3 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MacBERT-large | Macro F1 | 85.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CharacterBERT (base, medical, ensemble) | Pearson Correlation | 85.62 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.85 | | Unverified |
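
Two of the metrics reported above, Pearson correlation and MSE, compare a model's predicted similarity scores against human-annotated gold scores. The sketch below shows one plain-Python way to compute both. The function names and the example scores are hypothetical and are not taken from any listed benchmark. The listed values appear to use both a 0 to 1 scale and a percentage scale for Pearson correlation, so the scale should be checked before comparing entries.

```python
import math


def pearson_correlation(predicted, gold):
    """Pearson's r between two equal-length sequences of scores."""
    n = len(predicted)
    if n != len(gold) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_p = sum(predicted) / n
    mean_g = sum(gold) / n
    cov = sum((p - mean_p) * (g - mean_g) for p, g in zip(predicted, gold))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_g = sum((g - mean_g) ** 2 for g in gold)
    if var_p == 0.0 or var_g == 0.0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_p * var_g)


def mean_squared_error(predicted, gold):
    """Average squared difference between predicted and gold scores."""
    if len(predicted) != len(gold) or not predicted:
        raise ValueError("need two non-empty sequences of equal length")
    return sum((p - g) ** 2 for p, g in zip(predicted, gold)) / len(predicted)


if __name__ == "__main__":
    # Hypothetical similarity scores on a 1-5 scale (illustrative only).
    gold_scores = [4.5, 1.0, 3.2, 2.8, 5.0]
    model_scores = [4.1, 1.5, 3.0, 3.3, 4.6]
    print("Pearson r:", pearson_correlation(model_scores, gold_scores))
    print("MSE:", mean_squared_error(model_scores, gold_scores))
```

Higher is better for Pearson correlation, while lower is better for MSE, which matters when reading the rank column of an MSE table.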