SOTAVerified

Semantic Similarity

The main objective Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based approaches and corpus-based, distributional methods.

Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection

Papers

Showing 150 of 1564 papers

TitleStatusHype
Rethinking the Sample Relations for Few-Shot ClassificationCode7
AlignScore: Evaluating Factual Consistency with a Unified Alignment FunctionCode4
Automatically Interpreting Millions of Features in Large Language ModelsCode3
ERNIE: Enhanced Representation through Knowledge IntegrationCode3
InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object RecognitionCode2
Reasoning to Attend: Try to Understand How <SEG> Token WorksCode2
Squeezed Attention: Accelerating Long Context Length LLM InferenceCode2
Large Continual Instruction AssistantCode2
Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language ModelsCode2
beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender SystemsCode2
Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented GenerationCode2
Weakly-supervised Audio Separation via Bi-modal Semantic SimilarityCode2
EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic SegmentationCode2
PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical ImagingCode2
BeLLM: Backward Dependency Enhanced Large Language Model for Sentence EmbeddingsCode2
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and PredictionCode2
LinkBERT: Pretraining Language Models with Document LinksCode2
PromptBERT: Improving BERT Sentence Embeddings with PromptsCode2
Top2Vec: Distributed Representations of TopicsCode2
Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language ModelsCode1
IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response TheoryCode1
Label-Guided In-Context Learning for Named Entity RecognitionCode1
The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary GiantsCode1
Smoothie: Smoothing Diffusion on Token Embeddings for Text GenerationCode1
R2MED: A Benchmark for Reasoning-Driven Medical RetrievalCode1
One-Step Offline Distillation of Diffusion-based Models via Koopman ModelingCode1
ELITE: Embedding-Less retrieval with Iterative Text ExplorationCode1
CDF-RAG: Causal Dynamic Feedback for Adaptive Retrieval-Augmented GenerationCode1
IMPACT: A Generic Semantic Loss for Multimodal Medical Image RegistrationCode1
High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous FlightCode1
SGC-Net: Stratified Granular Comparison Network for Open-Vocabulary HOI DetectionCode1
Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information FlowCode1
MedFILIP: Medical Fine-grained Language-Image Pre-trainingCode1
DiffSim: Taming Diffusion Models for Evaluating Visual SimilarityCode1
DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image SegmentationCode1
Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training DataCode1
Vid-Morp: Video Moment Retrieval Pretraining from Unlabeled Videos in the WildCode1
RelCon: Relative Contrastive Learning for a Motion Foundation Model for Wearable DataCode1
Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement LearningCode1
CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security ResearchCode1
Emotional RAG: Enhancing Role-Playing Agents through Emotional RetrievalCode1
Demystifying and Extracting Fault-indicating Information from Logs for Failure DiagnosisCode1
DataSculpt: Crafting Data Landscapes for Long-Context LLMs through Multi-Objective PartitioningCode1
Distinguish Confusion in Legal Judgment Prediction via Revised Relation KnowledgeCode1
Unsupervised Episode Detection for Large-Scale News EventsCode1
DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous DrivingCode1
Towards Bridging the Cross-modal Semantic Gap for Multi-modal RecommendationCode1
One Prompt is not Enough: Automated Construction of a Mixture-of-Expert PromptsCode1
3D-AVS: LiDAR-based 3D Auto-Vocabulary SegmentationCode1
Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report GenerationCode1
Show:102550
← PrevPage 1 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F193.38Unverified
2SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F191.51Unverified
3SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F190.69Unverified
4BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F189.16Unverified
5BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus")F189.12Unverified
#ModelMetricClaimedVerifiedStatus
1BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F189.75Unverified
2SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F189.3Unverified
3SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F189.3Unverified
4BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F186.8Unverified
5BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus")F184.21Unverified
#ModelMetricClaimedVerifiedStatus
1Doc2VecCMSE0.31Unverified
2LSTM (Tai et al., 2015)MSE0.28Unverified
3Bidirectional LSTM (Tai et al., 2015)MSE0.27Unverified
4combine-skip (Kiros et al., 2015)MSE0.27Unverified
5Dependency Tree-LSTM (Tai et al., 2015)MSE0.25Unverified
#ModelMetricClaimedVerifiedStatus
1BioLinkBERT (large)Pearson Correlation0.94Unverified
2BioLinkBERT (base)Pearson Correlation0.93Unverified
3NCBI_BERT(base) (P+M)Pearson Correlation0.92Unverified
#ModelMetricClaimedVerifiedStatus
1MacBERT-largeMacro F185.6Unverified
#ModelMetricClaimedVerifiedStatus
1CharacterBERT (base, medical, ensemble)Pearson Correlation85.62Unverified
#ModelMetricClaimedVerifiedStatus
1NCBI_BERT(base) (P+M)Pearson Correlation0.85Unverified