SOTAVerified

Semantic Similarity

The main objective of Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based methods and corpus-based distributional methods.

Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
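As a rough illustration of the corpus-based, distributional approach, the sketch below builds sentence-level co-occurrence vectors from a tiny corpus and compares them with cosine similarity. The corpus, the function names, and the choice of raw co-occurrence counts are assumptions made for this example only; they are not taken from the source paper, and practical systems typically use learned embeddings and much larger corpora.

```python
import math
from collections import Counter

# Toy corpus invented for illustration only (an assumption, not real data).
CORPUS = [
    "the car drives on the road",
    "the bus drives on the road",
    "the cat sleeps on the sofa",
    "a car carries passengers",
    "a bus carries many passengers",
    "a cat chases mice",
]


def cooccurrence_vector(target, sentences):
    """Count the words that share a sentence with `target`."""
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.split()
        if target in tokens:
            counts.update(t for t in tokens if t != target)
    return counts


def cosine_similarity(u, v):
    """Cosine of the angle between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


car = cooccurrence_vector("car", CORPUS)
bus = cooccurrence_vector("bus", CORPUS)
cat = cooccurrence_vector("cat", CORPUS)

car_bus = cosine_similarity(car, bus)
car_cat = cosine_similarity(car, cat)
print(f"car vs bus: {car_bus:.3f}")
print(f"car vs cat: {car_cat:.3f}")
```

On this toy corpus, “car” scores higher against “bus” than against “cat” because the two vehicles share more context words. Function words such as “the” still inflate every score, which is one reason real pipelines tend to reweight counts or filter stop words.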

Papers

Showing 201-250 of 1564 papers

| Title | Status | Hype |
| --- | --- | --- |
| EF-LLM: Energy Forecasting LLM with AI-assisted Automation, Enhanced Sparse Prediction, Hallucination Detection | | 0 |
| Phonology-Guided Speech-to-Speech Translation for African Languages | | 0 |
| BIS: NL2SQL Service Evaluation Benchmark for Business Intelligence Scenarios | Code | 0 |
| Decoupling Semantic Similarity from Spatial Alignment for Neural Networks | Code | 0 |
| Emotional RAG: Enhancing Role-Playing Agents through Emotional Retrieval | Code | 1 |
| Conjuring Semantic Similarity | | 0 |
| Optimizing Retrieval-Augmented Generation with Elasticsearch for Enhanced Question-Answering Systems | | 0 |
| Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model | | 0 |
| Automatically Interpreting Millions of Features in Large Language Models | Code | 3 |
| Boosting Imperceptibility of Stable Diffusion-based Adversarial Examples Generation with Momentum | Code | 0 |
| SemSim: Revisiting Weak-to-Strong Consistency from a Semantic Similarity Perspective for Semi-supervised Medical Image Segmentation | | 0 |
| PromptExp: Multi-granularity Prompt Explanation of Large Language Models | | 0 |
| Back-of-the-Book Index Automation for Arabic Documents | | 0 |
| Improving Legal Entity Recognition Using a Hybrid Transformer Model and Semantic Filtering Approach | | 0 |
| Large Continual Instruction Assistant | Code | 2 |
| VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks | | 0 |
| Graded Suspiciousness of Adversarial Texts to Human | | 0 |
| Metadata-based Data Exploration with Retrieval-Augmented Generation for Large Language Models | | 0 |
| Evaluating Deduplication Techniques for Economic Research Paper Titles with a Focus on Semantic Similarity using NLP and LLMs | | 0 |
| UniAdapt: A Universal Adapter for Knowledge Calibration | | 0 |
| Semantic-Driven Topic Modeling Using Transformer-Based Embeddings and Clustering Algorithms | | 0 |
| From Unimodal to Multimodal: Scaling up Projectors to Align Modalities | Code | 0 |
| T3: A Novel Zero-shot Transfer Learning Framework Iteratively Training on an Assistant Task for a Target Task | | 0 |
| Exploring Semantic Clustering in Deep Reinforcement Learning for Video Games | | 0 |
| Unveiling Ontological Commitment in Multi-Modal Foundation Models | | 0 |
| Brotherhood at WMT 2024: Leveraging LLM-Generated Contextual Conversations for Cross-Lingual Image Captioning | | 0 |
| Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment | | 0 |
| Towards Automated Patent Workflows: AI-Orchestrated Multi-Agent Framework for Intellectual Property Management and Analysis | | 0 |
| Demystifying and Extracting Fault-indicating Information from Logs for Failure Diagnosis | Code | 1 |
| Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language Models | Code | 2 |
| Reasoning Graph Enhanced Exemplars Retrieval for In-Context Learning | Code | 0 |
| Prompt Obfuscation for Large Language Models | | 0 |
| beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems | Code | 2 |
| Retro-li: Small-Scale Retrieval Augmented Generation Supporting Noisy Similarity Searches and Domain Shift Generalization | Code | 0 |
| An Unsupervised Dialogue Topic Segmentation Model Based on Utterance Rewriting | | 0 |
| Ethereum Fraud Detection via Joint Transaction Language Model and Graph Representation Learning | | 0 |
| Self-Judge: Selective Instruction Following with Alignment Self-Evaluation | Code | 0 |
| DataSculpt: Crafting Data Landscapes for Long-Context LLMs through Multi-Objective Partitioning | Code | 1 |
| LanguaShrink: Reducing Token Overhead with Psycholinguistics | | 0 |
| GMFL-Net: A Global Multi-geometric Feature Learning Network for Repetitive Action Counting | Code | 0 |
| FlowRetrieval: Flow-Guided Data Retrieval for Few-Shot Imitation Learning | Code | 0 |
| Contrastive Learning Subspace for Text Clustering | | 0 |
| HTS-Attack: Heuristic Token Search for Jailbreaking Text-to-Image Models | | 0 |
| GSTran: Joint Geometric and Semantic Coherence for Point Cloud Segmentation | Code | 0 |
| Distinguish Confusion in Legal Judgment Prediction via Revised Relation Knowledge | Code | 1 |
| KGV: Integrating Large Language Models with Knowledge Graphs for Cyber Threat Intelligence Credibility Assessment | | 0 |
| Unsupervised Episode Detection for Large-Scale News Events | Code | 1 |
| reCSE: Portable Reshaping Features for Sentence Embedding in Self-supervised Contrastive Learning | Code | 0 |
| Semantics or spelling? Probing contextual word embeddings with orthographic noise | Code | 0 |
| A Semi-supervised Multi-channel Graph Convolutional Network for Query Classification in E-commerce | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 93.38 | | Unverified |
| 2 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 91.51 | | Unverified |
| 3 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 90.69 | | Unverified |
| 4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.16 | | Unverified |
| 5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.12 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.75 | | Unverified |
| 2 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | | Unverified |
| 3 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | | Unverified |
| 4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 86.8 | | Unverified |
| 5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 84.21 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Doc2VecC | MSE | 0.31 | | Unverified |
| 2 | LSTM (Tai et al., 2015) | MSE | 0.28 | | Unverified |
| 3 | Bidirectional LSTM (Tai et al., 2015) | MSE | 0.27 | | Unverified |
| 4 | combine-skip (Kiros et al., 2015) | MSE | 0.27 | | Unverified |
| 5 | Dependency Tree-LSTM (Tai et al., 2015) | MSE | 0.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BioLinkBERT (large) | Pearson Correlation | 0.94 | | Unverified |
| 2 | BioLinkBERT (base) | Pearson Correlation | 0.93 | | Unverified |
| 3 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MacBERT-large | Macro F1 | 85.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CharacterBERT (base, medical, ensemble) | Pearson Correlation | 85.62 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.85 | | Unverified |