SOTAVerified

Semantic Similarity

The main objective of Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based approaches and corpus-based, distributional methods.

Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
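
As a rough illustration of the corpus-based, distributional approach mentioned above, the sketch below scores word pairs by the cosine similarity of their vectors. The 4-dimensional vectors in `TOY_EMBEDDINGS` and the helper name `similarity` are assumptions made up for this example; real systems would use vectors learned from a corpus, and cosine similarity is only one of several possible scoring functions.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


# Hypothetical embeddings, invented for illustration only (not learned from data).
TOY_EMBEDDINGS = {
    "car": [0.9, 0.8, 0.1, 0.0],
    "bus": [0.8, 0.9, 0.2, 0.1],
    "cat": [0.1, 0.0, 0.9, 0.8],
}


def similarity(word_a, word_b, embeddings=TOY_EMBEDDINGS):
    """Look up both words and score them with cosine similarity."""
    return cosine_similarity(embeddings[word_a], embeddings[word_b])


if __name__ == "__main__":
    print(f"car vs bus: {similarity('car', 'bus'):.3f}")
    print(f"car vs cat: {similarity('car', 'cat'):.3f}")
```

With these toy vectors, “car” scores higher against “bus” than against “cat”, matching the example in the description. A knowledge-based method would instead derive the score from a resource such as a taxonomy or ontology.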

Papers

Showing 251-300 of 1564 papers

Title | Status | Hype
Document Valuation in LLM Summaries: A Cluster Shapley Approach | | 0
LLMs as Better Recommenders with Natural Language Collaborative Signals: A Self-Assessing Retrieval Approach | | 0
Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs | Code | 0
Hypercube-RAG: Hypercube-Based Retrieval-Augmented Generation for In-domain Scientific Question-Answering | Code | 0
CrosGrpsABS: Cross-Attention over Syntactic and Semantic Graphs for Aspect-Based Sentiment Analysis in a Low-Resource Language | | 0
Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation | | 0
Omni TM-AE: A Scalable and Interpretable Embedding Model Using the Full Tsetlin Machine State Space | | 0
LLMs Are Not Scorers: Rethinking MT Evaluation with Generation-Based Methods | Code | 0
EquivPruner: Boosting Efficiency and Quality in LLM-Based Search via Action Pruning | Code | 0
Automated Feedback Loops to Protect Text Simplification with Generative AI from Information Loss | | 0
Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected Vulnerability | Code | 0
Language Specific Knowledge: Do Models Know Better in X than in English? | | 0
EcomScriptBench: A Multi-task Benchmark for E-commerce Script Planning via Step-wise Intention-Driven Product Association | | 0
Leveraging the Powerful Attention of a Pre-trained Diffusion Model for Exemplar-based Image Colorization | Code | 0
MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations | Code | 0
Efficient Heuristics Generation for Solving Combinatorial Optimization Problems Using Large Language Models | Code | 0
Community Search in Time-dependent Road-social Attributed Networks | | 0
Fine-Grained ECG-Text Contrastive Learning via Waveform Understanding Enhancement | | 0
Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models | Code | 0
Evaluations at Work: Measuring the Capabilities of GenAI in Use | | 0
AI-enhanced semantic feature norms for 786 concepts | | 0
FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation | | 0
Towards Automated Situation Awareness: A RAG-Based Framework for Peacebuilding Reports | | 0
A 2D Semantic-Aware Position Encoding for Vision Transformers | | 0
TrialMatchAI: An End-to-End AI-powered Clinical Trial Recommendation System to Streamline Patient-to-Trial Matching | | 0
Concept-Level Explainability for Auditing & Steering LLM Responses | Code | 0
Hypernym Mercury: Token Optimization Through Semantic Field Constriction And Reconstruction From Hypernyms. A New Text Compression Method | | 0
Are LLMs complicated ethical dilemma analyzers? | Code | 0
Jailbreaking the Text-to-Video Generative Models | | 0
Sparse Attention Remapping with Clustering for Efficient LLM Decoding on PIM | | 0
Estimating Quality in Therapeutic Conversations: A Multi-Dimensional Natural Language Processing Framework | | 0
Stealthy LLM-Driven Data Poisoning Attacks Against Embedding-Based Retrieval-Augmented Recommender Systems | | 0
R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training | | 0
Homa at SemEval-2025 Task 5: Aligning Librarian Records with OntoAligner for Subject Tagging | | 0
20min-XD: A Comparable Corpus of Swiss News Articles | Code | 0
Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction | | 0
ReCellTy: Domain-specific knowledge graph retrieval-augmented LLMs workflow for single-cell annotation | | 0
Stay Hungry, Stay Foolish: On the Extended Reading Articles Generation with LLMs | | 0
Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization | | 0
Exploring Language Patterns of Prompts in Text-to-Image Generation and Their Impact on Visual Diversity | | 0
Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety | Code | 0
Semantic Similarity-Informed Bayesian Borrowing for Quantitative Signal Detection of Adverse Events | | 0
Self-Controlled Dynamic Expansion Model for Continual Learning | | 0
HD-RAG: Retrieval-Augmented Generation for Hybrid Documents Containing Text and Hierarchical Tables | | 0
Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions | | 0
Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety | | 0
Balancing Complexity and Informativeness in LLM-Based Clustering: Finding the Goldilocks Zone | | 0
Horizon Scans can be accelerated using novel information retrieval and artificial intelligence tools | | 0
ProtoGuard-guided PROPEL: Class-Aware Prototype Enhancement and Progressive Labeling for Incremental 3D Point Cloud Segmentation | | 0
Context-Aware Human Behavior Prediction Using Multimodal Large Language Models: Challenges and Insights | | 0
Page 6 of 32

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 93.38 | | Unverified
2 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 91.51 | | Unverified
3 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 90.69 | | Unverified
4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.16 | | Unverified
5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, expanded corpus") | F1 | 89.12 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BioBERT (pre-trained on PubMed abstracts + PMC, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.75 | | Unverified
2 | SciBERT cased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | | Unverified
3 | SciBERT uncased (SciVocab, fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 89.3 | | Unverified
4 | BERT-Base uncased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 86.8 | | Unverified
5 | BERT-Base cased (fine-tuned on "Annotated corpus for semantic similarity of clinical trial outcomes, original corpus") | F1 | 84.21 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Doc2VecC | MSE | 0.31 | | Unverified
2 | LSTM (Tai et al., 2015) | MSE | 0.28 | | Unverified
3 | Bidirectional LSTM (Tai et al., 2015) | MSE | 0.27 | | Unverified
4 | combine-skip (Kiros et al., 2015) | MSE | 0.27 | | Unverified
5 | Dependency Tree-LSTM (Tai et al., 2015) | MSE | 0.25 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BioLinkBERT (large) | Pearson Correlation | 0.94 | | Unverified
2 | BioLinkBERT (base) | Pearson Correlation | 0.93 | | Unverified
3 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.92 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MacBERT-large | Macro F1 | 85.6 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CharacterBERT (base, medical, ensemble) | Pearson Correlation | 85.62 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | NCBI_BERT(base) (P+M) | Pearson Correlation | 0.85 | | Unverified