Semantic Textual Similarity

Semantic textual similarity deals with determining how similar two pieces of text are in meaning. This can take the form of assigning a score from 1 to 5. Related tasks include paraphrase identification and duplicate detection.
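
As a rough illustration of the scoring setup described above, the following sketch maps a bag-of-words cosine similarity onto the 1 to 5 range. It is a toy lexical baseline, not the method of any paper listed here: word overlap is only a crude proxy for meaning, and the function names (`bag_of_words`, `cosine_similarity`, `similarity_score`) and the linear rescaling are assumptions made for this example.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text and count its alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(text_a, text_b):
    """Cosine of the angle between the two token-count vectors (0.0 to 1.0)."""
    vec_a, vec_b = bag_of_words(text_a), bag_of_words(text_b)
    dot = sum(vec_a[tok] * vec_b[tok] for tok in vec_a.keys() & vec_b.keys())
    norm_a = math.sqrt(sum(c * c for c in vec_a.values()))
    norm_b = math.sqrt(sum(c * c for c in vec_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # at least one text has no tokens
    return dot / (norm_a * norm_b)


def similarity_score(text_a, text_b):
    """Linearly rescale the cosine from [0, 1] to the 1-to-5 range (assumed mapping)."""
    return 1.0 + 4.0 * cosine_similarity(text_a, text_b)


if __name__ == "__main__":
    print(similarity_score("A man is playing a guitar.", "A man plays the guitar."))
    print(similarity_score("A man is playing a guitar.", "The stock market fell."))
```

A learned sentence encoder would replace `bag_of_words` in practice; the rescaling step is kept separate so that only the representation needs to change.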

Image source: Learning Semantic Textual Similarity from Conversations

Papers

Showing 2201–2250 of 2381 papers

Title	Date	Tasks	Status
EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation Metrics	Sep 20, 2022	CPU, GPU	Code Available
FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT	Aug 22, 2020	Multiple-choice, Question Answering	Code Available
A Resource-Light Method for Cross-Lingual Semantic Textual Similarity	Jan 19, 2018	Cross-Lingual Information Retrieval, Cross-Lingual Semantic Textual Similarity	Code Available
Calculating the similarity between words and sentences using a lexical database and corpus statistics	Feb 15, 2018	Semantic Similarity, Semantic Textual Similarity	Code Available
Self-Supervised Speech Representations are More Phonetic than Semantic	Jun 12, 2024	intent-classification, Intent Classification	Code Available
ParaICL: Towards Parallel In-Context Learning	Mar 31, 2024	In-Context Learning, Semantic Similarity	Code Available
Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning	Mar 30, 2018	Multi-Task Learning, Natural Language Inference	Code Available
Are LLMs complicated ethical dilemma analyzers?	May 12, 2025	Semantic Similarity, Semantic Textual Similarity	Code Available
A mathematical theory of semantic development in deep neural networks	Oct 23, 2018	Semantic Similarity, Semantic Textual Similarity	Code Available
The Birth of Bias: A case study on the evolution of gender bias in an English language model	Jul 21, 2022	Language Modeling, Language Modelling	Code Available
Datasets for Portuguese Legal Semantic Textual Similarity: Comparing weak supervision and an annotation process approaches	May 29, 2023	Semantic Similarity, Semantic Textual Similarity	Code Available
Soft Alignment Objectives for Robust Adaptation of Language Generation	Nov 29, 2022	Domain Adaptation, Machine Translation	Code Available
Semantic and sentiment analysis of selected Bhagavad Gita translations using BERT-based language framework	Jan 9, 2022	Deep Learning, Language Modelling	Code Available
CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs	Jun 5, 2024	Clustering, Natural Language Inference	Code Available
Transformers for Green Semantic Communication: Less Energy, More Semantics	Oct 11, 2023	Benchmarking, CPU	Code Available
Bridging the Gap between Structural and Semantic Similarity in Diverse Planning	Oct 2, 2023	Semantic Similarity, Semantic Textual Similarity	Code Available
Urban Traffic Accident Risk Prediction Revisited: Regionality, Proximity, Similarity and Sparsity	Jul 29, 2024	Semantic Similarity, Semantic Textual Similarity	Code Available
Pcc-tuning: Breaking the Contrastive Learning Ceiling in Semantic Textual Similarity	Jun 14, 2024	Contrastive Learning, Semantic Textual Similarity	Code Available
Cross-Lingual Cross-Platform Rumor Verification Pivoting on Multimedia Content	Aug 14, 2018	Semantic Similarity, Semantic Textual Similarity	Code Available
Learning Representations Specialized in Spatial Knowledge: Leveraging Language and Vision	Jan 1, 2018	Dependency Parsing, Object	Code Available
Comparison of State-of-the-Art Deep Learning APIs for Image Multi-Label Classification using Semantic Metrics	Mar 21, 2019	General Classification, Multi-Label Classification	Code Available
Creating Large-Scale Multilingual Cognate Tables	May 1, 2018	Machine Translation, Semantic Textual Similarity	Code Available
Trend-Aware Fashion Recommendation with Visual Segmentation and Semantic Similarity	Jun 9, 2025	Semantic Segmentation, Semantic Similarity	Code Available
Learning semantic sentence representations from visually grounded language without lexical knowledge	Mar 27, 2019	Grounded language learning, Learning Semantic Representations	Code Available
Urdu Word Embeddings	May 1, 2018	Semantic Textual Similarity, Word Embeddings	Code Available
WSL: Sentence Similarity Using Semantic Distance Between Words	Jun 1, 2015	Semantic Textual Similarity, Sentence	Code Available
Investigating the Frequency Distortion of Word Embeddings and Its Impact on Bias Metrics	Nov 15, 2022	Semantic Similarity, Semantic Textual Similarity	Code Available
Learning Semantic Textual Similarity from Conversations	Apr 20, 2018	Community Question Answering, Natural Language Inference	Code Available
Learning Semantic Textual Similarity via Topic-informed Discrete Latent Variables	Nov 7, 2022	Language Modeling, Language Modelling	Code Available
The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining	Oct 25, 2023	Language Modeling, Language Modelling	Code Available
Semantic flow in language networks	May 18, 2019	Community Detection, Philosophy	Code Available
Learning Text Similarity with Siamese Recurrent Networks	Aug 1, 2016	Contrastive Learning, Representation Learning	Code Available
Space Decomposition for Sentence Embedding	Jun 5, 2024	Semantic Textual Similarity, Sentence	Code Available
SpanBERT: Improving Pre-training by Representing and Predicting Spans	Jul 24, 2019	Coreference Resolution, Linguistic Acceptability	Code Available
Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network	Feb 26, 2018	Image Generation, Semantic Similarity	Code Available
TSCheater: Generating High-Quality Tibetan Adversarial Texts via Visual Similarity	Dec 3, 2024	Adversarial Robustness, Adversarial Text	Code Available
Learning to Distinguish Hypernyms and Co-Hyponyms	Aug 1, 2014	Natural Language Inference, Semantic Textual Similarity	Code Available
Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss	Jun 8, 2024	Contrastive Learning, Semantic Textual Similarity	Code Available
FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language based on Textually Represented Environments	Jul 13, 2024	Entity Linking, Natural Language Inference	Code Available
Ad Hoc Table Retrieval using Semantic Similarity	Feb 16, 2018	Retrieval, Semantic Similarity	Code Available
Bridging LLM-Generated Code and Requirements: Reverse Generation technique and SBC Metric for Developer Insights	Feb 11, 2025	Code Generation, Semantic Similarity	Code Available
Counter-fitting Word Vectors to Linguistic Constraints	Mar 2, 2016	Dialogue State Tracking, Semantic Similarity	Code Available
Fake News Detection After LLM Laundering: Measurement and Explanation	Jan 29, 2025	Fake News Detection, Misinformation	Code Available
Learning to Remove: Towards Isotropic Pre-trained BERT Embedding	Apr 12, 2021	Semantic Textual Similarity, Word Similarity	Code Available
The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations	Feb 22, 2024	Semantic Similarity, Semantic Textual Similarity	Code Available
Correlations between Word Vector Sets	Oct 7, 2019	Semantic Textual Similarity, STS	Code Available
Are ELECTRA's Sentence Embeddings Beyond Repair? The Case of Semantic Textual Similarity	Feb 20, 2024	Semantic Textual Similarity, Sentence	Code Available
VacancySBERT: the approach for representation of titles and skills for semantic similarity search in the recruitment domain	Jul 31, 2023	Language Modelling, Semantic Similarity	Code Available
Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations	May 24, 2023	Decoder, Representation Learning	Code Available
Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks	Aug 20, 2017	POS, POS Tagging	Code Available

Datasets: STS Benchmark, MTEB, MRPC, SICK, STS13, STS14, STS12, STS15, STS16, MRPC Dev, SentEval, SICK-R

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SMARTRoBERTa	Dev Pearson Correlation	92.8	—	Unverified
2	DeBERTa (large)	Accuracy	92.5	—	Unverified
3	SMART-BERT	Dev Pearson Correlation	90	—	Unverified
4	MT-DNN-SMART	Pearson Correlation	0.93	—	Unverified
5	StructBERTRoBERTa ensemble	Pearson Correlation	0.93	—	Unverified
6	Mnet-Sim	Pearson Correlation	0.93	—	Unverified
7	XLNet (single model)	Pearson Correlation	0.93	—	Unverified
8	ALBERT	Pearson Correlation	0.93	—	Unverified
9	T5-11B	Pearson Correlation	0.93	—	Unverified
10	RoBERTa	Pearson Correlation	0.92	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AnglE-UAE	Spearman Correlation	84.54	—	Unverified
2	ST5-XXL	Spearman Correlation	82.63	—	Unverified
3	ST5-Large	Spearman Correlation	81.83	—	Unverified
4	ST5-XL	Spearman Correlation	81.66	—	Unverified
5	ST5-Base	Spearman Correlation	81.14	—	Unverified
6	MPNet-multilingual	Spearman Correlation	80.73	—	Unverified
7	SGPT-5.8B-nli	Spearman Correlation	80.53	—	Unverified
8	MPNet	Spearman Correlation	80.28	—	Unverified
9	MiniLM-L12	Spearman Correlation	79.8	—	Unverified
10	SimCSE-BERT-sup	Spearman Correlation	79.12	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MT-DNN-SMART	Accuracy	93.7	—	Unverified
2	ALBERT	Accuracy	93.4	—	Unverified
3	RoBERTa (ensemble)	Accuracy	92.3	—	Unverified
4	BigBird	F1	91.5	—	Unverified
5	StructBERTRoBERTa ensemble	Accuracy	91.5	—	Unverified
6	FLOATER-large	Accuracy	91.4	—	Unverified
7	SMART	Accuracy	91.3	—	Unverified
8	RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)	Accuracy	91	—	Unverified
9	RoBERTa-large 355M + Entailment as Few-shot Learner	F1	91	—	Unverified
10	SpanBERT	Accuracy	90.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PromCSE-RoBERTa-large (0.355B)	Spearman Correlation	0.82	—	Unverified
2	PromptEOL+CSE+LLaMA-30B	Spearman Correlation	0.82	—	Unverified
3	PromptEOL+CSE+OPT-13B	Spearman Correlation	0.82	—	Unverified
4	SimCSE-RoBERTa-large	Spearman Correlation	0.82	—	Unverified
5	PromptEOL+CSE+OPT-2.7B	Spearman Correlation	0.81	—	Unverified
6	SentenceBERT	Spearman Correlation	0.75	—	Unverified
7	SRoBERTa-NLI-base	Spearman Correlation	0.74	—	Unverified
8	SRoBERTa-NLI-large	Spearman Correlation	0.74	—	Unverified
9	Dino (STS/̄🦕)	Spearman Correlation	0.74	—	Unverified
10	SBERT-NLI-large	Spearman Correlation	0.74	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AnglE-LLaMA-7B	Spearman Correlation	0.91	—	Unverified
2	AnglE-LLaMA-7B-v2	Spearman Correlation	0.91	—	Unverified
3	PromptEOL+CSE+LLaMA-30B	Spearman Correlation	0.9	—	Unverified
4	PromptEOL+CSE+OPT-13B	Spearman Correlation	0.9	—	Unverified
5	PromptEOL+CSE+OPT-2.7B	Spearman Correlation	0.9	—	Unverified
6	PromCSE-RoBERTa-large (0.355B)	Spearman Correlation	0.89	—	Unverified
7	Trans-Encoder-BERT-large-bi (unsup.)	Spearman Correlation	0.89	—	Unverified
8	Trans-Encoder-BERT-large-cross (unsup.)	Spearman Correlation	0.88	—	Unverified
9	Trans-Encoder-RoBERTa-large-cross (unsup.)	Spearman Correlation	0.88	—	Unverified
10	SimCSE-RoBERTa-large	Spearman Correlation	0.87	—	Unverified
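
Most of the leaderboards above rank models by Pearson or Spearman correlation between predicted and gold similarity scores. The sketch below shows how the two coefficients can be computed in plain Python. It is an illustration under stated assumptions, not the official scorer of any benchmark listed here: the helper names (`pearson`, `average_ranks`, `spearman`) and the sample scores are made up for this example. Some rows in the tables appear to report the coefficient multiplied by 100 rather than on the 0 to 1 scale.

```python
import math


def pearson(x, y):
    """Pearson correlation: linear agreement between two score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


def average_ranks(values):
    """1-based ranks; tied values share the mean of the positions they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    gold = [1.0, 2.0, 3.0, 4.0, 5.0]         # hypothetical human ratings
    predicted = [1.0, 4.0, 9.0, 16.0, 25.0]  # hypothetical model outputs
    print("Pearson: ", pearson(gold, predicted))
    print("Spearman:", spearman(gold, predicted))
```

Because Spearman depends only on the ordering of the scores, a model whose outputs are monotonically but not linearly related to the gold ratings can reach a perfect Spearman score while its Pearson score stays below 1.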