SOTAVerified

Text Clustering

Grouping a set of texts in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). (Source: Adapted from Wikipedia)

Papers

Showing 3140 of 123 papers

TitleStatusHype
Influence of various text embeddings on clustering performance in NLPCode0
CEIL: A General Classification-Enhanced Iterative Learning Framework for Text Clustering0
DeepLens: Interactive Out-of-distribution Data Detection in NLP ModelsCode1
AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models0
ClusTop: An unsupervised and integrated text clustering and topic extraction framework0
Very Large Language Model as a Unified Methodology of Text MiningCode0
MTEB: Massive Text Embedding BenchmarkCode4
Improving Deep Embedded Clustering via Learning Cluster-level Representations0
Clustering-Induced Generative Incomplete Image-Text Clustering (CIGIT-C)0
No Pattern, No Recognition: a Survey about Reproducibility and Distortion Issues of Text Clustering and Topic Modeling0
Show:102550
← PrevPage 4 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ST5-XXLV-Measure43.71Unverified
2MPNetV-Measure43.69Unverified
3GTR-XXLV-Measure42.42Unverified
4MiniLM-L6V-Measure42.35Unverified
5ST5-XLV-Measure42.34Unverified
6MiniLM-L12V-Measure41.81Unverified
7ST5-LargeV-Measure41.65Unverified
8GTR-LargeV-Measure41.6Unverified
9GTR-XLV-Measure41.51Unverified
10ContrieverV-Measure41.1Unverified
#ModelMetricClaimedVerifiedStatus
1G-BATAccuracy41.25Unverified
2BATAccuracy35.66Unverified
#ModelMetricClaimedVerifiedStatus
1Vector Space ModelRelated Headlines85Unverified