SOTAVerified

Text Clustering

Grouping a set of texts in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). (Source: Adapted from Wikipedia)

Papers

Showing 76100 of 123 papers

TitleStatusHype
Unification of HDP and LDA Models for Optimal Topic Clustering of Subject Specific Question Banks0
Unsupervised Feature-Rich Clustering0
Unsupervised Fine-tuning for Text Clustering0
Which Clustering Do You Want? Inducing Your Ideal Clustering with Minimal Feedback0
ZeroDL: Zero-shot Distribution Learning for Text Clustering via Large Language Models0
Hybrid Clustering based on Content and Connection Structure using Joint Nonnegative Matrix Factorization0
Hybrid Multisource Feature Fusion for the Text Clustering0
Improving Deep Embedded Clustering via Learning Cluster-level Representations0
Incremental hierarchical text clustering methods: a review0
k-LLMmeans: Scalable, Stable, and Interpretable Text Clustering via LLM-based Centroids0
LACoS-BLOOM: Low-rank Adaptation with Contrastive objective on 8 bits Siamese-BLOOM0
Learning Thematic Similarity Metric from Article Sections Using Triplet Networks0
LITA: An Efficient LLM-assisted Iterative Topic Augmentation Framework0
LMSim : Computing Domain-specific Semantic Word Similarities Using a Language Modeling Approach0
Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling0
Moving Past Single Metrics: Exploring Short-Text Clustering Across Multiple Resolutions0
Accounting ngrams and multi-word terms can improve topic models0
Mutual Clustering on Comparative Texts via Heterogeneous Information Networks0
Neural Text Classification by Jointly Learning to Cluster and Align0
Neural Topic Modeling with Deep Mutual Information Estimation0
News clustering approach based on discourse text structure0
No Pattern, No Recognition: a Survey about Reproducibility and Distortion Issues of Text Clustering and Topic Modeling0
Notes on using Determinantal Point Processes for Clustering with Applications to Text Clustering0
Post-Retrieval Clustering Using Third-Order Similarity Measures0
QurSim: A corpus for evaluation of relatedness in short texts0
Show:102550
← PrevPage 4 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ST5-XXLV-Measure43.71Unverified
2MPNetV-Measure43.69Unverified
3GTR-XXLV-Measure42.42Unverified
4MiniLM-L6V-Measure42.35Unverified
5ST5-XLV-Measure42.34Unverified
6MiniLM-L12V-Measure41.81Unverified
7ST5-LargeV-Measure41.65Unverified
8GTR-LargeV-Measure41.6Unverified
9GTR-XLV-Measure41.51Unverified
10ContrieverV-Measure41.1Unverified
#ModelMetricClaimedVerifiedStatus
1G-BATAccuracy41.25Unverified
2BATAccuracy35.66Unverified
#ModelMetricClaimedVerifiedStatus
1Vector Space ModelRelated Headlines85Unverified