SOTAVerified

Text Clustering

Grouping a set of texts in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). (Source: Adapted from Wikipedia)

Papers

Showing 5175 of 123 papers

TitleStatusHype
LMSim : Computing Domain-specific Semantic Word Similarities Using a Language Modeling Approach0
Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling0
Moving Past Single Metrics: Exploring Short-Text Clustering Across Multiple Resolutions0
Accounting ngrams and multi-word terms can improve topic models0
Mutual Clustering on Comparative Texts via Heterogeneous Information Networks0
Neural Text Classification by Jointly Learning to Cluster and Align0
Neural Topic Modeling with Deep Mutual Information Estimation0
News clustering approach based on discourse text structure0
No Pattern, No Recognition: a Survey about Reproducibility and Distortion Issues of Text Clustering and Topic Modeling0
Notes on using Determinantal Point Processes for Clustering with Applications to Text Clustering0
Post-Retrieval Clustering Using Third-Order Similarity Measures0
QurSim: A corpus for evaluation of relatedness in short texts0
Representation Learning for Short Text Clustering0
Robust Multi-Relational Clustering via _1-Norm Symmetric Nonnegative Matrix Factorization0
Self-supervised Document Clustering Based on BERT with Data Augment0
Semi-supervised Clustering for Short Text via Deep Representation Learning0
Semi-Supervised Clustering with Contrastive Learning for Discovering New Intents0
Sequential Embedding Induced Text Clustering, a Non-parametric Bayesian Approach0
Short Text Clustering with Transformers0
Subgroup Detection in Ideological Discussions0
Text Classification and Clustering with Annealing Soft Nearest Neighbor Loss0
Text Classification for Azerbaijani Language Using Machine Learning and Embedding0
Text clustering applied to data augmentation in legal contexts0
Text Clustering with Large Language Model Embeddings0
Text Mining using Nonnegative Matrix Factorization and Latent Semantic Analysis0
Show:102550
← PrevPage 3 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ST5-XXLV-Measure43.71Unverified
2MPNetV-Measure43.69Unverified
3GTR-XXLV-Measure42.42Unverified
4MiniLM-L6V-Measure42.35Unverified
5ST5-XLV-Measure42.34Unverified
6MiniLM-L12V-Measure41.81Unverified
7ST5-LargeV-Measure41.65Unverified
8GTR-LargeV-Measure41.6Unverified
9GTR-XLV-Measure41.51Unverified
10ContrieverV-Measure41.1Unverified
#ModelMetricClaimedVerifiedStatus
1G-BATAccuracy41.25Unverified
2BATAccuracy35.66Unverified
#ModelMetricClaimedVerifiedStatus
1Vector Space ModelRelated Headlines85Unverified