SOTAVerified

Text Clustering

Grouping a set of texts in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). (Source: Adapted from Wikipedia)

Papers

Showing 125 of 123 papers

TitleStatusHype
MTEB: Massive Text Embedding BenchmarkCode4
ComStreamClust: a communicative multi-agent approach to text clustering in streaming dataCode1
Proposition-Level Clustering for Multi-Document SummarizationCode1
Large Language Models Enable Few-Shot ClusteringCode1
Dissimilarity Mixture Autoencoder for Deep ClusteringCode1
Discovering New Intents with Deep Aligned ClusteringCode1
Short Text Clustering via Convolutional Neural NetworksCode1
DeepLens: Interactive Out-of-distribution Data Detection in NLP ModelsCode1
Text Clustering as Classification with LLMsCode1
Training Effective Neural Sentence Encoders from Automatically Mined ParaphrasesCode1
Enhancement of Short Text Clustering by Iterative ClassificationCode1
EASE: Entity-Aware Contrastive Learning of Sentence EmbeddingCode1
Neural Topic Modeling with Bidirectional Adversarial TrainingCode1
Proposition-Level Clustering for Multi-Document SummarizationCode1
Supporting Clustering with Contrastive LearningCode1
Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive Optimal Transport for Short Text ClusteringCode1
ClusterLLM: Large Language Models as a Guide for Text ClusteringCode1
NeurCAM: Interpretable Neural Clustering via Additive ModelsCode0
More Discriminative Sentence Embeddings via Semantic Graph SmoothingCode0
Influence of various text embeddings on clustering performance in NLPCode0
Learn The Big Picture: Representation Learning for ClusteringCode0
On the Use of ArXiv as a DatasetCode0
Efficient Sparse Spherical k-Means for Document ClusteringCode0
Discriminative Representation learning via Attention-Enhanced Contrastive Learning for Short Text ClusteringCode0
Guiding Sentiment Analysis with Hierarchical Text Clustering: Analyzing the German X/Twitter Discourse on Face Masks in the 2020 COVID-19 PandemicCode0
Show:102550
← PrevPage 1 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ST5-XXLV-Measure43.71Unverified
2MPNetV-Measure43.69Unverified
3GTR-XXLV-Measure42.42Unverified
4MiniLM-L6V-Measure42.35Unverified
5ST5-XLV-Measure42.34Unverified
6MiniLM-L12V-Measure41.81Unverified
7ST5-LargeV-Measure41.65Unverified
8GTR-LargeV-Measure41.6Unverified
9GTR-XLV-Measure41.51Unverified
10ContrieverV-Measure41.1Unverified
#ModelMetricClaimedVerifiedStatus
1G-BATAccuracy41.25Unverified
2BATAccuracy35.66Unverified
#ModelMetricClaimedVerifiedStatus
1Vector Space ModelRelated Headlines85Unverified