SOTAVerified

Text Clustering

Text clustering groups a set of texts so that texts in the same group (a cluster) are more similar to each other, under some chosen similarity measure, than to texts in other clusters. (Source: Adapted from Wikipedia)
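As an illustration of the definition above, here is a minimal sketch in plain Python. The similarity measure (cosine similarity over word counts), the single-link grouping rule, the `threshold` value, and the helper names are all assumptions chosen for illustration; they are not taken from any paper listed on this page, and practical systems typically use learned embeddings and a dedicated clustering algorithm instead.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text and count its alphabetic tokens."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_texts(texts, threshold=0.3):
    """Hypothetical helper: single-link grouping, where any two texts whose
    similarity reaches the threshold are placed in the same cluster."""
    vectors = [bag_of_words(t) for t in texts]
    parent = list(range(len(texts)))  # union-find forest, one node per text

    def find(i):
        # Follow parent links to the cluster representative.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(texts)):
        for j in range(i + 1, len(texts)):
            if cosine(vectors[i], vectors[j]) >= threshold:
                parent[find(i)] = find(j)  # merge the two clusters

    groups = {}
    for i, text in enumerate(texts):
        groups.setdefault(find(i), []).append(text)
    return list(groups.values())


texts = [
    "the cat sat on the mat",
    "a cat lay on the mat",
    "stock prices fell sharply today",
    "stock markets fell today",
]
clusters = cluster_texts(texts)
for cluster in clusters:
    print(cluster)
```

With these four toy sentences, the two cat sentences share most of their words and the two stock sentences do too, while no word is shared across the pairs, so the sketch yields two clusters. The result depends entirely on the assumed similarity measure and threshold.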

Papers

Showing 1-50 of 123 papers

| Title | Status | Hype |
|---|---|---|
| MTEB: Massive Text Embedding Benchmark | Code | 4 |
| Dissimilarity Mixture Autoencoder for Deep Clustering | Code | 1 |
| Large Language Models Enable Few-Shot Clustering | Code | 1 |
| ComStreamClust: a communicative multi-agent approach to text clustering in streaming data | Code | 1 |
| Proposition-Level Clustering for Multi-Document Summarization | Code | 1 |
| DeepLens: Interactive Out-of-distribution Data Detection in NLP Models | Code | 1 |
| Short Text Clustering via Convolutional Neural Networks | Code | 1 |
| Discovering New Intents with Deep Aligned Clustering | Code | 1 |
| Text Clustering as Classification with LLMs | Code | 1 |
| Neural Topic Modeling with Bidirectional Adversarial Training | Code | 1 |
| ClusterLLM: Large Language Models as a Guide for Text Clustering | Code | 1 |
| Proposition-Level Clustering for Multi-Document Summarization | Code | 1 |
| Supporting Clustering with Contrastive Learning | Code | 1 |
| EASE: Entity-Aware Contrastive Learning of Sentence Embedding | Code | 1 |
| Enhancement of Short Text Clustering by Iterative Classification | Code | 1 |
| Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive Optimal Transport for Short Text Clustering | Code | 1 |
| Training Effective Neural Sentence Encoders from Automatically Mined Paraphrases | Code | 1 |
| ClusTop: An unsupervised and integrated text clustering and topic extraction framework | | 0 |
| A Method of Accounting Bigrams in Topic Models | | 0 |
| Extracting Sentence Embeddings from Pretrained Transformer Models | | 0 |
| A Graph-based Text Similarity Measure That Employs Named Entity Information | | 0 |
| An enhanced Teaching-Learning-Based Optimization (TLBO) with Grey Wolf Optimizer (GWO) for text feature selection and clustering | | 0 |
| CLTC: A Chinese-English Cross-lingual Topic Corpus | | 0 |
| AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models | | 0 |
| Clustering-Induced Generative Incomplete Image-Text Clustering (CIGIT-C) | | 0 |
| An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering | | 0 |
| Clustering tweets using Wikipedia concepts | | 0 |
| An Unsupervised Bayesian Modelling Approach for Storyline Detection on News Articles | | 0 |
| Exploiting Discourse Relations between Sentences for Text Clustering | | 0 |
| Federated Learning for Short Text Clustering | | 0 |
| Character-Aware Neural Networks for Arabic Named Entity Recognition for Social Media | | 0 |
| An end-to-end Neural Network Framework for Text Clustering | | 0 |
| Elastic deep autoencoder for text embedding clustering by an improved graph regularization | | 0 |
| An Empirical Survey of Unsupervised Text Representation Methods on Twitter Data | | 0 |
| Dial-In LLM: Human-Aligned LLM-in-the-loop Intent Clustering for Customer Service Dialogues | | 0 |
| Automatic Construction of Multi-faceted User Profiles using Text Clustering and its Application to Expert Recommendation and Filtering Problems | | 0 |
| Advanced Text Analytics -- Graph Neural Network for Fake News Detection in Social Media | | 0 |
| Deep Clustering with Measure Propagation | | 0 |
| Attentive Representation Learning with Adversarial Training for Short Text Clustering | | 0 |
| DISCO: A System Leveraging Semantic Search in Document Review | | 0 |
| A Weighting Scheme for Open Information Extraction | | 0 |
| A Template Based Hybrid Model for Chinese Personal Name Disambiguation | | 0 |
| CEIL: A General Classification-Enhanced Iterative Learning Framework for Text Clustering | | 0 |
| Contrastive Learning Subspace for Text Clustering | | 0 |
| Domain Based Punjabi Text Document Clustering | | 0 |
| EASE: Entity-Aware Contrastive Learning of Sentence Embedding | | 0 |
| An Efficient and Explanatory Image and Text Clustering System with Multimodal Autoencoder Architecture | | 0 |
| Effects of Creativity and Cluster Tightness on Short Text Clustering Performance | | 0 |
| Cluster Analysis of Online Mental Health Discourse using Topic-Infused Deep Contextualized Representations | | 0 |
| A Comparative Study of Conversion Aided Methods for WordNet Sentence Textual Similarity | | 0 |

Benchmark Results

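| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ST5-XXL | V-Measure | 43.71 | | Unverified |
| 2 | MPNet | V-Measure | 43.69 | | Unverified |
| 3 | GTR-XXL | V-Measure | 42.42 | | Unverified |
| 4 | MiniLM-L6 | V-Measure | 42.35 | | Unverified |
| 5 | ST5-XL | V-Measure | 42.34 | | Unverified |
| 6 | MiniLM-L12 | V-Measure | 41.81 | | Unverified |
| 7 | ST5-Large | V-Measure | 41.65 | | Unverified |
| 8 | GTR-Large | V-Measure | 41.6 | | Unverified |
| 9 | GTR-XL | V-Measure | 41.51 | | Unverified |
| 10 | Contriever | V-Measure | 41.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | G-BAT | Accuracy | 41.25 | | Unverified |
| 2 | BAT | Accuracy | 35.66 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Vector Space Model | Related Headlines | 85 | | Unverified |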