SOTAVerified

Text Clustering

Grouping a set of texts in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). (Source: Adapted from Wikipedia)

Papers

Showing 150 of 123 papers

TitleStatusHype
CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward PassCode0
Moving Past Single Metrics: Exploring Short-Text Clustering Across Multiple Resolutions0
Advanced Text Analytics -- Graph Neural Network for Fake News Detection in Social Media0
k-LLMmeans: Scalable, Stable, and Interpretable Text Clustering via LLM-based Centroids0
Reliable Pseudo-labeling via Optimal Transport with Attention for Short Text ClusteringCode0
Discriminative Representation learning via Attention-Enhanced Contrastive Learning for Short Text ClusteringCode0
LITA: An Efficient LLM-assisted Iterative Topic Augmentation Framework0
Dial-In LLM: Human-Aligned LLM-in-the-loop Intent Clustering for Customer Service Dialogues0
Hierarchical mixtures of Unigram models for short text clustering: The role of Beta-Liouville priors0
Text Clustering as Classification with LLMsCode1
Contrastive Learning Subspace for Text Clustering0
NeurCAM: Interpretable Neural Clustering via Additive ModelsCode0
Extracting Sentence Embeddings from Pretrained Transformer Models0
An Efficient and Explanatory Image and Text Clustering System with Multimodal Autoencoder Architecture0
Guiding Sentiment Analysis with Hierarchical Text Clustering: Analyzing the German X/Twitter Discourse on Face Masks in the 2020 COVID-19 PandemicCode0
ZeroDL: Zero-shot Distribution Learning for Text Clustering via Large Language Models0
Human-interpretable clustering of short-text using large language modelsCode0
Context-Aware Clustering using Large Language Models0
Text clustering applied to data augmentation in legal contexts0
Text Clustering with Large Language Model Embeddings0
More Discriminative Sentence Embeddings via Semantic Graph SmoothingCode0
An enhanced Teaching-Learning-Based Optimization (TLBO) with Grey Wolf Optimizer (GWO) for text feature selection and clustering0
Automatic Construction of Multi-faceted User Profiles using Text Clustering and its Application to Expert Recommendation and Filtering Problems0
Incremental hierarchical text clustering methods: a review0
Federated Learning for Short Text Clustering0
Elastic deep autoencoder for text embedding clustering by an improved graph regularization0
Large Language Models Enable Few-Shot ClusteringCode1
ClusterLLM: Large Language Models as a Guide for Text ClusteringCode1
Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive Optimal Transport for Short Text ClusteringCode1
LACoS-BLOOM: Low-rank Adaptation with Contrastive objective on 8 bits Siamese-BLOOM0
Influence of various text embeddings on clustering performance in NLPCode0
CEIL: A General Classification-Enhanced Iterative Learning Framework for Text Clustering0
DeepLens: Interactive Out-of-distribution Data Detection in NLP ModelsCode1
AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models0
ClusTop: An unsupervised and integrated text clustering and topic extraction framework0
Very Large Language Model as a Unified Methodology of Text MiningCode0
MTEB: Massive Text Embedding BenchmarkCode4
Improving Deep Embedded Clustering via Learning Cluster-level Representations0
Clustering-Induced Generative Incomplete Image-Text Clustering (CIGIT-C)0
No Pattern, No Recognition: a Survey about Reproducibility and Distortion Issues of Text Clustering and Topic Modeling0
Training Effective Neural Sentence Encoders from Automatically Mined ParaphrasesCode1
Contextual Text Block Detection towards Scene Text Understanding0
Towards Responsible AI for Financial Transactions0
Clustering Similar Amendments at the Italian SenateCode0
EASE: Entity-Aware Contrastive Learning of Sentence EmbeddingCode1
Neural Topic Modeling with Deep Mutual Information Estimation0
Subspace Co-clustering with Two-Way Graph ConvolutionCode0
EASE: Entity-Aware Contrastive Learning of Sentence Embedding0
Proposition-Level Clustering for Multi-Document SummarizationCode1
Semi-Supervised Clustering with Contrastive Learning for Discovering New Intents0
Show:102550
← PrevPage 1 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ST5-XXLV-Measure43.71Unverified
2MPNetV-Measure43.69Unverified
3GTR-XXLV-Measure42.42Unverified
4MiniLM-L6V-Measure42.35Unverified
5ST5-XLV-Measure42.34Unverified
6MiniLM-L12V-Measure41.81Unverified
7ST5-LargeV-Measure41.65Unverified
8GTR-LargeV-Measure41.6Unverified
9GTR-XLV-Measure41.51Unverified
10ContrieverV-Measure41.1Unverified
#ModelMetricClaimedVerifiedStatus
1G-BATAccuracy41.25Unverified
2BATAccuracy35.66Unverified
#ModelMetricClaimedVerifiedStatus
1Vector Space ModelRelated Headlines85Unverified