SOTAVerified

Text Clustering

Grouping a set of texts in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). (Source: Adapted from Wikipedia)

Papers

Showing 76100 of 123 papers

TitleStatusHype
Thematic Cohesion: measuring terms discriminatory power toward themes0
Topic Models: Accounting Component Structure of Bigrams0
Towards a better understanding of Burrows's Delta in literary authorship attribution0
Towards Responsible AI for Financial Transactions0
TSDPMM: Incorporating Prior Topic Knowledge into Dirichlet Process Mixture Models for Text Clustering0
Unification of HDP and LDA Models for Optimal Topic Clustering of Subject Specific Question Banks0
Unsupervised Feature-Rich Clustering0
Unsupervised Fine-tuning for Text Clustering0
Which Clustering Do You Want? Inducing Your Ideal Clustering with Minimal Feedback0
ZeroDL: Zero-shot Distribution Learning for Text Clustering via Large Language Models0
Multilingual Short Text Responses Clustering for Mobile Educational Activities: a Preliminary Exploration0
A Comparative Study of Conversion Aided Methods for WordNet Sentence Textual Similarity0
AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models0
A Dirichlet Multinomial Mixture Model-based Approach for Short Text Clustering0
Advanced Text Analytics -- Graph Neural Network for Fake News Detection in Social Media0
A Graph-based Text Similarity Measure That Employs Named Entity Information0
A Method of Accounting Bigrams in Topic Models0
Amharic Text Clustering Using Encyclopedic Knowledge with Neural Word Embedding0
An Efficient and Explanatory Image and Text Clustering System with Multimodal Autoencoder Architecture0
An Empirical Survey of Unsupervised Text Representation Methods on Twitter Data0
An end-to-end Neural Network Framework for Text Clustering0
An enhanced Teaching-Learning-Based Optimization (TLBO) with Grey Wolf Optimizer (GWO) for text feature selection and clustering0
An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering0
An Unsupervised Bayesian Modelling Approach for Storyline Detection on News Articles0
A Symmetric Rank-one Quasi Newton Method for Non-negative Matrix Factorization0
Show:102550
← PrevPage 4 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ST5-XXLV-Measure43.71Unverified
2MPNetV-Measure43.69Unverified
3GTR-XXLV-Measure42.42Unverified
4MiniLM-L6V-Measure42.35Unverified
5ST5-XLV-Measure42.34Unverified
6MiniLM-L12V-Measure41.81Unverified
7ST5-LargeV-Measure41.65Unverified
8GTR-LargeV-Measure41.6Unverified
9GTR-XLV-Measure41.51Unverified
10ContrieverV-Measure41.1Unverified
#ModelMetricClaimedVerifiedStatus
1G-BATAccuracy41.25Unverified
2BATAccuracy35.66Unverified
#ModelMetricClaimedVerifiedStatus
1Vector Space ModelRelated Headlines85Unverified