SOTAVerified

Text Clustering

Text clustering is the task of grouping a set of texts so that texts in the same group (called a cluster) are more similar to each other, under some chosen similarity measure, than to texts in other clusters. (Source: Adapted from Wikipedia)
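As a rough illustration of the definition above, the following sketch groups a few toy sentences by bag-of-words cosine similarity. It is a minimal greedy scheme using only the Python standard library, not a method from any of the listed papers; the function names (`bag_of_words`, `cosine`, `cluster_texts`), the threshold of 0.3, and the sample sentences are all assumptions made for the example.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text and count its alphabetic word tokens."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_texts(texts, threshold=0.3):
    """Greedy single-pass clustering (illustrative only).

    Each text joins the most similar existing cluster if the similarity
    to that cluster's summed word counts reaches the threshold;
    otherwise it starts a new cluster.
    """
    clusters = []  # each entry: {"centroid": Counter, "members": [indices]}
    for i, text in enumerate(texts):
        vec = bag_of_words(text)
        best, best_sim = None, threshold
        for c in clusters:
            sim = cosine(vec, c["centroid"])
            if sim >= best_sim:
                best, best_sim = c, sim
        if best is None:
            clusters.append({"centroid": Counter(vec), "members": [i]})
        else:
            best["centroid"].update(vec)  # add the new text's counts
            best["members"].append(i)
    return [c["members"] for c in clusters]


texts = [
    "the cat sat on the mat",
    "a cat and a kitten sat on a mat",
    "stock markets fell sharply today",
    "markets and stock prices fell again",
]
groups = cluster_texts(texts)
print(groups)
```

The "in some sense" of the definition is carried here by the choice of cosine similarity over word counts; swapping in a different representation (for example sentence embeddings) changes which texts end up together.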

Papers

Showing 51–100 of 123 papers

| Title | Status | Hype |
|---|---|---|
| Proposition-Level Clustering for Multi-Document Summarization | Code | 1 |
| Task-Oriented Clustering for Dialogues | Code | 0 |
| Representation Learning for Short Text Clustering | | 0 |
| Translation Transformers Rediscover Inherent Data Domains | Code | 0 |
| Hybrid Multisource Feature Fusion for the Text Clustering | | 0 |
| Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling | | 0 |
| Learn The Big Picture: Representation Learning for Clustering | Code | 0 |
| Efficient Sparse Spherical k-Means for Document Clustering | Code | 0 |
| Text Classification and Clustering with Annealing Soft Nearest Neighbor Loss | | 0 |
| Deep Clustering with Measure Propagation | | 0 |
| Cluster Analysis of Online Mental Health Discourse using Topic-Infused Deep Contextualized Representations | | 0 |
| Amharic Text Clustering Using Encyclopedic Knowledge with Neural Word Embedding | | 0 |
| Supporting Clustering with Contrastive Learning | Code | 1 |
| Short Text Clustering with Transformers | | 0 |
| Discovering New Intents with Deep Aligned Clustering | Code | 1 |
| An Empirical Survey of Unsupervised Text Representation Methods on Twitter Data | | 0 |
| Unsupervised Fine-tuning for Text Clustering | | 0 |
| Neural Text Classification by Jointly Learning to Cluster and Align | | 0 |
| Self-supervised Document Clustering Based on BERT with Data Augment | | 0 |
| ComStreamClust: a communicative multi-agent approach to text clustering in streaming data | Code | 1 |
| Unification of HDP and LDA Models for Optimal Topic Clustering of Subject Specific Question Banks | | 0 |
| An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering | | 0 |
| Dissimilarity Mixture Autoencoder for Deep Clustering | Code | 1 |
| Neural Topic Modeling with Bidirectional Adversarial Training | Code | 1 |
| Enhancement of Short Text Clustering by Iterative Classification | Code | 1 |
| Text Classification for Azerbaijani Language Using Machine Learning and Embedding | | 0 |
| Attentive Representation Learning with Adversarial Training for Short Text Clustering | | 0 |
| Discovering New Intents via Constrained Deep Adaptive Clustering with Cluster Refinement | Code | 0 |
| Text Mining using Nonnegative Matrix Factorization and Latent Semantic Analysis | | 0 |
| A Self-Training Approach for Short Text Clustering | Code | 0 |
| On the Use of ArXiv as a Dataset | Code | 0 |
| An end-to-end Neural Network Framework for Text Clustering | | 0 |
| Mutual Clustering on Comparative Texts via Heterogeneous Information Networks | | 0 |
| ELKI: A large open-source library for data analysis - ELKI Release 0.7.5 "Heidelberg" | Code | 0 |
| Sequential Embedding Induced Text Clustering, a Non-parametric Bayesian Approach | | 0 |
| Multilingual Short Text Responses Clustering for Mobile Educational Activities: a Preliminary Exploration | | 0 |
| Learning Thematic Similarity Metric from Article Sections Using Triplet Networks | | 0 |
| A Graph-based Text Similarity Measure That Employs Named Entity Information | | 0 |
| Hybrid Clustering based on Content and Connection Structure using Joint Nonnegative Matrix Factorization | | 0 |
| Self-Taught Convolutional Neural Networks for Short Text Clustering | Code | 0 |
| DISCO: A System Leveraging Semantic Search in Document Review | | 0 |
| Character-Aware Neural Networks for Arabic Named Entity Recognition for Social Media | | 0 |
| Accounting ngrams and multi-word terms can improve topic models | | 0 |
| Effects of Creativity and Cluster Tightness on Short Text Clustering Performance | | 0 |
| Semi-supervised Clustering for Short Text via Deep Representation Learning | | 0 |
| Clustering Urdu News Using Headlines | Code | 0 |
| TSDPMM: Incorporating Prior Topic Knowledge into Dirichlet Process Mixture Models for Text Clustering | | 0 |
| An Unsupervised Bayesian Modelling Approach for Storyline Detection on News Articles | | 0 |
| News clustering approach based on discourse text structure | | 0 |
| Robust Multi-Relational Clustering via ℓ1-Norm Symmetric Nonnegative Matrix Factorization | | 0 |

Benchmark Results

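| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ST5-XXL | V-Measure | 43.71 | | Unverified |
| 2 | MPNet | V-Measure | 43.69 | | Unverified |
| 3 | GTR-XXL | V-Measure | 42.42 | | Unverified |
| 4 | MiniLM-L6 | V-Measure | 42.35 | | Unverified |
| 5 | ST5-XL | V-Measure | 42.34 | | Unverified |
| 6 | MiniLM-L12 | V-Measure | 41.81 | | Unverified |
| 7 | ST5-Large | V-Measure | 41.65 | | Unverified |
| 8 | GTR-Large | V-Measure | 41.6 | | Unverified |
| 9 | GTR-XL | V-Measure | 41.51 | | Unverified |
| 10 | Contriever | V-Measure | 41.1 | | Unverified |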
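
For readers unfamiliar with the V-Measure metric used in the table above, the sketch below computes it from scratch as the harmonic mean of homogeneity and completeness, both defined through conditional entropies of the gold classes and the predicted clusters. The function names and toy labels are assumptions for illustration, and the code yields a value in [0, 1]; the table's figures appear to be on a 0 to 100 scale, which is also an assumption.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (natural log) of a label list."""
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in Counter(labels).values())


def conditional_entropy(target, given):
    """H(target | given) for two parallel label lists."""
    n = len(target)
    joint = Counter(zip(given, target))
    given_counts = Counter(given)
    return -sum(
        (c / n) * math.log(c / given_counts[g]) for (g, _), c in joint.items()
    )


def v_measure(true_labels, pred_labels):
    """Harmonic mean of homogeneity and completeness."""
    h_classes = entropy(true_labels)
    h_clusters = entropy(pred_labels)
    # Homogeneity: each cluster contains members of a single class.
    homogeneity = (
        1.0 if h_classes == 0
        else 1.0 - conditional_entropy(true_labels, pred_labels) / h_classes
    )
    # Completeness: all members of a class land in the same cluster.
    completeness = (
        1.0 if h_clusters == 0
        else 1.0 - conditional_entropy(pred_labels, true_labels) / h_clusters
    )
    if homogeneity + completeness == 0:
        return 0.0
    return 2 * homogeneity * completeness / (homogeneity + completeness)


# Cluster ids need not match class ids; only the grouping matters.
perfect = v_measure([0, 0, 1, 1], [1, 1, 0, 0])
# Putting everything in one cluster is complete but not homogeneous.
one_cluster = v_measure([0, 0, 1, 1], [0, 0, 0, 0])
print(perfect, one_cluster)
```

Because the score depends only on how items are grouped, it is invariant to renaming cluster ids, which is why it suits unsupervised clustering output.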

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | G-BAT | Accuracy | 41.25 | | Unverified |
| 2 | BAT | Accuracy | 35.66 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Vector Space Model | Related Headlines | 85 | | Unverified |