MTEB: Massive Text Embedding Benchmark Oct 13, 2022 Benchmarking Information Retrieval
Code Code Available 45 ComStreamClust: a communicative multi-agent approach to text clustering in streaming data Oct 11, 2020 Clustering Semantic Similarity
Code Code Available 15 DeepLens: Interactive Out-of-distribution Data Detection in NLP Models Mar 2, 2023 Text Clustering
Code Code Available 15 EASE: Entity-Aware Contrastive Learning of Sentence Embedding May 9, 2022 Clustering Contrastive Learning
Code Code Available 15 Discovering New Intents with Deep Aligned Clustering Dec 16, 2020 Clustering Open Intent Discovery
Code Code Available 15 ClusterLLM: Large Language Models as a Guide for Text Clustering May 24, 2023 Clustering Language Modelling
Code Code Available 15 Neural Topic Modeling with Bidirectional Adversarial Training Apr 26, 2020 Clustering Text Clustering
Code Code Available 15 Enhancement of Short Text Clustering by Iterative Classification Jan 31, 2020 Classification Clustering
Code Code Available 15 Short Text Clustering via Convolutional Neural Networks Jun 1, 2015 Clustering Short Text Clustering
Code Code Available 15 Dissimilarity Mixture Autoencoder for Deep Clustering Jun 15, 2020 Clustering Deep Clustering
Code Code Available 15 Training Effective Neural Sentence Encoders from Automatically Mined Paraphrases Jul 26, 2022 Language Modeling Language Modelling
Code Code Available 15 Text Clustering as Classification with LLMs Sep 30, 2024 Classification Clustering
Code Code Available 15 Proposition-Level Clustering for Multi-Document Summarization Jan 16, 2022 Clustering Document Summarization
Code Code Available 15 Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive Optimal Transport for Short Text Clustering May 23, 2023 Clustering Contrastive Learning
Code Code Available 15 Proposition-Level Clustering for Multi-Document Summarization Dec 16, 2021 Clustering Document Summarization
Code Code Available 15 Large Language Models Enable Few-Shot Clustering Jul 2, 2023 Clustering Language Modeling
Code Code Available 15 Supporting Clustering with Contrastive Learning Mar 24, 2021 Clustering Contrastive Learning
Code Code Available 15 Clustering Urdu News Using Headlines Sep 27, 2015 Clustering Information Retrieval
Code Code Available 05 Subspace Co-clustering with Two-Way Graph Convolution Feb 1, 2022 Clustering Image Clustering
Code Code Available 05 Very Large Language Model as a Unified Methodology of Text Mining Dec 19, 2022 Clustering Language Modeling
Code Code Available 05 Efficient Sparse Spherical k-Means for Document Clustering Jul 30, 2021 Clustering Short Text Clustering
Code Code Available 05 Task-Oriented Clustering for Dialogues Nov 1, 2021 Clustering Diversity
Code Code Available 05 Discriminative Representation learning via Attention-Enhanced Contrastive Learning for Short Text Clustering Jan 7, 2025 Clustering Contrastive Learning
Code Code Available 05 Discovering New Intents via Constrained Deep Adaptive Clustering with Cluster Refinement Nov 20, 2019 Clustering Open Intent Discovery
Code Code Available 05 On the Use of ArXiv as a Dataset Apr 30, 2019 Articles Author Attribution
Code Code Available 05 Clustering Similar Amendments at the Italian Senate Jun 1, 2022 Clustering Management
Code Code Available 05 Reliable Pseudo-labeling via Optimal Transport with Attention for Short Text Clustering Jan 25, 2025 Clustering Contrastive Learning
Code Code Available 05 Self-Taught Convolutional Neural Networks for Short Text Clustering Jan 1, 2017 Clustering Dimensionality Reduction
Code Code Available 05 More Discriminative Sentence Embeddings via Semantic Graph Smoothing Feb 20, 2024 Clustering Sentence
Code Code Available 05 CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass May 1, 2025 Contrastive Learning Information Retrieval
Code Code Available 05 Learn The Big Picture: Representation Learning for Clustering Aug 1, 2021 Clustering Representation Learning
Code Code Available 05 Guiding Sentiment Analysis with Hierarchical Text Clustering: Analyzing the German X/Twitter Discourse on Face Masks in the 2020 COVID-19 Pandemic Aug 1, 2024 Clustering Data Visualization
Code Code Available 05 Human-interpretable clustering of short-text using large language models May 12, 2024 Clustering Short Text Clustering
Code Code Available 05 A Self-Training Approach for Short Text Clustering Aug 1, 2019 Clustering Deep Clustering
Code Code Available 05 ELKI: A large open-source library for data analysis - ELKI Release 0.7.5 "Heidelberg" Feb 10, 2019 Benchmarking Clustering
Code Code Available 05 Influence of various text embeddings on clustering performance in NLP May 4, 2023 Clustering Text Clustering
Code Code Available 05 NeurCAM: Interpretable Neural Clustering via Additive Models Aug 23, 2024 Additive models Clustering
Code Code Available 05 Translation Transformers Rediscover Inherent Data Domains Sep 16, 2021 Clustering Domain Adaptation
Code Code Available 05 ClusTop: An unsupervised and integrated text clustering and topic extraction framework Jan 3, 2023 Clustering Dimensionality Reduction
— Unverified 00 Clustering tweets usingWikipedia concepts May 1, 2014 Clustering Text Clustering
— Unverified 00 An Unsupervised Bayesian Modelling Approach for Storyline Detection on News Articles Sep 1, 2015 Articles Text Clustering
— Unverified 00 A Method of Accounting Bigrams in Topic Models Jun 1, 2015 Document Summarization Information Retrieval
— Unverified 00 Clustering-Induced Generative Incomplete Image-Text Clustering (CIGIT-C) Sep 28, 2022 Clustering Text Clustering
— Unverified 00 An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering Jul 1, 2020 Clustering Short Text Clustering
— Unverified 00 Cluster Analysis of Online Mental Health Discourse using Topic-Infused Deep Contextualized Representations Apr 1, 2021 Text Clustering
— Unverified 00 Effects of Creativity and Cluster Tightness on Short Text Clustering Performance Aug 1, 2016 Clustering Semantic Textual Similarity
— Unverified 00 CLTC: A Chinese-English Cross-lingual Topic Corpus May 1, 2012 Articles Clustering
— Unverified 00 An enhanced Teaching-Learning-Based Optimization (TLBO) with Grey Wolf Optimizer (GWO) for text feature selection and clustering Feb 19, 2024 Clustering Dimensionality Reduction
— Unverified 00 A Graph-based Text Similarity Measure That Employs Named Entity Information Sep 1, 2017 Clustering named-entity-recognition
— Unverified 00 AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models Feb 14, 2023 Clustering Language Modeling
— Unverified 00