MTEB: Massive Text Embedding Benchmark Oct 13, 2022 Benchmarking Information Retrieval
Code Code Available 4Text Clustering as Classification with LLMs Sep 30, 2024 Classification Clustering
Code Code Available 1Large Language Models Enable Few-Shot Clustering Jul 2, 2023 Clustering Language Modeling
Code Code Available 1ClusterLLM: Large Language Models as a Guide for Text Clustering May 24, 2023 Clustering Language Modelling
Code Code Available 1Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive Optimal Transport for Short Text Clustering May 23, 2023 Clustering Contrastive Learning
Code Code Available 1DeepLens: Interactive Out-of-distribution Data Detection in NLP Models Mar 2, 2023 Text Clustering
Code Code Available 1Training Effective Neural Sentence Encoders from Automatically Mined Paraphrases Jul 26, 2022 Language Modeling Language Modelling
Code Code Available 1EASE: Entity-Aware Contrastive Learning of Sentence Embedding May 9, 2022 Clustering Contrastive Learning
Code Code Available 1Proposition-Level Clustering for Multi-Document Summarization Jan 16, 2022 Clustering Document Summarization
Code Code Available 1Proposition-Level Clustering for Multi-Document Summarization Dec 16, 2021 Clustering Document Summarization
Code Code Available 1Supporting Clustering with Contrastive Learning Mar 24, 2021 Clustering Contrastive Learning
Code Code Available 1Discovering New Intents with Deep Aligned Clustering Dec 16, 2020 Clustering Open Intent Discovery
Code Code Available 1ComStreamClust: a communicative multi-agent approach to text clustering in streaming data Oct 11, 2020 Clustering Semantic Similarity
Code Code Available 1Dissimilarity Mixture Autoencoder for Deep Clustering Jun 15, 2020 Clustering Deep Clustering
Code Code Available 1Neural Topic Modeling with Bidirectional Adversarial Training Apr 26, 2020 Clustering Text Clustering
Code Code Available 1Enhancement of Short Text Clustering by Iterative Classification Jan 31, 2020 Classification Clustering
Code Code Available 1Short Text Clustering via Convolutional Neural Networks Jun 1, 2015 Clustering Short Text Clustering
Code Code Available 1CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass May 1, 2025 Contrastive Learning Information Retrieval
Code Code Available 0Moving Past Single Metrics: Exploring Short-Text Clustering Across Multiple Resolutions Feb 24, 2025 Clustering Informativeness
— Unverified 0Advanced Text Analytics -- Graph Neural Network for Fake News Detection in Social Media Feb 22, 2025 Clustering Fake News Detection
— Unverified 0k-LLMmeans: Scalable, Stable, and Interpretable Text Clustering via LLM-based Centroids Feb 12, 2025 Clustering Text Clustering
— Unverified 0Reliable Pseudo-labeling via Optimal Transport with Attention for Short Text Clustering Jan 25, 2025 Clustering Contrastive Learning
Code Code Available 0Discriminative Representation learning via Attention-Enhanced Contrastive Learning for Short Text Clustering Jan 7, 2025 Clustering Contrastive Learning
Code Code Available 0LITA: An Efficient LLM-assisted Iterative Topic Augmentation Framework Dec 17, 2024 Clustering Specificity
— Unverified 0Dial-In LLM: Human-Aligned LLM-in-the-loop Intent Clustering for Customer Service Dialogues Dec 12, 2024 Clustering Coherence Evaluation
— Unverified 0Hierarchical mixtures of Unigram models for short text clustering: The role of Beta-Liouville priors Oct 29, 2024 Short Text Clustering Text Clustering
— Unverified 0Contrastive Learning Subspace for Text Clustering Aug 26, 2024 Clustering Contrastive Learning
— Unverified 0NeurCAM: Interpretable Neural Clustering via Additive Models Aug 23, 2024 Additive models Clustering
Code Code Available 0Extracting Sentence Embeddings from Pretrained Transformer Models Aug 15, 2024 Clustering Retrieval-augmented Generation
— Unverified 0An Efficient and Explanatory Image and Text Clustering System with Multimodal Autoencoder Architecture Aug 14, 2024 Text Clustering
— Unverified 0Guiding Sentiment Analysis with Hierarchical Text Clustering: Analyzing the German X/Twitter Discourse on Face Masks in the 2020 COVID-19 Pandemic Aug 1, 2024 Clustering Data Visualization
Code Code Available 0ZeroDL: Zero-shot Distribution Learning for Text Clustering via Large Language Models Jun 19, 2024 Clustering In-Context Learning
— Unverified 0Human-interpretable clustering of short-text using large language models May 12, 2024 Clustering Short Text Clustering
Code Code Available 0Context-Aware Clustering using Large Language Models May 2, 2024 Clustering Language Modeling
— Unverified 0Text clustering applied to data augmentation in legal contexts Apr 8, 2024 Classification Clustering
— Unverified 0Text Clustering with Large Language Model Embeddings Mar 22, 2024 Clustering Dimensionality Reduction
— Unverified 0More Discriminative Sentence Embeddings via Semantic Graph Smoothing Feb 20, 2024 Clustering Sentence
Code Code Available 0An enhanced Teaching-Learning-Based Optimization (TLBO) with Grey Wolf Optimizer (GWO) for text feature selection and clustering Feb 19, 2024 Clustering Dimensionality Reduction
— Unverified 0Automatic Construction of Multi-faceted User Profiles using Text Clustering and its Application to Expert Recommendation and Filtering Problems Jan 19, 2024 Text Clustering valid
— Unverified 0Incremental hierarchical text clustering methods: a review Dec 12, 2023 Clustering Hierarchical Text Clustering
— Unverified 0Federated Learning for Short Text Clustering Nov 23, 2023 Clustering Federated Learning
— Unverified 0Elastic deep autoencoder for text embedding clustering by an improved graph regularization Sep 23, 2023 Clustering Dimensionality Reduction
— Unverified 0LACoS-BLOOM: Low-rank Adaptation with Contrastive objective on 8 bits Siamese-BLOOM May 10, 2023 GPU Language Modeling
— Unverified 0Influence of various text embeddings on clustering performance in NLP May 4, 2023 Clustering Text Clustering
Code Code Available 0CEIL: A General Classification-Enhanced Iterative Learning Framework for Text Clustering Apr 20, 2023 Clustering Deep Clustering
— Unverified 0AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models Feb 14, 2023 Clustering Language Modeling
— Unverified 0ClusTop: An unsupervised and integrated text clustering and topic extraction framework Jan 3, 2023 Clustering Dimensionality Reduction
— Unverified 0Very Large Language Model as a Unified Methodology of Text Mining Dec 19, 2022 Clustering Language Modeling
Code Code Available 0Improving Deep Embedded Clustering via Learning Cluster-level Representations Oct 1, 2022 Clustering Contrastive Learning
— Unverified 0Clustering-Induced Generative Incomplete Image-Text Clustering (CIGIT-C) Sep 28, 2022 Clustering Text Clustering
— Unverified 0