SOTAVerified

Topic coverage

A prevalent use case of topic models is that of topic discovery. However, most of the topic model evaluation methods rely on abstract metrics such as perplexity or topic coherence. The topic coverage approach is to measure the models' performance by matching model-generated topics to a fixed set of reference topics - topics discovered by humans and represented in a machine-readable format. This way, the models are evaluated in the context of their use, by essentially simulating topic modeling in a fixed setting defined by a text collection and a set of reference topics. Reference topics represent a ground truth that can be used to evaluate both topic models and other measures of model performance. This coverage approach enables large-scale automatic evaluation of existing and future topic models.

Papers

Showing 118 of 18 papers

TitleStatusHype
DeFine: A Decomposed and Fine-Grained Annotated Dataset for Long-form Article Generation0
QG-SMS: Enhancing Test Item Analysis via Student Modeling and Simulation0
ExpertGenQA: Open-ended QA generation in Specialized Domains0
PinLanding: Content-First Keyword Landing Page Generation via Multi-Modal AI for Web-Scale Discovery0
Neural Topic Modeling with Large Language Models in the LoopCode1
ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense RetrievalCode1
MUG: A General Meeting Understanding and Generation BenchmarkCode0
Topic Ontologies for Arguments0
TegFormer: Topic-to-Essay Generation with Good Topic Coverage and High Text Coherence0
AugESC: Dialogue Augmentation with Large Language Models for Emotional Support ConversationCode1
TaxoCom: Topic Taxonomy Completion with Hierarchical Discovery of Novel Topic Clusters0
KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics0
Thirty Years of Academic Finance0
Local and Global Topics in Text Modeling of Web Pages Nested in Web Sites0
Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-EncodersCode1
A Topic Coverage Approach to Evaluation of Topic ModelsCode0
Entertaining and Opinionated but Too Controlling: A Large-Scale User Study of an Open Domain Alexa Prize System0
Using OpenWordnet-PT for Question Answering on Legal Domain0
Show:102550

No leaderboard results yet.