Topic coverage

A prevalent use case of topic models is that of topic discovery. However, most of the topic model evaluation methods rely on abstract metrics such as perplexity or topic coherence. The topic coverage approach is to measure the models' performance by matching model-generated topics to a fixed set of reference topics - topics discovered by humans and represented in a machine-readable format. This way, the models are evaluated in the context of their use, by essentially simulating topic modeling in a fixed setting defined by a text collection and a set of reference topics. Reference topics represent a ground truth that can be used to evaluate both topic models and other measures of model performance. This coverage approach enables large-scale automatic evaluation of existing and future topic models.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–18 of 18 papers

Title	Date	Tasks	Status	Hype
DeFine: A Decomposed and Fine-Grained Annotated Dataset for Long-form Article Generation	Mar 10, 2025	ArticlesForm	—Unverified	0
QG-SMS: Enhancing Test Item Analysis via Student Modeling and Simulation	Mar 7, 2025	Language ModelingLanguage Modelling	—Unverified	0
ExpertGenQA: Open-ended QA generation in Specialized Domains	Mar 4, 2025	DiversityFew-Shot Learning	—Unverified	0
PinLanding: Content-First Keyword Landing Page Generation via Multi-Modal AI for Web-Scale Discovery	Mar 1, 2025	AttributeAttribute Extraction	—Unverified	0
Neural Topic Modeling with Large Language Models in the Loop	Nov 13, 2024	Topic coverageTopic Models	CodeCode Available	1
ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval	May 18, 2023	DescriptiveRetrieval	CodeCode Available	1
MUG: A General Meeting Understanding and Generation Benchmark	Mar 24, 2023	Extractive SummarizationKeyphrase Extraction	—Unverified	0
Topic Ontologies for Arguments	Jan 23, 2023	Stance ClassificationTopic coverage	—Unverified	0
TegFormer: Topic-to-Essay Generation with Good Topic Coverage and High Text Coherence	Dec 27, 2022	DecoderLanguage Modelling	—Unverified	0
AugESC: Dialogue Augmentation with Large Language Models for Emotional Support Conversation	Feb 26, 2022	Data AugmentationDialogue Generation	CodeCode Available	1
TaxoCom: Topic Taxonomy Completion with Hierarchical Discovery of Novel Topic Clusters	Jan 18, 2022	ClusteringTopic coverage	—Unverified	0
KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics	Jan 15, 2022	Articlestext-to-speech	—Unverified	0
Thirty Years of Academic Finance	Dec 30, 2021	ArticlesTopic coverage	—Unverified	0
Local and Global Topics in Text Modeling of Web Pages Nested in Web Sites	Mar 30, 2021	ArticlesTopic coverage	—Unverified	0
Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-Encoders	Dec 14, 2020	DenoisingDiversity	CodeCode Available	1
A Topic Coverage Approach to Evaluation of Topic Models	Dec 11, 2020	Topic coverageTopic Models	CodeCode Available	0
Entertaining and Opinionated but Too Controlling: A Large-Scale User Study of an Open Domain Alexa Prize System	Aug 13, 2019	SchedulingTopic coverage	—Unverified	0
Using OpenWordnet-PT for Question Answering on Legal Domain	Jan 1, 2018	Question AnsweringTopic coverage	—Unverified	0

Show:10 25 50

No leaderboard results yet.