SOTAVerified

Topic Classification

Papers

Showing 150 of 186 papers

TitleStatusHype
Prototypical Verbalizer for Prompt-based Few-shot TuningCode4
Explaining NLP Models via Minimal Contrastive Editing (MiCE)Code1
2kenize: Tying Subword Sequences for Chinese Script ConversionCode1
Hierarchical Multi-Label Classification of Scientific DocumentsCode1
Hierarchical Transformers for Long Document ClassificationCode1
DocSCAN: Unsupervised Text Classification via Learning from NeighborsCode1
MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transferCode1
Zero-Shot Text Classification via Self-Supervised TuningCode1
Cross-Lingual Adaptation using Structural Correspondence LearningCode1
Language Through a Prism: A Spectral Approach for Multiscale Language RepresentationsCode1
LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual LexiconsCode1
Entailment as Few-Shot LearnerCode1
Adapting Pre-trained Language Models to African Languages via Multilingual Adaptive Fine-TuningCode1
MasakhaNEWS: News Topic Classification for African languagesCode1
GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph KnowledgeCode1
Polyglot Prompt: Multilingual Multitask PrompTrainingCode1
Revisiting LSTM Networks for Semi-Supervised Text Classification via Mixed Objective FunctionCode1
TEMPERA: Test-Time Prompting via Reinforcement LearningCode1
Label Semantic Aware Pre-training for Few-shot Text ClassificationCode1
KLUE: Korean Language Understanding EvaluationCode1
In-Context Learning with Iterative Demonstration SelectionCode1
HUE: Pretrained Model and Dataset for Understanding Hanja Documents of Ancient KoreaCode1
L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic LanguagesCode1
MultiEURLEX -- A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transferCode1
SynthesizRR: Generating Diverse Datasets with Retrieval AugmentationCode1
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and DialectsCode1
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question AnsweringCode1
Newswire: A Large-Scale Structured Database of a Century of Historical NewsCode1
Addressing Topic Granularity and Hallucination in Large Language Models for Topic ModellingCode0
BagBERT: BERT-based bagging-stacking for multi-topic classificationCode0
A Multi-Task Benchmark for Abusive Language Detection in Low-Resource SettingsCode0
A Video Is Worth 4096 Tokens: Verbalize Videos To Understand Them In Zero ShotCode0
Machine-assisted quantitizing designs: augmenting humanities and social sciences with artificial intelligenceCode0
Automatic Classification of News Subjects in Broadcast News: Application to a Gender Bias Representation AnalysisCode0
Low-Resource Language Processing: An OCR-Driven Summarization and Translation PipelineCode0
LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic ClassificationCode0
A thorough benchmark of automatic text classification: From traditional approaches to large language modelsCode0
Leap-LSTM: Enhancing Long Short-Term Memory for Text CategorizationCode0
Active learning in annotating micro-blogs dealing with e-reputationCode0
Inference and Verbalization Functions During In-Context LearningCode0
Izindaba-Tindzaba: Machine learning news categorisation for Long and Short Text for isiZulu and SiswatiCode0
Controlling the Interaction Between Generation and Inference in Semi-Supervised Variational Autoencoders Using Importance WeightingCode0
Give your Text Representation Models some Love: the Case for BasqueCode0
Saliency Map Verbalization: Comparing Feature Importance Representations from Model-free and Instruction-based MethodsCode0
Optimal and efficient text counterfactuals using Graph Neural NetworksCode0
Leveraging QA Datasets to Improve Generative Data AugmentationCode0
From Random to Supervised: A Novel Dropout Mechanism Integrated with Global InformationCode0
ConCET: Entity-Aware Topic Classification for Open-Domain Conversational AgentsCode0
An Overview of the Active Gene Annotation Corpus and the BioNLP OST 2019 AGAC Track TasksCode0
Fine-tuning Encoders for Improved Monolingual and Zero-shot Polylingual Neural Topic ModelingCode0
Show:102550
← PrevPage 1 of 4Next →

No leaderboard results yet.