| Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline | May 16, 2025 | Abstractive Text Summarization, Language Modeling | Code Available | 0 |
| Fine-tuning Encoders for Improved Monolingual and Zero-shot Polylingual Neural Topic Modeling | Apr 11, 2021 | Classification, Cross-Lingual Transfer | Code Available | 0 |
| Inference and Verbalization Functions During In-Context Learning | Oct 12, 2024 | In-Context Learning, Natural Language Inference | Code Available | 0 |
| Zero-Shot Topic Classification of Column Headers: Leveraging LLMs for Metadata Enrichment | Mar 1, 2024 | Retrieval, text-classification | Code Available | 0 |
| A thorough benchmark of automatic text classification: From traditional approaches to large language models | Apr 2, 2025 | Sentiment Analysis, text-classification | Code Available | 0 |
| Leveraging QA Datasets to Improve Generative Data Augmentation | May 25, 2022 | Common Sense Reasoning, Data Augmentation | Code Available | 0 |
| NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian | Dec 3, 2023 | Natural Language Understanding, Question Answering | Code Available | 0 |
| Unsupervised Keyphrase Extraction via Interpretable Neural Networks | Mar 15, 2022 | Articles, Keyphrase Extraction | Code Available | 0 |
| Zero-Shot Cross-Lingual Transfer in Legal Domain Using Transformer Models | Nov 28, 2021 | Classification, Cross-Lingual Transfer | Code Available | 0 |
| n-stage Latent Dirichlet Allocation: A Novel Approach for LDA | Oct 16, 2021 | Sentiment Analysis, Text Classification | Code Available | 0 |
| Izindaba-Tindzaba: Machine learning news categorisation for Long and Short Text for isiZulu and Siswati | Jun 12, 2023 | Classification, regression | Code Available | 0 |
| Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages | Oct 7, 2020 | NER, Topic Classification | Code Available | 0 |
| Encoding Multi-Domain Scientific Papers by Ensembling Multiple CLS Tokens | Sep 8, 2023 | Citation Prediction, Topic Classification | Code Available | 0 |
| DRAFT: Dense Retrieval Augmented Few-shot Topic classifier Framework | Dec 5, 2023 | Classification, In-Context Learning | Code Available | 0 |
| L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi | Apr 28, 2024 | Articles, Document Classification | Code Available | 0 |
| Domain-Specific Language Model Post-Training for Indonesian Financial NLP | Oct 15, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Sequence Labeling Approach to the Task of Sentence Boundary Detection | Jan 20, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Controlling the Interaction Between Generation and Inference in Semi-Supervised Variational Autoencoders Using Importance Weighting | Oct 13, 2020 | Sentiment Analysis, Topic Classification | Code Available | 0 |
| Leap-LSTM: Enhancing Long Short-Term Memory for Text Categorization | May 28, 2019 | General Classification, Machine Translation | Code Available | 0 |
| Topic-based Evaluation for Conversational Bots | Jan 11, 2018 | Diversity, Topic Classification | Code Available | 0 |
| Saliency Map Verbalization: Comparing Feature Importance Representations from Model-free and Instruction-based Methods | Oct 13, 2022 | Abstractive Text Summarization, Feature Importance | Code Available | 0 |
| LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification | Nov 29, 2024 | Articles, Classification | Code Available | 0 |
| An Overview of the Active Gene Annotation Corpus and the BioNLP OST 2019 AGAC Track Tasks | Nov 1, 2019 | named-entity-recognition, Named Entity Recognition | Code Available | 0 |
| Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling | May 1, 2024 | Hallucination, Topic Classification | Code Available | 0 |
| ConCET: Entity-Aware Topic Classification for Open-Domain Conversational Agents | May 28, 2020 | Classification, General Classification | Code Available | 0 |
| Topic Classification from Text Using Decision Tree, K-NN and Multinomial Naïve Bayes | May 3, 2019 | Classification, Topic Classification | Code Available | 0 |
| A Robust Cybersecurity Topic Classification Tool | Aug 30, 2021 | BIG-bench Machine Learning, Classification | Code Available | 0 |
| What Drives Performance in Multilingual Language Models? | Apr 29, 2024 | Cross-Lingual Transfer, Multilingual NLP | Code Available | 0 |
| Machine-assisted quantitizing designs: augmenting humanities and social sciences with artificial intelligence | Sep 24, 2023 | Benchmarking, Change Detection | Code Available | 0 |
| Bidirectional Context-Aware Hierarchical Attention Network for Document Understanding | Aug 16, 2019 | Abstractive Text Summarization, document understanding | Code Available | 0 |
| Quantifying the Dissimilarity of Texts | May 3, 2023 | Clustering, Information Retrieval | Code Available | 0 |
| QuickCharNet: An Efficient URL Classification Framework for Enhanced Search Engine Optimization | Oct 22, 2024 | Classification, Efficient Neural Network | Code Available | 0 |
| Benchmarking Multilabel Topic Classification in the Kyrgyz Language | Aug 30, 2023 | Benchmarking, Classification | Code Available | 0 |
| Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models | Feb 18, 2025 | Image to text, Optical Character Recognition | Code Available | 0 |
| BagBERT: BERT-based bagging-stacking for multi-topic classification | Nov 10, 2021 | Classification, Ensemble Learning | Code Available | 0 |
| A Video Is Worth 4096 Tokens: Verbalize Videos To Understand Them In Zero Shot | May 16, 2023 | Emotion Classification, Question Answering | Code Available | 0 |