| On Multilingual Encoder Language Model Compression for Low-Resource Languages | May 22, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings | May 17, 2025 | Abusive LanguageTopic Classification | CodeCode Available | 0 |
| Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline | May 16, 2025 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 0 |
| A thorough benchmark of automatic text classification: From traditional approaches to large language models | Apr 2, 2025 | Sentiment Analysistext-classification | CodeCode Available | 0 |
| Detection of Somali-written Fake News and Toxic Messages on the Social Media Using Transformer-based Language Models | Mar 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Statistical Theory of Contrastive Learning via Approximate Sufficient Statistics | Mar 21, 2025 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models | Feb 18, 2025 | Image to textOptical Character Recognition | CodeCode Available | 0 |
| Concept Navigation and Classification via Open-Source Large Language Model Processing | Feb 7, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Analyzing the Effect of Linguistic Similarity on Cross-Lingual Transfer: Tasks and Experimental Setups Matter | Jan 24, 2025 | Cross-Lingual TransferDependency Parsing | —Unverified | 0 |
| Evaluating Pixel Language Models on Non-Standardized Languages | Dec 12, 2024 | Dependency ParsingIntent Detection | —Unverified | 0 |
| DISHONEST: Dissecting misInformation Spread using Homogeneous sOcial NEtworks and Semantic Topic classification | Dec 12, 2024 | MisinformationTopic Classification | —Unverified | 0 |
| LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification | Nov 29, 2024 | ArticlesClassification | CodeCode Available | 0 |
| QuickCharNet: An Efficient URL Classification Framework for Enhanced Search Engine Optimization | Oct 22, 2024 | ClassificationEfficient Neural Network | CodeCode Available | 0 |
| From Measurement Instruments to Data: Leveraging Theory-Driven Synthetic Training Data for Classifying Social Constructs | Oct 16, 2024 | Classificationtext-classification | —Unverified | 0 |
| Inference and Verbalization Functions During In-Context Learning | Oct 12, 2024 | In-Context LearningNatural Language Inference | CodeCode Available | 0 |
| The Large Language Model GreekLegalRoBERTa | Oct 10, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Model-Driven Data Pruning Enables Efficient Active Learning | Oct 5, 2024 | Active LearningLanguage Modeling | —Unverified | 0 |
| Multilingual Topic Classification in X: Dataset and Analysis | Oct 4, 2024 | ClassificationDiversity | —Unverified | 0 |
| GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph Knowledge | Sep 26, 2024 | Natural Language InferenceSentiment Analysis | CodeCode Available | 1 |
| Optimal and efficient text counterfactuals using Graph Neural Networks | Aug 4, 2024 | counterfactualDecision Making | CodeCode Available | 0 |
| Assessing In-context Learning and Fine-tuning for Topic Classification of German Web Data | Jul 23, 2024 | Binary ClassificationIn-Context Learning | —Unverified | 0 |
| Automatic Classification of News Subjects in Broadcast News: Application to a Gender Bias Representation Analysis | Jul 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Multi-task Prompt Words Learning for Social Media Content Generation | Jul 10, 2024 | Keyword ExtractionScene Recognition | —Unverified | 0 |
| STF: Sentence Transformer Fine-Tuning For Topic Categorization With Limited Data | Jul 3, 2024 | ClassificationSentence | —Unverified | 0 |
| Retrieval Augmented Zero-Shot Text Classification | Jun 21, 2024 | ClassificationRetrieval | CodeCode Available | 0 |