| Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline | May 16, 2025 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 0 |
| A thorough benchmark of automatic text classification: From traditional approaches to large language models | Apr 2, 2025 | Sentiment Analysistext-classification | CodeCode Available | 0 |
| Detection of Somali-written Fake News and Toxic Messages on the Social Media Using Transformer-based Language Models | Mar 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Statistical Theory of Contrastive Learning via Approximate Sufficient Statistics | Mar 21, 2025 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models | Feb 18, 2025 | Image to textOptical Character Recognition | CodeCode Available | 0 |
| Concept Navigation and Classification via Open-Source Large Language Model Processing | Feb 7, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Analyzing the Effect of Linguistic Similarity on Cross-Lingual Transfer: Tasks and Experimental Setups Matter | Jan 24, 2025 | Cross-Lingual TransferDependency Parsing | —Unverified | 0 |
| DISHONEST: Dissecting misInformation Spread using Homogeneous sOcial NEtworks and Semantic Topic classification | Dec 12, 2024 | MisinformationTopic Classification | —Unverified | 0 |
| Evaluating Pixel Language Models on Non-Standardized Languages | Dec 12, 2024 | Dependency ParsingIntent Detection | —Unverified | 0 |
| LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification | Nov 29, 2024 | ArticlesClassification | CodeCode Available | 0 |