| Newswire: A Large-Scale Structured Database of a Century of Historical News | Jun 13, 2024 | ArticlesEntity Disambiguation | CodeCode Available | 1 |
| Topic Classification of Case Law Using a Large Language Model and a New Taxonomy for UK Law: AI Insights into Summary Judgment | May 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation | May 16, 2024 | Bias DetectionDiversity | CodeCode Available | 1 |
| InsightNet: Structured Insight Mining from Customer Feedback | May 12, 2024 | Semantic SimilaritySemantic Textual Similarity | —Unverified | 0 |
| Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling | May 1, 2024 | HallucinationTopic Classification | CodeCode Available | 0 |
| What Drives Performance in Multilingual Language Models? | Apr 29, 2024 | Cross-Lingual TransferMultilingual NLP | CodeCode Available | 0 |
| L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi | Apr 28, 2024 | ArticlesDocument Classification | CodeCode Available | 0 |
| Forget NLI, Use a Dictionary: Zero-Shot Topic Classification for Low-Resource Languages with Application to Luxembourgish | Apr 5, 2024 | Language ModellingNatural Language Inference | CodeCode Available | 0 |
| Few-Shot Cross-Lingual Transfer for Prompting Large Language Models in Low-Resource Languages | Mar 9, 2024 | Abstractive Text SummarizationCross-Lingual Transfer | —Unverified | 0 |
| Zero-Shot Topic Classification of Column Headers: Leveraging LLMs for Metadata Enrichment | Mar 1, 2024 | Retrievaltext-classification | CodeCode Available | 0 |
| LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons | Feb 21, 2024 | Sentiment AnalysisTopic Classification | CodeCode Available | 1 |
| Prompt-Based Bias Calibration for Better Zero/Few-Shot Learning of Language Models | Feb 15, 2024 | FairnessFew-Shot Learning | —Unverified | 0 |
| Advancing NLP Models with Strategic Text Augmentation: A Comprehensive Study of Augmentation Methods and Curriculum Strategies | Feb 14, 2024 | Sentiment AnalysisText Augmentation | —Unverified | 0 |
| L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages | Jan 4, 2024 | ArticlesClassification | CodeCode Available | 1 |
| Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling | Jan 3, 2024 | Data Augmentationfill-mask | —Unverified | 0 |
| A Soft Contrastive Learning-based Prompt Model for Few-shot Sentiment Analysis | Dec 16, 2023 | ClassificationContrastive Learning | —Unverified | 0 |
| DRAFT: Dense Retrieval Augmented Few-shot Topic classifier Framework | Dec 5, 2023 | ClassificationIn-Context Learning | CodeCode Available | 0 |
| NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian | Dec 3, 2023 | Natural Language UnderstandingQuestion Answering | CodeCode Available | 0 |
| How good are Large Language Models on African Languages? | Nov 14, 2023 | In-Context LearningLanguage Modelling | —Unverified | 0 |
| Attention-Enhancing Backdoor Attacks Against BERT-based Models | Oct 23, 2023 | Sentiment AnalysisTopic Classification | —Unverified | 0 |
| Domain-Specific Language Model Post-Training for Indonesian Financial NLP | Oct 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| In-Context Learning with Iterative Demonstration Selection | Oct 15, 2023 | Few-Shot LearningIn-Context Learning | CodeCode Available | 1 |
| HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model | Oct 6, 2023 | Automatic Speech RecognitionRepresentation Learning | —Unverified | 0 |
| UPB @ ACTI: Detecting Conspiracies using fine tuned Sentence Transformers | Sep 28, 2023 | Binary ClassificationClassification | —Unverified | 0 |
| Machine-assisted quantitizing designs: augmenting humanities and social sciences with artificial intelligence | Sep 24, 2023 | BenchmarkingChange Detection | CodeCode Available | 0 |