| A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations | May 20, 2025 | SentenceSentence Classification | CodeCode Available | 1 |
| Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks | Jan 5, 2025 | Adversarial RobustnessBenchmarking | CodeCode Available | 0 |
| Consolidating and Developing Benchmarking Datasets for the Nepali Natural Language Understanding Tasks | Nov 28, 2024 | BenchmarkingNatural Language Inference | —Unverified | 0 |
| Cyber-Attack Technique Classification Using Two-Stage Trained Large Language Models | Nov 27, 2024 | ClassificationSentence | CodeCode Available | 3 |
| Multi-label Sequential Sentence Classification via Large Language Model | Nov 23, 2024 | Contrastive LearningExtractive Summarization | CodeCode Available | 1 |
| Analyzing the Evolution of Graphs and Texts | Nov 9, 2024 | Representation LearningSentence Classification | —Unverified | 0 |
| Logistic Regression makes small LLMs strong and explainable "tens-of-shot" classifiers | Aug 6, 2024 | Classificationregression | —Unverified | 0 |
| Constructing the CORD-19 Vaccine Dataset | Jul 26, 2024 | Question AnsweringSentence | —Unverified | 0 |
| Large Language Models for Anomaly Detection in Computational Workflows: from Supervised Fine-Tuning to In-Context Learning | Jul 24, 2024 | Anomaly DetectionIn-Context Learning | CodeCode Available | 0 |
| MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models | Jul 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |