| MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs | Nov 14, 2024 | General Knowledge, Math | Code Available | 0 | 5 |
| Molecular Graph Representation Learning Integrating Large Language Models with Domain-specific Small Models | Aug 19, 2024 | Descriptive, Drug Discovery | Code Available | 0 | 5 |
| Pruning neural network models for gene regulatory dynamics using data and domain knowledge | Mar 5, 2024 | General Knowledge, Network Pruning | Code Available | 0 | 5 |
| Patching as Translation: the Data and the Metaphor | Aug 24, 2020 | General Knowledge, Program Repair | Code Available | 0 | 5 |
| PELMS: Pre-training for Effective Low-Shot Multi-Document Summarization | Nov 16, 2023 | Document Summarization, General Knowledge | Code Available | 0 | 5 |
| Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language Models | Apr 6, 2025 | Computational Efficiency, General Knowledge | Code Available | 0 | 5 |
| PROL : Rehearsal Free Continual Learning in Streaming Data via Prompt Online Learning | Jul 16, 2025 | Continual Learning, General Knowledge | Code Available | 0 | 5 |
| Quantized Prompt for Efficient Generalization of Vision-Language Models | Jul 15, 2024 | General Knowledge, Language Modelling | Code Available | 0 | 5 |
| REFinD: Relation Extraction Financial Dataset | May 22, 2023 | Articles, General Knowledge | Code Available | 0 | 5 |
| RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content | Jun 17, 2024 | Benchmarking, General Knowledge | Code Available | 0 | 5 |
| SciDeBERTa: Learning DeBERTa for Science Technology Documents and Fine-Tuning Information Extraction Tasks | Jun 8, 2022 | General Knowledge, Joint Entity and Relation Extraction | Code Available | 0 | 5 |
| Should We Really Edit Language Models? On the Evaluation of Edited Language Models | Oct 24, 2024 | General Knowledge, Model Editing | Code Available | 0 | 5 |
| Survey on Abstractive Text Summarization: Dataset, Models, and Metrics | Dec 22, 2024 | Abstractive Text Summarization, General Knowledge | Code Available | 0 | 5 |
| Task-Driven and Experience-Based Question Answering Corpus for In-Home Robot Application in the House3D Virtual Environment | Jun 1, 2022 | General Knowledge, Question Answering | Code Available | 0 | 5 |
| Test-Time Self-Adaptive Small Language Models for Question Answering | Oct 20, 2023 | General Knowledge, Question Answering | Code Available | 0 | 5 |
| Towards Knowledge-Augmented Visual Question Answering | Dec 1, 2020 | General Knowledge, Graph Attention | Code Available | 0 | 5 |
| Unveiling Causal Reasoning in Large Language Models: Reality or Mirage? | Jun 26, 2025 | counterfactual, General Knowledge | Code Available | 0 | 5 |
| Visual Question Answering: A Survey of Methods and Datasets | Jul 20, 2016 | General Knowledge, Survey | Code Available | 0 | 5 |
| What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge | Dec 31, 2019 | General Knowledge, Knowledge Graphs | Code Available | 0 | 5 |
| What Makes Cryptic Crosswords Challenging for LLMs? | Dec 12, 2024 | General Knowledge | Code Available | 0 | 5 |
| WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models | Jul 25, 2022 | Common Sense Reasoning, General Knowledge | Code Available | 0 | 5 |
| World Knowledge in Multiple Choice Reading Comprehension | Nov 13, 2022 | General Knowledge, Multiple-choice | Code Available | 0 | 5 |
| Dobby: A Conversational Service Robot Driven by GPT-4 | Oct 10, 2023 | AI Agent, Decision Making | Unverified | 0 | 0 |
| Towards Few-shot Out-of-Distribution Detection | Nov 20, 2023 | General Knowledge, Out-of-Distribution Detection | Unverified | 0 | 0 |
| How to Complete Domain Tuning while Keeping General Ability in LLM: Adaptive Layer-wise and Element-wise Regularization | Jan 23, 2025 | General Knowledge | Unverified | 0 | 0 |
| Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs | Apr 29, 2024 | Diagnostic, General Knowledge | Unverified | 0 | 0 |
| DKT: Diverse Knowledge Transfer Transformer for Class Incremental Learning | Jan 1, 2023 | class-incremental learning, Class Incremental Learning | Unverified | 0 | 0 |
| Igea: a Decoder-Only Language Model for Biomedical Text Generation in Italian | Jul 8, 2024 | Computational Efficiency, Decoder | Unverified | 0 | 0 |
| Image Captioning and Visual Question Answering Based on Attributes and External Knowledge | Mar 9, 2016 | General Knowledge, Image Captioning | Unverified | 0 | 0 |
| Distributed Fine-tuning of Language Models on Private Data | Jan 1, 2018 | General Knowledge, Language Modeling | Unverified | 0 | 0 |
| Disentangling Knowledge-based and Visual Reasoning by Question Decomposition in KB-VQA | Jun 27, 2024 | General Knowledge, Question Answering | Unverified | 0 | 0 |
| What's a Good Prediction? Challenges in evaluating an agent's knowledge | Jan 23, 2020 | Continual Learning, General Knowledge | Unverified | 0 | 0 |
| Improving Multi-label Emotion Classification by Integrating both General and Domain-specific Knowledge | Nov 1, 2019 | Emotion Classification, General Classification | Unverified | 0 | 0 |
| Towards Ontology Reshaping for KG Generation with User-in-the-Loop: Applied to Bosch Welding | Sep 22, 2022 | General Knowledge, Knowledge Graphs | Unverified | 0 | 0 |
| INCPrompt: Task-Aware incremental Prompting for Rehearsal-Free Class-incremental Learning | Jan 22, 2024 | class-incremental learning, Class Incremental Learning | Unverified | 0 | 0 |
| Inductive Graph Alignment Prompt: Bridging the Gap between Graph Pre-training and Inductive Fine-tuning From Spectral Perspective | Feb 21, 2024 | General Knowledge, Graph Classification | Unverified | 0 | 0 |
| A new algorithm for Subgroup Set Discovery based on Information Gain | Jul 26, 2023 | General Knowledge, Subgroup Discovery | Unverified | 0 | 0 |
| Insect-Foundation: A Foundation Model and Large Multimodal Dataset for Vision-Language Insect Understanding | Feb 14, 2025 | General Knowledge, Question Answering | Unverified | 0 | 0 |
| Acquiring Knowledge from Pre-trained Model to Neural Machine Translation | Dec 4, 2019 | General Knowledge, Knowledge Distillation | Unverified | 0 | 0 |
| Integration of Imitation Learning using GAIL and Reinforcement Learning using Task-achievement Rewards via Probabilistic Graphical Model | Jul 3, 2019 | General Knowledge, Imitation Learning | Unverified | 0 | 0 |
| Intelligent Conversational Bot for Massive Online Open Courses (MOOCs) | Jan 26, 2016 | General Knowledge, speech-recognition | Unverified | 0 | 0 |
| Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era | Jun 11, 2025 | General Knowledge | Unverified | 0 | 0 |
| Investigating Forgetting in Pre-Trained Representations Through Continual Learning | May 10, 2023 | Continual Learning, General Knowledge | Unverified | 0 | 0 |
| Investigating Pre-trained Language Models on Cross-Domain Datasets, a Step Closer to General AI | Jun 21, 2023 | General Knowledge, ListOps | Unverified | 0 | 0 |
| AcademicGPT: Empowering Academic Research | Nov 21, 2023 | Abstract generation, General Knowledge | Unverified | 0 | 0 |
| Joint Embedding Learning of Educational Knowledge Graphs | Nov 20, 2019 | General Knowledge, graph construction | Unverified | 0 | 0 |
| Juru: Legal Brazilian Large Language Model from Reputable Sources | Mar 26, 2024 | General Knowledge, Language Modeling | Unverified | 0 | 0 |
| KAER: A Knowledge Augmented Pre-Trained Language Model for Entity Resolution | Jan 12, 2023 | Entity Resolution, General Knowledge | Unverified | 0 | 0 |
| KALA: Knowledge-Augmented Language Model Adaptation | Nov 16, 2021 | Domain Adaptation, General Knowledge | Unverified | 0 | 0 |
| DiPrompT: Disentangled Prompt Tuning for Multiple Latent Domain Generalization in Federated Learning | Mar 11, 2024 | Domain Generalization, Federated Learning | Unverified | 0 | 0 |