| AuditWen: An Open-Source Large Language Model for Audit | Oct 9, 2024 | Answer Generation, Language Modeling | Code Available | 1 |
| EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements | Jun 10, 2025 | Binary Classification, Financial Analysis | Code Available | 1 |
| CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models | Aug 19, 2024 | Diversity, Language Modeling | Code Available | 1 |
| GIST: Generating Image-Specific Text for Fine-grained Object Classification | Jul 21, 2023 | Classification, Fine-Grained Image Classification | Code Available | 1 |
| Generative News Recommendation | Mar 6, 2024 | Articles, Language Modelling | Code Available | 1 |
| Gandalf the Red: Adaptive Security for LLMs | Jan 14, 2025 | Blocking, Language Modeling | Code Available | 1 |
| GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation | Oct 15, 2024 | Explainable Recommendation, Language Modelling | Code Available | 1 |
| FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data | Aug 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| FuzzCoder: Byte-level Fuzzing Test via Large Language Model | Sep 3, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| GCoder: Improving Large Language Model for Generalized Graph Problem Solving | Oct 24, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| From Text to Pixel: Advancing Long-Context Understanding in MLLMs | May 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Code4Struct: Code Generation for Few-Shot Event Structure Prediction | Oct 23, 2022 | Code Generation, Event Argument Extraction | Code Available | 1 |
| On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents | Aug 2, 2024 | Code Generation, Large Language Model | Code Available | 1 |
| FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions | May 28, 2023 | Attribute, Image Captioning | Code Available | 1 |
| Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction | Feb 29, 2024 | image-classification, Image Classification | Code Available | 1 |
| GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching | Jun 25, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification | Nov 10, 2023 | image-classification, Image Classification | Code Available | 1 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary Classification, Language Modeling | Code Available | 1 |
| Foundation Models Meet Imbalanced Single-Cell Data When Learning Cell Type Annotations | Oct 27, 2023 | Cell Entity Annotation, imbalanced classification | Code Available | 1 |
| Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit | Aug 19, 2024 | Decoder, Language Modeling | Code Available | 1 |
| RARR: Researching and Revising What Language Models Say, Using Language Models | Oct 17, 2022 | Few-Shot Learning, Language Modeling | Code Available | 1 |
| AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge | Sep 11, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| A Large Language Model Enhanced Sequential Recommender for Joint Video and Comment Recommendation | Mar 20, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Agentic Feedback Loop Modeling Improves Recommendation and User Simulation | Oct 26, 2024 | Large Language Model, User Simulation | Code Available | 1 |
| Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning | Jul 5, 2023 | Decoder, Language Modelling | Code Available | 1 |