| Do Large Language Model Benchmarks Test Reliability? | Feb 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Learning a Structural Causal Model for Intuition Reasoning in Conversation | May 28, 2023 | Causal DiscoveryLanguage Modelling | CodeCode Available | 1 | 5 |
| Large Language Model Unlearning via Embedding-Corrupted Prompts | Jun 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 1 | 5 |
| Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning | Feb 9, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 | 5 |
| Foundation Models Meet Imbalanced Single-Cell Data When Learning Cell Type Annotations | Oct 27, 2023 | Cell Entity Annotationimbalanced classification | CodeCode Available | 1 | 5 |
| Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers | Jul 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection | Dec 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LLM-SAP: Large Language Models Situational Awareness Based Planning | Dec 26, 2023 | Decision MakingLanguage Modelling | CodeCode Available | 1 | 5 |
| Large Language Model Unlearning | Oct 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |