| Compositional Chain-of-Thought Prompting for Large Multimodal Models | Nov 27, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 1 |
| ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts | May 15, 2025 | Continual LearningLanguage Modeling | CodeCode Available | 1 |
| I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search | Feb 20, 2025 | AutoMLCode Generation | CodeCode Available | 1 |
| Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective | Jun 22, 2025 | In-Context LearningLarge Language Model | CodeCode Available | 1 |
| Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication | Dec 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Inference with Reference: Lossless Acceleration of Large Language Models | Apr 10, 2023 | DecoderLanguage Modeling | CodeCode Available | 1 |
| InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model | Mar 4, 2025 | es-enLanguage Modeling | CodeCode Available | 1 |
| Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers | Apr 25, 2025 | Large Language Model | CodeCode Available | 1 |
| Evaluation of large language models for discovery of gene set function | Sep 7, 2023 | BenchmarkingLanguage Modelling | CodeCode Available | 1 |