| I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search | Feb 20, 2025 | AutoMLCode Generation | CodeCode Available | 1 |
| HPS: Hard Preference Sampling for Human Preference Alignment | Feb 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Prompt-to-Leaderboard | Feb 20, 2025 | ChatbotLanguage Modeling | CodeCode Available | 3 |
| Rapid Word Learning Through Meta In-Context Learning | Feb 20, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators | Feb 20, 2025 | BenchmarkingCode Generation | CodeCode Available | 2 |
| Generative adversarial networks vs large language models: a comparative study on synthetic tabular data generation | Feb 20, 2025 | Generative Adversarial NetworkLanguage Modeling | CodeCode Available | 0 |
| SR-LLM: Rethinking the Structured Representation in Large Language Model | Feb 20, 2025 | Abstract Meaning RepresentationLanguage Modeling | —Unverified | 0 |
| Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems | Feb 20, 2025 | BenchmarkingDecision Making | —Unverified | 0 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 |
| STeCa: Step-level Trajectory Calibration for LLM Agent Learning | Feb 20, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 1 |