| MobileAgentBench: An Efficient and User-Friendly Benchmark for Mobile LLM Agents | Jun 12, 2024 | Benchmarking, Language Modeling | Unverified | 0 |
| Multimodal Table Understanding | Jun 12, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition | Jun 12, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Real2Code: Reconstruct Articulated Objects via Code Generation | Jun 12, 2024 | Code Generation, Image Segmentation | Unverified | 0 |
| A Study of Backdoors in Instruction Fine-tuned Language Models | Jun 12, 2024 | Data Poisoning, Language Modelling | Unverified | 0 |
| Large Language Model Unlearning via Embedding-Corrupted Prompts | Jun 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Markov Constraint as Large Language Model Surrogate | Jun 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Flextron: Many-in-One Flexible Large Language Model | Jun 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model-empowered multimodal strain sensory system for shape recognition, monitoring, and human interaction of tensegrity | Jun 11, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| Beyond Words: On Large Language Models Actionability in Mission-Critical Risk Analysis | Jun 11, 2024 | Hallucination, Language Modelling | Unverified | 0 |