| Tree Search for Language Model Agents | Jul 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model | Jun 28, 2024 | Interactive SegmentationLanguage Modeling | CodeCode Available | 3 |
| APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts | Jun 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models | Jun 19, 2024 | GPULanguage Modeling | CodeCode Available | 3 |
| AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| CarLLaVA: Vision language models for camera-only closed-loop driving | Jun 14, 2024 | Autonomous DrivingBench2Drive | CodeCode Available | 3 |
| Multimodal Table Understanding | Jun 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Language Model Council: Democratically Benchmarking Foundation Models on Highly Subjective Tasks | Jun 12, 2024 | BenchmarkingChatbot | CodeCode Available | 3 |
| A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback Learning | Jun 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| MeshXL: Neural Coordinate Field for Generative 3D Foundation Models | May 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |