| Can Large Language Models be Good Path Planners? A Benchmark and Investigation on Spatial-temporal Reasoning | Oct 5, 2023 | NavigateSpatial Reasoning | CodeCode Available | 1 | 5 |
| A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks | Dec 20, 2023 | Model SelectionNavigate | CodeCode Available | 1 | 5 |
| ONCE: Boosting Content-based Recommendation with Both Open- and Closed-source Large Language Models | May 11, 2023 | NavigateNews Generation | CodeCode Available | 1 | 5 |
| AEye: A Visualization Tool for Image Datasets | Aug 7, 2024 | Navigate | CodeCode Available | 1 | 5 |
| Evaluating Language Models for Mathematics through Interactions | Jun 2, 2023 | Language ModellingMathematical Problem-Solving | CodeCode Available | 1 | 5 |
| Can GPT-4 Perform Neural Architecture Search? | Apr 21, 2023 | NavigateNeural Architecture Search | CodeCode Available | 1 | 5 |
| Evaluating Long-Term Memory in 3D Mazes | Oct 24, 2022 | Navigatereinforcement-learning | CodeCode Available | 1 | 5 |
| Expander Graph Propagation | Oct 6, 2022 | Graph ClassificationGraph Representation Learning | CodeCode Available | 1 | 5 |
| Aerial Vision-and-Dialog Navigation | May 24, 2022 | Navigate | CodeCode Available | 1 | 5 |
| BioImage.IO Chatbot: A Community-Driven AI Assistant for Integrative Computational Bioimaging | Oct 23, 2023 | ChatbotInformation Retrieval | CodeCode Available | 1 | 5 |