| Unlocking Video-LLM via Agent-of-Thoughts Distillation | Dec 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Medchain: Bridging the Gap Between LLM Agents and Clinical Practice through Interactive Sequential Benchmarking | Dec 2, 2024 | BenchmarkingDecision Making | —Unverified | 0 |
| HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing | Dec 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Yi-Lightning Technical Report | Dec 2, 2024 | ChatbotLarge Language Model | —Unverified | 0 |
| FD-LLM: Large Language Model for Fault Diagnosis of Machines | Dec 2, 2024 | Fault DetectionFault Diagnosis | —Unverified | 0 |
| WAFFLE: Multimodal Floorplan Understanding in the Wild | Dec 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatSplat: 3D Conversational Gaussian Splatting | Dec 1, 2024 | Large Language ModelScene Understanding | —Unverified | 0 |
| ARChef: An iOS-Based Augmented Reality Cooking Assistant Powered by Multimodal Gemini LLM | Dec 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Free and Customizable Code Documentation with LLMs: A Fine-Tuning Approach | Dec 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models | Nov 30, 2024 | Large Language Model | —Unverified | 0 |